用户名: 密码: 验证码:
条件区间分位数超高维特征筛选研究
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Feature Screening for Ultrahigh Dimensional Data Based on the Conditional Interval Quantile
  • 作者:来鹏 ; 张洁 ; 季静雯
  • 英文作者:LAI Peng;ZHANG Jie;JI Jingwen;School of Mathematics and Statistics,Nanjing University of Information Science and Technology;
  • 关键词:超高维 ; 特征筛选 ; 区间分位数 ; 确定性筛选性质
  • 英文关键词:ultrahigh dimension;;feature screening;;interval quantile;;sure screening property
  • 中文刊名:ZZDZ
  • 英文刊名:Journal of Zhengzhou University(Natural Science Edition)
  • 机构:南京信息工程大学数学与统计学院;
  • 出版日期:2018-05-02 11:58
  • 出版单位:郑州大学学报(理学版)
  • 年:2019
  • 期:v.51
  • 基金:国家自然科学基金项目(11771215);; 江苏省自然科学基金项目(BK20161530);; 江苏省“青蓝工程”项目(2016);; 国家社科基金重大项目(16ZDA047)
  • 语种:中文;
  • 页:ZZDZ201901008
  • 页数:5
  • CN:01
  • ISSN:41-1338/N
  • 分类号:42-46
摘要
超高维数据下的特征筛选是模型降维建模的重要环节.基于条件分位数的改进超高维特征筛选方法在给定分位点有扰动情况下可能会导致筛选变量不稳定,针对该问题,引入全局条件分位数的思想,提出基于条件区间分位数的超高维特征筛选方法,并通过理论及数值模拟证明其特征筛选的确定性独立筛选性质和所提方法的有限样本性质.
        Feature screening was an important step for model dimension reduction of ultrahigh dimensional data. Focusing on this problem,to tackle the instability of the feature screening procedure based on the conditional quantile technique when the given quantile values had small disturbance,one global quantile technique was introduced. The generalized feature screening procedure based on the conditional interval quantile was proposed. The theoretical proof and numerical simulations were completed to prove the proposed screening procedure could processe the sure screening property and,showed its finite sample properties.
引文
[1] TIBSHIRANI R. Regression shrinkage and selection via the lasso[J]. Journal of the royal statistical society:series B,1996,58(3):267-288.
    [2] FAN J Q,LI R Z. Variable selection via nonconcave penalized likelihood and its oracle properties[J]. Journal of the American statistical association,2001,96(456):1348-1360.
    [3] ZOU H. The adaptive lasso and its oracle properties[J]. Journal of the American statistical association,2006,101(476):1418-1429.
    [4] FAN J Q,LV J C. Sure independence screening for ultrahigh dimensional feature space[J]. Journal of the royal statistical society:series B,2008,70(5):849-911.
    [5] FAN J Q,SONG R. Sure independence screening in generalized linear models with NP-dimensionality[J]. The annals of statistics,2010,38(6):3567-3604.
    [6] ZHU L P,LI L X,LI R Z,et al. Model-free feature screening for ultrahigh-dimensional data[J]. Journal of the American statistical association,2011,106(496):1464-1475.
    [7] LI R Z,ZHONG W,ZHU L P. Feature screening via distance correlation learning[J]. Journal of the American statistical association,2012,107(499):1129-1139.
    [8] WU Y S,YIN G S. Conditional quantile screening in ultrahigh-dimensional heterogeneous data[J]. Biometrika,2015,102(1):65-76.
    [9]脱倩娟,赵红.基于局部邻域嵌入的无监督特征选择[J].郑州大学学报(理学版),2016,48(3):57-62.
    [10] KOENKER R,BASSETT G. Regression quantiles[J]. Econometrica:journal of the econometric society,1978,46(1):33-50.
    [11] ZHENG Q,PENG L M,HE X M. Globally adaptive quantile regression with ultra-high dimensional data[J]. Annals of statistics,2015,43(5):2225-2258.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700