用户名: 密码: 验证码:
数据缺失机制识别联合模型及评价
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Combined Model of Data Missing Mechanism Recognition and Its Evaluation
  • 作者:岳廷妍 ; 邱建青 ; 裴姣 ; 张韬
  • 英文作者:Yue Tingyan;Qiu Jianqing;Pei Jiao;Zhang Tao;West China School of Public Health/West China Fourth Hospital,Sichuan University;Institute of Sichuan Cancer Hospital/Sichuan Cancer Prevention & Control Center;School of Medicine,University of Electronic Science and Technology of China;
  • 关键词:缺失机制 ; 缺失比例 ; 联合模型 ; Little ; MCAR法 ; Logit响应模型法
  • 英文关键词:missing mechanism;;missing proportions;;combined model;;Little MCAR method;;Logit response model method
  • 中文刊名:统计与决策
  • 英文刊名:Statistics & Decision
  • 机构:四川大学华西公共卫生学院/四川大学华西第四医院;四川省肿瘤医院研究所/四川省癌症防治中心;电子科技大学医学院;
  • 出版日期:2019-08-09 17:04
  • 出版单位:统计与决策
  • 年:2019
  • 期:16
  • 基金:国家自然科学基金青年项目(81602935);; 四川大学青年教师科研启动基金(2016SCU11006)
  • 语种:中文;
  • 页:73-76
  • 页数:4
  • CN:42-1009/C
  • ISSN:1002-6487
  • 分类号:O212
摘要
文章提出了数据缺失机制识别联合模型,并运用R 3.4.1软件、采用Bootstrap法重复模拟对所提出的联合模型在不同缺失机制、不同缺失比例下的识别效果进行评价。从重复模拟结果可知,联合模型在不同缺失比例下对完全随机缺失(MCAR)机制的识别效果较好(正确识别率为94.79%~95.29%),对随机缺失(MAR)机制的识别效果尚可(正确识别率为77.64%~78.72%)。联合模型在两种缺失机制下在各缺失比例下的正确识别率均较为稳健。
        This paper proposes a data missing mechanism recognition combined model, and uses R 3.4.1 software and Bootstrap method to duplicate the simulation to evaluate the recognition effect of the combined model under different missing mechanisms and different missing proportions. The repeated simulation results show that under different missing proportion, the combined model has better recognition effect on the missing completely at random(MCAR) mechanism(correct recognition rate:94.79%~95.29%), and the recognition effect for the missing at random(MAR) mechanism is also acceptable(correct recognition rate:77.64%~78.72%). In the two missing mechanisms, the combined model is robust to the missing proportion.
引文
[1]Netten A P, Dekker F W, Rieffe C, et al. Missing Data in the Field of Otorhinolaryngology and Head&Neck Surgery:Need for Improvement[J]. Ear&Hearing, 2017, 38(1).
    [2]Beaulieu-Jones B K, Lavage D R, Snyder J W, et al. Characterizing and Managing Missing Structured Data in Electronic Health Records:Data Analysis[J]. Jmir Medical Informatics, 2018, 6(1).
    [3]沈琳,陈千红,谭红专.缺失数据的识别与处理[J].中南大学学报(医学版), 2013, 38(12).
    [4]周静,周正松,高旸等.神经网络模型应用于数据缺失机制识别的可行性分析[J].现代预防医学, 2017,44(21).
    [5]Li J, Yu Y. A Nonparametric Test of Missing Completely at Random for Incomplete Multivariate Data[J]. Psychometrika, 2015, 80(3).
    [6]邱建青,杜春霖,周婷等.多变量数据缺失机制的识别方法[J].中国卫生统计, 2017,(6).
    [7]Rubin D B. Inference and Missing Data[J]. Biometrika, 1976, 63(3).
    [8]孙婕,金勇进,戴明锋.关于数据缺失机制的检验方法探讨[J].数学的实践与认识, 2013, 43(12).
    [9]李春林,高玉鹏,李圣瑜.不完全数据多重插补的Bootstrap方差估计[J].统计与决策, 2017,(18).

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700