用户名: 密码: 验证码:
基于ARIA的K均值聚类算法研究
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Research on Kmeans Clustering Algorithm Based on ARIA
  • 作者:王雷 ; 刘小芳 ; 赵良军
  • 英文作者:WANG Lei;LIU Xiaofang;ZHAO Liangjun;School of Automation and Information Engineering,Sichuan University of Science &Engineering;School of Computer Science,Sichuan University of Science &Engineering;
  • 关键词:聚类分析 ; 局部最优 ; 自适应半径免疫算法 ; K均值聚类算法 ; 聚类中心 ; 优化
  • 英文关键词:clustering analysis;;local optimum;;adaptive radius immune algorithm;;Kmeans clustering algorithm;;clustering center;;optimization
  • 中文刊名:SCQX
  • 英文刊名:Journal of Sichuan University of Science & Engineering(Natural Science Edition)
  • 机构:四川轻化工大学自动化与信息工程学院;四川轻化工大学计算机学院;
  • 出版日期:2019-04-20
  • 出版单位:四川理工学院学报(自然科学版)
  • 年:2019
  • 期:v.32;No.150
  • 基金:四川省科技计划项目(2017GZ0303);; 四川理工学院人才引进项目(2018RCL21)
  • 语种:中文;
  • 页:SCQX201902010
  • 页数:6
  • CN:02
  • ISSN:51-1687/N
  • 分类号:70-75
摘要
针对传统K均值聚类算法对初始聚类中心敏感,易陷入局部最优和对大数据集聚类速度慢的缺点,将ARIA与Kmeans算法相结合,提出了一种ARIA-Kmeans算法,即基于自适应半径免疫的K均值聚类算法。首先利用自适应半径免疫算法对数据进行预处理,产生能够代表原始数据分布以及密度信息的内部镜像数据;然后用K均值聚类算法对其进行多次聚类,获得最佳聚类中心,并将其作为初始聚类中心,推广到全部数据优化聚类效果;最后对其结果进行评价。实验结果表明,相对于传统Kmeans算法,新算法在保证聚类准确度的前提下,提高了算法运行的时间效率和稳定性。
        Considering the shortcomings of traditional Kmeans algorithm,which is sensitive to initial clustering center and easy to fall into local optimization,an ARIA-Kmeans algorithm is proposed by an idea that combines adaptive radius algorithm( ARIA) with Kmeans clustering algorithm,which is called Kmeans clustering algorithm based on ARIA. Firstly,the adaptive radius immune algorithm is used to preprocess the data to generate internal images data that can represent the original data distribution and density information. Then,the Kmeans clustering algorithm is used to cluster the internal images data several times,and the obtained best center is taken as the initial cluster center,which is extended to all data to obtain the global optimal results. Finally,the results are estimated by corresponding indexes. Experimental results show that the new algorithm achieves better results than the traditional Kmeans algorithm in terms of both efficiency and stability,while ensuring the accuracy of clustering.
引文
[1]曹跃,王雅琳,何海明,等.Canopy-Kmeans聚类和组合优化的铁矿预配料智能调度[J].控制理论与应用,2017,34(7):947-955.
    [2]刘倩颖,阮应君,时翔,等.基于kmeans聚类与BP神经网络算法的办公建筑逐时电负荷预测[J].热能动力工程,2018,33(3):138-144.
    [3]王亚涛,王新珩,董育宁,等.基于Kmeans和动态WKNN的两层Wi-Fi改进定位方法[J].南京邮电大学学报:自然科学版,2017,37(5):41-47.
    [4]洪月华.基于MPI蜂群K均值聚类算法并行化计算[J].计算机工程与设计,2017,38(12):3339-3343.
    [5]廖伍代,朱范炳,王海泉,等.基于人工蜂群优化的K均值聚类算法[J].计算机测量与控制,2018,26(4):136-138.
    [6]CHEN Z,TANG T H.Coding technology in Galileo system based on chaotic spreading spectrum CDMA[J].Information&Electric Engineering,2010,8(2):25-31.
    [7]LEE J H,SHIM D S.Fast acquisition of GPS L5 PRNand NH code using L1 signal for software receivers[J].International Journal of Control Automation&System,2016,14(4):1-7.
    [8]DIESPOSTI R S.Global Position System(GPS)user receiver and geometric surface processing for all-in-view coherent GPS signal PRN codes acquisition and navigation solution:US,7688261B2[P].2010-03-30.
    [9]BEZERRA G B,BARRA T V,CASTRO L N D,et al.Adaptive radius immune algorithm for data clustering[C]//Proceeding of 4th International Conference on Artificial Immune System(ICARIS 2005),Banff,Alberta,Canada,August 14-17,2005:290-303.
    [10]JERNE N K.Towards a network theory of the immune network for data analysis[J].Data mining:a heuristic approach,2001,125C(2):373-383.
    [11]CASTRO L N D,ZUBEN F J V.aiNet:An artificial immune network for data analysis[J].Data Mining A Heuristic Approach,2002,40(11):1641-1645.
    [12]FRANCA F O D,COELHO G P,CASTRO P A D.Conceptual and practical aspects of the aiNet family of algorithms[J].International Journal of Natural Computing Research,2010,1(1):1-35.
    [13]HAN J W,KAMBER M,PEI J.数据挖掘概念与技术[M].3版.范明,孟小峰,译.北京:机械工业出版社,2012.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700