用户名: 密码: 验证码:
基于支持向量机的空间数据挖掘方法及其在旅游地理经济分析中的应用
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
本论文根据旅游地理经济分析预测管理需求,就基于支持向量机的空间数据挖掘分类或回归的理论与方法以及在旅游地理经济管理中的应用展开了系统研究,建立了基于支持向量机的空间数据分类或回归挖掘模型与算法、特征选择算法,设计实现了面向旅游地理经济应用的数据挖掘软件。主要内容如下:
     (1)提出了SVM若干算法。通过组合优化方法和最小二乘方法,以及多分类支持向量机方法,提出了MC-COLS-SVM分类机算法;通过组合优化方法,以及减少约束,降低问题复杂度,提出了组合优化COLS-BSVR回归机算法。提出了对于支持向量回归机特征选择的算法,并进行了实证分析。
     (2)构建了基于支持向量机的空间数据挖掘理论与方法体系。研究设计了基于支持向量机的空间数据挖掘工作流程与框架以及实现方法;基于MC-COLS-SVM多分类组合优化思路,设计了空间数据分类算法;基于COLS-BSVR组合优化最小二乘支持向量回归机思路,设计了空间数据回归算法。
     (3)提出了时政指数、景区景点分布指数,并成功应用于旅游地理经济分析之中。通过对旅游收入、游客人数、时政指数、景区景点分布指数、GDP、CPI等变量时间序列的统计描述分析及其它们对旅游地理经济影响分析,提取了旅游地理经济数据特征,设计了相应的旅游地理经济数据库。
     (4)建立了基于支持向量机的旅游地理经济预测模型。基于提出的COLS-BSVR支持向量回归机算法,建立了基于支持向量机的旅游地理经济分析预测数学模型;设计了数据挖掘中的数据构造模式,验证了模型与模式的有效性。
     (5)建立了基于支持向量机的旅游地理经济风险管理模型。基于设计的空间数据分类算法、回归算法以及特征选择算法,结合旅游地理经济特征敏感性分析,建立了风险管理数学模型,验证了模型的有效性。
     (6)设计并实现了基于支持向量机的旅游地理经济数据挖掘软件。该数据挖掘软件分三层结构构建;各种数据采集预处理后,存入旅游地理经济数据库,通过基于支持向量回归机的算法运算,生成预测信息,供分析决策参考。
     该论文有图51幅,表44个,参考文献116篇。
This Dissertation is based on the economic characteristic of tourism geography,the needs for economic analysis management of tourism geography. Using the theoryand method of classification or regression algorithms for spatial data mining which isthe application of support vector machine, systematic researching on the spatial datamining method of support vector machine and its application on tourism geographyeconomy, some innovation achievement have been accomplished, listed below:
     (1) Define severral SVM algorithms. Using combinatorial optimization and leastsquare method, and Multi-class Support Vector Machine method, theMC-COLS-SVM classification algorithm has been raised; using combinatorialoptimization method, reducing limitation and complexity of problems, thecombinatorial optimization COLS-BSVR regression algorithm has been raised.Coming up with the feature selection algorithm of support vector regression, and alsoconducted the confirmed analysis.
     (2) Applying the SVM classification and regression method and theory ontospatial data mining, the spatial data mining theory and method system has beenconstructed. The working procedure and framework of spatial data mining have beendesigned and built, based on the supporting vector machine. Using MC-COLS-SVM(Multi-class Combinatorial Optimization Least Squares Support Vector Machine)optimization idea, the spatial data classification algorithm has been designed.Referring to the idea of COLS-BSVR (Combinatorial Optimization Least SquaresSupport Vector Regression), the spatial data regression algorithm has been designed.
     (3) Define the index of events and policies and the index of scenic areadistribution. They are widely used in the tourism geography economy. By means oftime series analysis and statistical analysis, focusing on the index of scenic areadistribution, the influence of GDP, CPI on tourism geography economy, and theimpact of events, policies, we have analyzed and extracted the characters of tourismgeography economy. The corresponding tourism geography economy database hasbeen built.
     (4) The design of the forecasting model of tourism geography economy. Bymeans of COLS-BSVR support vector regression algorithm, creatively built theanalytical forecasting math model which is based on supporting vector machine forthe tourism geography economy. The data structure mode in data mining has been designed; also the effectiveness of analytical forecasting math model has been proved.
     (5) The design of the risk management model of tourism geography economy.Referring to the MC-COLS-SVM algorithm, COLS-BSVR algorithm and featureselection algorithm, analyzing and extracting the risk characteristics of tourismgeography economy, creatively built the risk management math model, with itseffectiveness being verified.
     (6) The design and realization of the forecasting platform of tourism geographyeconomy. This platform has three parts. After collecting and preprocessing mass data,save them into tourism geography economy database, then generate forecastinginformation with the help of supporting vector regression algorithm. This informationwill be used for analysis and decision making.
     This paper contains51Figures,44tables,116references.
引文
[1]李德仁,王树良,李德毅.空间数据挖掘理论与应用[M].北京:科学出版社,2006.
    [2]邓乃扬,田英杰.数据挖掘中的新方法——支持向量机[M].北京:科学出版社,2004.
    [3]张学工.关于统计学习理论与支持向量机[J].自动化学报,2000,26(1):32-42.
    [4] Cristianini N, Shawe-Taylor J.支持向量机导论[M].李国正,王猛,曾华军译.北京:电子工业出版社,2004.
    [5] Cortes C, Vapnik V. Support vector networks [J]. Machines Learning,1995,20(3):273-279.
    [6] Vapnik V, Golowich S, Smola A. Support Vector method for function approximation,regression estimation, and signal processing [A]. In: Mozer M, Jordan M, Petsche T, eds.Advances in Neural Information Processing Systems[M]. Cambridge, MA: MIT Press,1997,281-287.
    [7] Mangasarian O L, David R M. Successive Overrelaxation for Support Vector Machines [J].IEEE Transactions on Neural Networks,1999,10:1032-1037.
    [8] Lee Y J, Mangasarian O L. SSVM: A smooth support vector machine for classification [J].Computational Optimization and Application,2001,20(1):5-22.
    [9] Mangasarian O L, David R M. Active Support Vector Machines Classification [J]. Advancesin Neural Information Processing Systems (NIPS2000),2000.
    [10] Mangasarian O L, David R M. Lagrangian Support Vector Machine [J]. Journal of MachineLearning Research, March2001,161-177.
    [11] Keerthi S S, Shevade S K, Bhattacharyya C, et al. A fast iterative nearest point algorithm forsupport vector machines classifier design [J]. IEEE Transactions on Neural Network,2000,11(1):124-136.
    [12] Cauwenberghs G, Poggio T. Incremental and decremental support vector machine [A]. In:Leen T, Dietterich T, Tresp V eds. Advances in neural Information Processing Systems13[M].Cambridge, MA: MIT Press,2001,409-415.
    [13] Syed N, Liu H, Sung K. Handing concept drifts in incremental learning with support vectormachines [A]. In: Proceedings of the First International Conference on Knowledge Discoveryand Data Mining[C]. San Diego,1999,317-321.
    [14]萧嵘,王继成,孙正兴等.一种增量学习算法-SVM[J].软件学报,2001,12(12):1818-1824.
    [15] Rychetsky M, Ortmann S, Ullmann M etal. Accelerated training of support vectormachines[A]. In: Proceedings of IEEE/INNS International Joint Conference on NeuralNetworks [C]. Washington,1999,998-1003.
    [16] Yang M H, Ahuja N.A geometric approach to train support vector machines [A]. In:Proceedings of IEEE Conference on Computer Vision and Pattern Recognition [C]. HiltonHead Island,2000,430-437.
    [17] Lee Y J, Mangasarian O L. RSVM: Reduced support vector machines. In: proceedings ofthe First SIAM International Conference on Data Mining,2001.
    [18] Takuyalnoue T, Abe S. Fuzzy support vector machines for Multi-class Pattern Recognition[A]. In: Proceedings of International Joint Conference on Neural Networks [C]. Washington,2001,1449-1455.
    [19] Smits G F, Jordan E M. Improving SVM regression using mixture of kernels [J]. IEEEproceedings of the2002international joint conference on neural networks,2002,3:2785-2790.
    [20] Suykens J A K, Vandewalle J. Least squares support vector machine classifiers [J]. NeuralProcessing Letters,1999,9:293-300.
    [21] Scholkopf B, Smola A, Williamson R C etal. New Support Vector Algorithms [J]. NeuralComputation,2000,12:1207-1245.
    [22] Tsang E C C, Yeung D S, Chan P P K. Fuzzy support vector machines for solving two-classproblems[A].In: Proceedings of the Second International Conference on Machine Learningand Cybernetics, Xi’an,2003,1:1080-1083.
    [23] Tsujinishi Daisuke, Abe Shigeo. Fuzzy least squares support vector machines for multiclassproblems [J]. Neural Networks,2003,16:785-792.
    [24] Hsu C, Lin C J. A simple decomposition method for support vector machines [J]. MachineLearning,2002,46:291-314.
    [25] David M J T, Robert P W D. Data Domain Description Using Support Vector [A].In:Proceedings of the European Symposium on Artificial Neural Networks[C], Bruges(Belgium),1999,251-256.
    [26] David M J T, Robert P W D. Support vector domain description [J].Pattern RecognitionLetters,1999,20:1191-1199.
    [27] Osuna E, Freund R, Girosi F. An improved training algorithm for support vectormachines[A]. In: Proceedings of the IEEE Workshop on Neural Network for SignalProcessing [C]. Amelia Island,1997,276-285.
    [28] Platt J C. Fast training of support vector machines using sequential minimal optimization[A]. In: Schokopf B, Burges C, Smola A, eds. Advances in Kernel Methods-Support VectorLearning[M]. Cambridge, MA: MIT Press,1999,185-208.
    [29] De Kruif B J, De Vries T. On using a support vector machine in learning feed forwardcontrol[A]. In: Proceedings of IEEE/ASME International Conference on AdvancedIntelligent Mecheatronics[C]. Como,2001,272-277.
    [30] Colin C. Algorithmic approaches to training support vector machines: a survey [A]. In:Proceedings of the Eighth European Symposium on Artificial Neural Networks[C]. Burges,2000,27-36.
    [31] Mangasarian O L. Generalized Support Vector Machines [A]. In: Smola A, Bartlett P L andSchokopf Betal. Advances in Large Margin Classifiers[M]. Cambridge, MA: MIT Press,2000,135-146.
    [32] Keerthi S S, Shevade S K, Bhattacharyya C, et al. Improvement to Platt SMO algorithm forSVM classifier design [J]. Neural Computation,2001,13:637-649.
    [33] Shevade S Keerthi S, Bhattacharyya C etal. Improvements to the SMO algorithm for SVMRegression[J]. IEEE Transactions on Neural Networks,2000,11(5):1188-1193.
    [34] Flake G, Lawrence S. Efficient SVM regression training with SMO [J]. Machine Lear-ning,2002,46(13):271-290.
    [35] Chang C, Lin C J. LIBSVM: a Library for Support Vector Machines (Version2.3)[EB/OL].http://www.csie.ntu.edu.tw/~cjlin/papers/libsvm.pdf,2001.
    [36] Francis E H, Cao L J. Applications of support vector machines in financial time series forecasting [J]. Omega: The International Journal of Management Science,2001,29(4):309-317.
    [37]沈翠华,邓乃扬,肖瑞彦.基于支持向量机的个人信用评估[J].计算机工程与应用,2004,30(23):198-199.
    [38]姚奕,叶中行.基于支持向量机的银行客户信用评估系统研究[J].系统仿真学报,2004,16(4):783-786.
    [39] Huang C-L, Chen M-C, Wang C-J. Credit scoring with a data mining approach based onSupport vector machines[J]. Expert Systems with Applications,2006,8(7):1-10.
    [40]刘广利,邓乃扬.基于SVM分类预警系统[J].中国农业大学学报,2002,7(6):97-100.
    [41]陆阳,王海燕,田娜.组合核函数支持向量机在水中目标识别中的应用[J].声学技术,2005,24(3):144-147.
    [42] Cai C-Z, Wang W-L, Chen Y-Z. Support vector machine classification of physical andbiological datasets [J].International Journal of Modern Physics C,2003,14(5):575-585.
    [43] Song J, Tang H W. Support Vector Machines for classification of homo-oligomeric proteinsby incorporating subsequence distributions [J]. Journal of Molecular Structure: THEOCHEM,2005(1-3),722:97-101.
    [44] Byun H, Lee S W. Applications of support vector machines for pattern recognition: a surcey[A]. In: Proceedings of the First International Workshop on Pattern Recognition with SupportMachines[C]. Niagara Falls,2002,213-236.
    [45]李德仁,王树良,李德毅,王新洲.论空间数据挖掘和知识发现的理论与方法[J].武汉大学学报(信息科学版),2002,27(3):221-233.
    [46]王新华,米飞,冯英春,赵玮.空间数据挖掘技术的研究现状与发展趋势[J].计算机应用研究,2009,26(07):2401-2403.
    [47]蒋良孝,蔡之华.空间数据挖掘的回顾与展望[J].计算机工程,2003,29(6):9-10.
    [48]王海起,王劲峰.空间数据挖掘技术研究进展[J].地理与地理信息科学,2005,21(04):6-10.
    [49]毛克彪,田庆久.空间数据挖掘技术方法及应用[J].遥感技术与应用,2002,17(04):198-204.
    [50]汪云甲.基于空间信息技术的矿区生态环境监测与治理研究进展与展望[A].周光召.加入WTO和中国科技与可持续发展——挑战与机遇、责任和对策(下册)[C].中国四川成都:中国科学技术出版社,2002.
    [51]陈述彭.空间数据挖掘的里程碑式力作——评《空间数据挖掘理论与应用》[J]科学通报,2007,52(21):2577.
    [52]王树良.空间数据挖掘进展[J].地理信息世界,2009,7(02):34-41.
    [53]王新洲.论空间数据处理与空间数据挖掘[J].武汉大学学报(信息科学版),2006,32(01):1-4.
    [54]胡彩平,秦小麟.空间数据挖掘研究综述[J].计算机科学,2007,34(05):14-19.
    [55]徐胜华,刘纪平,胡明远.空间数据挖掘与发展趋势探讨[J].地理与地理信息科学,2008,24(03):24-27.
    [56]李德仁,王树良,史文中.论空间数据挖掘和知识发现[J].武汉大学学报(信息科学版),2001,26(06):491-499.
    [57] Ester Martin. R-tree: A fully dynamic index structure for data warehouses. ProceedingsInternational Conference on Data Engineering[c].2000:379-388.
    [58]周成虎,张健挺.基于信息熵的地学空间数据挖掘模型[J].中国图象图形学报,1999,4(11):48-53.
    [59] Agrawal. Parallel mining of association rules[J]. IEEE Transactions on Knowledge and DataEngineering,1996,8(6):962-969.
    [60]程继华,施鹏飞.多层次关联规则的有效挖掘算法[J].软件学报,1998,9(12):58-62.
    [61]许龙飞,杨晓昀.KDD中广义关联规则发现技术研究[J].计算机工程与应用,1998,34(9):33-36.
    [62]闫志刚,杜培军,汪云甲.数据挖掘的SVM-RS方法[A].2007年中国智能自动化会议论文集[C].中国甘肃兰州,2007,756-762.
    [63]张策,臧淑英,金竺,张玉红.基于支持向量机的扎龙湿地遥感分类研究[J].湿地科学,2011,9(03):263-269.
    [64]员永生.基于支持向量机分类的面向对象土地覆被图像分类方法研究[D].陕西杨凌:西北农林科技大学,2010.
    [65]吴兆福,宫鹏,高飞,王侬.基于支持向量机的GPS似大地水准面拟合[J].测绘学报,2004,33(04):303-306.
    [66]杨敏.矿山数据挖掘的方法与模型研究[D].徐州:中国矿业大学,2007.6.
    [67]胡国杰,魏晓妹,蔡明科.混沌-支持向量机模型及其在地下水动态预报中的应用[J].西北农林科技大学学报(自然科学版),2011,39(2):229-234.
    [68]张真真,李智录,王科,李波.最小二乘支持向量机在大坝渗流监测中的应用[J].电网与水力发电进展,2008,24(2):65-68.
    [69]闫志刚.SVM及其在矿井突水信息处理中的应用研究[J].岩石力学与工程学报,2008,27(1):215.
    [70]谭琨,杜培军,郑辉.支持向量机在空间信息处理领域的应用研究[J].测绘科学,2007,32(02):87-91.
    [71] Li G, Song H, Witt S F. Time varying parameter and fixed parameter linear: an applicationto tourism demand forecasting[J].The International Journal of Forecasting,2006,36(22):57-71.
    [72]汪倩雯.国内旅游形象研究综述[J].云南地理环境研究.2008,20(7):122-126.
    [73]魏婧,潘秋玲.近20年国外旅游目的地市场营销研究综述[J].人文地理,2008,23(1):92-97.
    [74]任来玲,刘朝明.旅游需求预测方法文献述评[J].旅游学刊,2006,21(8):90-92.
    [75]胡华.层次分析法在旅游综合决策中的应用[J].宁夏大学学报自然科学版,2003,24(2):343-344.
    [76]宋枚枚,李海霞.层次分析法在旅游景区价值评价中的应用[J].中国科技信息,2008(,20):200-201.
    [77]潘娣,郑大宾,汪慧琴.AHP法在旅游节庆评估体系中的应用[J].武汉职业技术学院学报,2008,7(7):91-94.
    [78]李光金,谭林,杨刚.对四川旅游业投资环境评价分析[J].西南民族学院学报哲学社会科学版,2001,22(7):60-64.
    [79]文斌,吴健冰.桂林人文生态旅游资源分类及评价[J].广西右江民族师专学报,2006,19(4):90-93.
    [80]魏少琴,贾铁飞.杭州市旅游资源空间分析及其整合[J].旅游科学,2005,19(6):21-26.
    [81]王唏.旅游产业的结构分析、区位优势与发展预测——湖北省神农架林区案例[J].商场现代化,2006,1(上旬刊):267-268.
    [82]李艳娜,张国智.旅游环境容量的定量分析—以九寨沟为例[J].重庆商学院学报,2006,16(6):32-34.
    [83]葛洪朋.旅游企业竞争力评价分析[J].财务与金融,2008,(5):91-94.
    [84]陈焕炯,李翠文.旅游体验质量的测度方法构建[J].北方经贸,2007,(9):121-123.
    [85]李景宜.旅游系统市场竞争态及市场动态发展模型[J].经济地理,2002,增刊:219-222.
    [86]张友兰,周爱民,王新学.旅游预测模型及应用[J].河北省科学院学报,2000,17(5):86-89.
    [87]赵哲,尹怀庭.试论新开发旅游区的游客量预测分析[J].人文地理,2004,19(6):58-61.
    [88]张永庆,张冬冬.上海都市旅游发展潜力的综合评价分析[J].工业技术经济,2005,24(5):52-57.
    [89]张朝元,陈丽.基于LS-SVM的大理州入境游客流量时间序列预测[J].科学技术与工程,2008,8(10):5695-5697.
    [90]殷英,胡光华,邱宇青.基于统计学习理论的云南旅游需求预测与分析[J].云南大学学报(自然科学版),2004,26(增刊):23~26.
    [91]朱云涛,尹怡欣,杜军平.SVM增量算法及在旅游信息分类中的应用[J].计算机工程与设计,2007,28(2):700-703.
    [92]林溯.基于支持向量回归的旅游客流量预测[J].科技信息,2006,(12):254-257.
    [93]杨立勋,殷书炉.人工智能方法在旅游预测中的应用及评析.统计与信息论坛,2008,23(4):90-95.
    [94] Chen Kuan-Yu, Wang Cheng-Hua. Support vector regression with genetic algorithmsinforecasting tourism demand [J]. Tourism Management,2007,(28):215-226.
    [95]南剑飞,李蔚.基于灰色系统理论的旅游景区游客满意度评价研究[J].商业研究,2008,(12):46-49.
    [96]刘太安,汪云甲,李永峰,闫志刚.基于LS-BSVR的旅游地理经济预测应用[J].2011.32(12):4169-4172.
    [97] Hsu C, Lin C J.A comparison of methods for multi-class support vector machines [J]. IEEETransactions on Neural Networks,2002,13(2):415-425.
    [98] Platt J C, Cristianini N, Shawe-Taylor J. Large margin DGAs for multi-class classification
    [A]. In: Solla S, Leen T K, Muller K R, eds. Advances in Neural Information. ProcessingSystem12[M]. Cambridge, MA: MIT Press,2000,547-553.
    [99] Takahashi F, Abe S. Decision-Tree-Based Multi-Class Support Vector Machines [A]. In:Proceedings of the Ninth International Conference on Neural Information Proceedings [C].Singapore,2002,1418-1422.
    [100] Schwenker F, Palm G. Tree-structured support vector machines for Multi-class PatternRecognition[A]. In: Kittler J, Roli F, eds. Multiple classifier Systems[M]. Springer,2001,409-417.
    [101] Cheong S, Oh S H, Lee S Y. Support vector machines with binary tree architecture forMulti-class classification[J]. Neural Information Proceedings,2004,2(3):47-51.
    [102] Vapnik, V. N. Book review: the nature of statistical learning theory [J]. Technometrics,1996,38(4):400-406.
    [103] Schwenker F. Hierarchical Support Vector Machines for Multi-class PatternRecognition[A].In:Proceedings of the Fourth International Conference on Knowledge-basedIntelligent Engineering System&Allied Technologies Information proceedings[C].Chennai,2000:561-565.
    [104]曾绍华.LS-SVM的组合优化算法研究[J].计算机工程与应用,2007,43(22):89-92.
    [105] Bradley P S, Mangasarian O L. Feature Selection via Concave Minimization and SupportVector Machine[J]. In: Proceedings of the Fifteenth International Conference on MachineLearning (ICML98),1998,801-807
    [106] Yves Grandvalet, Stephane Canu. Adaptive Scaling for Feature Selection in SVMs[J]. In:Advances in Neural Information Proceedings Systems15, MIT Press,2003.
    [107] Weston J,Mukherjee S,Chapelle O,et al. Feature selection for SVMs.Advances in NeuralInformation Processing system13. MIT Press,2000.
    [108] Chapelle O,Vapnik V,Bousquet O,et al. Choosing multiple para-meters for support vectormachines.Machine Learning,2002,461,46(1):131-159.
    [109]刘太安,杨柏翠,刘欣颖,基于特征选择的最少核分类器研究[J].计算机工程与应用,2007,43(16):169-171.
    [110]泰安市统计局.泰安统计年鉴[M].泰安,2011.6.
    [111]王妍.相空间重构、分叉及经济系统吸引子分析[D].西北工业大学,2007.
    [112]张强,李立华.基于相空间重构技术的金融系统混沌识别[J].经济数学,2011,28(2):40-43.
    [113]熊天安,刘邦兵,雷畅.相空间重构理论支持下的滑坡预测方法[J].地理空间信息,2011,6(3):162-164.
    [114]王振朝,赵宇茜,赵晨.在重构相空间选取样本的时间序列分形预测[J].计算机工程与应用.2011,47(21):126-129.
    [115]吴晋峰,包浩生.旅游系统的空间结构模式研究[J].地理科学,2002,22(1):96-101.
    [116]许谨良.风险管理[M].上海:上海财经大学出版社,2011.4.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700