用户名: 密码: 验证码:
Sentiment Classification with Graph Sparsity Regularization
详细信息    查看全文
  • 作者:Xin-Yu Dai (14)
    Chuan Cheng (14)
    Shujian Huang (14)
    Jiajun Chen (14)

    14. National Key Laboratory for Novel Software Technology
    ; Nanjing University ; Nanjing ; 210023 ; China
  • 关键词:Text Graph Representation ; Graph Regularization ; Sentiment Classification
  • 刊名:Lecture Notes in Computer Science
  • 出版年:2015
  • 出版时间:2015
  • 年:2015
  • 卷:9042
  • 期:1
  • 页码:140-151
  • 全文大小:1,778 KB
  • 参考文献:1. Duric, A., Song, F.: Feature Selection for Sentiment Analysis Based on Content and Syntax Models. In: Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, ACL-HLT (2011)
    2. Socher, R., Pennington, J., Huang, E.H., Andrew, Y.N., Manning, C.D.: Semi-supervised recursive autoencoders for predicting sentiment distributions. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2011)
    3. Xia, R., Zong, C.Q., Li, S.S. (2011) Ensemble of feature sets and classification algorithms for sentiment classification. Information Sciences 181: pp. 1138-1152 CrossRef
    4. Das, S.R., Chen, M.Y. (2007) Yahoo! for Amazon: Sentiment extraction from small talk on the Web. Management Science 53: pp. 1375-1388 CrossRef
    5. Ponomareva, N., Thelwall, M.: Do neighbours help? an exploration of graphbased algorithms for cross-domain sentiment classification. In: Proceedings of the 2012 EMNLP-CoNLL, Jeju Island, Korea, pp. 655鈥?65 (2012)
    6. Wu, Y.B., Zhang, Q., Huang, X.J., Wu, L.D.: Structural Opinion Mining for Graph-based Sentiment Representation. In: Proceedings of the 2011 EMNLP, Edinburgh, Scotland, UK, pp. 1332鈥?341 (2011)
    7. Kim, S.-M., Hovy, E.: Determining the Sentiment of Opinions. In: Proceedings of the 20th International Conference on Computational Linguistics. Association for Computational Linguistics (2004)
    8. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of EMNLP (2002)
    9. Martins, A.F.T., Smith, N.A., Aguiar, P.M.Q., Figueiredo, M.A.T.: Structured Sparsity in Structured Prediction. In: Proceedings of EMNLP (2011)
    10. Socher, R., Bauer, J., Manning, C.D., Andrew, Y.N.: Parsing With Compositional Vector Grammars. In: Proceedings of ACL (2013)
    11. Fu, Q., Dai, X., Huang, S., Chen, J. Forgetting word segmentation in Chinese text classification with L1-regularized logistic regression. In: Pei, J., Tseng, V.S., Cao, L., Motoda, H., Xu, G. eds. (2013) Advances in Knowledge Discovery and Data Mining. Springer, Heidelberg, pp. 245-255 CrossRef
    12. Tibshirani, R. (1994) Regression Shrinkage and Selection Via the Lasso. Journal of the Royal Statistical Society, Series B 58: pp. 267-288
    13. Yang, Y.M., Pedersen, J.O.: A comparative study on feature selection in text categorization. In: Proceedings ICML (1997)
    14. Rafrafi, A., Guigue, V., Gallinari, P.: Coping with the Document Frequency Bias in Sentiment Classification. In: AAAI ICWSM (2012)
    15. Buhlmann, P.L., Geer, S.A., Geer, S. (2011) Statistics for High-dimensional Data Methods, Theory and Applications. Springer, Heidelberg CrossRef
    16. Huang, J., Zhang, Z., Metaxas, D. (2011) Learning with structured sparsity. Journal of Machine Learning Research 12: pp. 3371-3412
    17. Bach, F., Jenatton, R., Mairal, J., Obozinski, G. (2012) Optimization with sparsity-inducing penalties. Foundations and Trends in Machine Learning 4: pp. 1-106 CrossRef
    18. Aggarwal, C.C., Zhao, P.X.: Towards graphical models for text processing. Knowledge and Information Systems (2013)
    19. Mairal, J., Yu, B.: Supervised Feature Selection in Graphs with Path Coding Penalties and Network Flows. Journal of Machine Learning Research (2013)
    20. Jacob, L., Obozinski, G., Vert, J.-P.: Group Lasso with overlap and graph Lasso. In: Proceedings of the International Conference on Machine Learning (ICML) (2009)
    21. Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J. (2008) LIBLINEAR: A library for large linear classification. Journal of Machine Learning Research 9: pp. 1871-1874
    22. Chang, C.C., Lin, C.J.: LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology聽2 (2011)
    23. Nakagawa, T., Inui, K., Kurohashi, S.: Dependency tree-based sentiment classification using CRFs with hidden variables. In: Proceedings of ACL:HLT (2010)
    24. Socher, R., Pennington, J., Huang, E.H., Andrew, Y.N., Manning, C.D.: Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions. In: Proceedings of EMNLP (2011)
    25. Pang, B., Lee, L.: A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts. In: Proceedings of ACL (2004)
    26. Goodman, J.: Exponential priors for maximum entropy models. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (2004)
  • 作者单位:Computational Linguistics and Intelligent Text Processing
  • 丛书名:978-3-319-18116-5
  • 刊物类别:Computer Science
  • 刊物主题:Artificial Intelligence and Robotics
    Computer Communication Networks
    Software Engineering
    Data Encryption
    Database Management
    Computation by Abstract Devices
    Algorithm Analysis and Problem Complexity
  • 出版者:Springer Berlin / Heidelberg
  • ISSN:1611-3349
文摘
Text representation is a preprocessing step in building a classifier for sentiment analysis. But in vector space model (VSM) or bag-of -features (BOF) model, features are independent of each other when to learn a classifier model. In this paper, we firstly explore the text graph structure which can represent the structural features in natural language text. Different to the BOF model, by directly embedding the features into a graph, we propose a graph sparsity regularization method which can make use of the the graph embedded features. Our proposed method can encourage a sparse model with a small number of features connected by a set of paths. The experiments on sentiment classification demonstrate our proposed method can get better results comparing with other methods. Qualitative discussion also shows that our proposed method with graph-based representation is interpretable and effective in sentiment classification task.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700