用户名: 密码: 验证码:
Computational reconstruction of proteome-wide protein interaction networks between HTLV retroviruses and Homo sapiens
详细信息    查看全文
  • 作者:Suyu Mei (1) (2)
    Hao Zhu (1)

    1. Bioinformatics Section
    ; School of Basic Medical Sciences ; Southern Medical University ; Guangzhou ; 510515 ; China
    2. Software College
    ; Shenyang Normal University ; Shenyang ; 110034 ; China
  • 刊名:BMC Bioinformatics
  • 出版年:2014
  • 出版时间:December 2014
  • 年:2014
  • 卷:15
  • 期:1
  • 全文大小:1,094 KB
  • 参考文献:1. Wu, X, Zhu, L, Guo, J, Zhang, D, Lin, K (2006) Prediction of yeast protein-protein interaction network: insights from the gene ontology and annotations. Nucleic Acids Res 34: pp. 2137-2150 CrossRef
    2. DeBodt, S, Proost, S, Vandepoele, K, Rouz茅, P, Peer, Y (2009) Predicting protein-protein interactions in Arabidopsis thaliana through integration of orthology, gene ontology and co-expression. BMC Genomics 10: pp. 288 CrossRef
    3. Shen, J, Zhang, J, Luo, X, Zhu, W, Yu, K, Chen, K, Li, Y, Jiang, H (2007) Predicting protein鈥損rotein interactions based only on sequences information. PNAS 104: pp. 4337-4341 CrossRef
    4. von Mering, C, Krause, R, Snel, B, Cornell, M, Oliver, SG, Fields, S, Bork, P (2002) Comparative assessment of large-scale datasets of protein-protein interactions. Nature 417: pp. 399-403 CrossRef
    5. Edwards, AM, Kus, B, Jansen, R, Greenbaum, D, Greenblatt, J, Gerstein, M (2002) Bridging structural biology and genomics: assessing protein interaction data with known complexes. Trends Genet 18: pp. 529-536 CrossRef
    6. Fu, W, Sanders-Beer, BE, Katz, KS, Maglott, DR, Pruitt, KD, Ptak, RG (2009) Human immunodeficiency virus type 1, human protein interaction database at NCBI. Nucleic Acids Res 37: pp. D417-D422 CrossRef
    7. Wuchty, S (2011) Computational prediction of host-parasite protein interactions between P. falciparum and H. sapiens. PLoS ONE 6: pp. e26960 CrossRef
    8. Schleker, S, Sun, J, Raghavan, B, Srnec, M, M眉ller, N, Koepfinger, M, Murthy, L, Zhao, Z, Klein-Seetharaman, J (2012) The current Salmonella-host interactome. Proteomics Clin Appl 6: pp. 117-133 CrossRef
    9. Simonis, N, Rual, JF, Lemmens, I, Boxus, M, Hirozane-Kishikawa, T, Gatot, JS, Dricot, A, Hao, T, Vertommen, D, Legros, S, Daakour, S, Klitgord, N, Martin, M, Willaert, JF, Dequiedt, F, Navratil, V, Cusick, ME, Burny, A, Van Lint, C, Hill, DE, Tavernier, J, Kettmann, R, Vidal, M, Twizere, JC (2012) Host-pathogen interactome mapping for HTLV-1 and -2 retroviruses. Retrovirology 9: pp. 26 CrossRef
    10. Tastan O, Qi Y, Carbonell J, Klein-Seetharaman J: Prediction of interactions between HIV-1 and human proteins by information integration. / Proceedings of the Pacific Symposium on Biocomputing (PSB-2009)516鈥?27.
    11. Qi, Y, Tastan, O, Carbonell, JG, Klein-Seetharaman, J, Weston, J (2010) Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins. Bioinformatics 26: pp. i645-i652 CrossRef
    12. Dyer, M, Muralib, T, Sobrala, B (2011) Supervised learning and prediction of physical interactions between human and HIV proteins. Infect Genet Evol 11: pp. 917-923 CrossRef
    13. Doolittle, J, Gomez, S (2010) Structural similarity-based predictions of protein interactions between HIV-1 and Homo sapiens. Virol J 7: pp. 82 CrossRef
    14. Mukhopadhyay, A, Maulik, U, Bandyopadhyay, S (2012) A novel biclustering approach to association rule mining for predicting HIV-1鈥揾uman protein interactions. PLoS ONE 7: pp. e32289 CrossRef
    15. Dyer, M, Murali, T, Sobral, B (2007) Computational prediction of host-pathogen protein-protein interactions. Bioinformatics 23: pp. i159-i166 CrossRef
    16. Schleker, S, Garcia-Garcia, J, Klein-Seetharaman, J, Oliva, B (2012) Prediction and comparison of Salmonella-human and Salmonella-Arabidopsis interactomes. Chem Biodivers 9: pp. 991-1018 CrossRef
    17. Kshirsagar, M, Carbonell, J, Judith, K (2012) Techniques to cope with missing data in host鈥損athogen protein interaction prediction. Bioinformatics 28: pp. i466-i472 CrossRef
    18. Kshirsagar, M, Carbonell, J, Judith, K (2013) Multitask learning for host鈥損athogen protein interactions. Bioinformatics 29: pp. i217-i226 CrossRef
    19. Mei, S (2013) Probability weighted ensemble transfer learning for predicting interactions between HIV-1 and human proteins. PLoS ONE 8: pp. e79 CrossRef
    20. Yu, J, Guo, M, Needham, CJ, Huang, Y, Cai, L, Westhead, DR (2010) Simple sequence-based kernels do not predict protein-protein interactions. Bioinformatics 26: pp. 2610-2614 CrossRef
    21. Venkatesan, K, Rual, JF, Vazquez, A, Stelzl, U, Lemmens, I, Hirozane-Kishikawa, T, Hao, T, Zenkner, M, Xin, X, Goh, KI, Yildirim, MA, Simonis, N, Heinzmann, K, Gebreab, F, Sahalie, JM, Cevik, S, Simon, C, de Smet, AS, Dann, E, Smolyar, A, Vinayagam, A, Yu, H, Szeto, D, Borick, H, Dricot, A, Klitgord, N, Murray, RR, Lin, C, Lalowski, M, Timm, J (2009) An empirical framework for binary interactome mapping. Nat Methods 6: pp. 83-90 CrossRef
    22. Rual, JF, Venkatesan, K, Hao, T, Hirozane-Kishikawa, T, Dricot, A, Li, N, Berriz, GF, Gibbons, FD, Dreze, M, Ayivi-Guedehoussou, N, Klitgord, N, Simon, C, Boxem, M, Milstein, S, Rosenberg, J, Goldberg, DS, Zhang, LV, Wong, SL, Franklin, G, Li, S, Albala, JS, Lim, J, Fraughton, C, Llamosas, E, Cevik, S, Bex, C, Lamesch, P, Sikorski, RS, Vandenhaute, J, Zoghbi, HY (2005) Towards a proteome scale map of the human protein-protein interaction network. Nature 437: pp. 1173-1178 CrossRef
    23. Chatr-aryamontri, A, Ceol, A, Peluso, D, Nardozza, A, Panni, S, Sacco, F, Tinti, M, Smolyar, A, Castagnoli, L, Vidal, M, Cusick, ME, Cesareni, G (2009) VirusMINT: a viral protein interaction database. Nucleic Acids Res 37: pp. D669-D673 CrossRef
    24. Navratil, V, de Chassey, B, Meyniel, L, Delmotte, S, Gautier, C, Andr茅, P, Lotteau, V, Rabourdin-Combe, C (2009) VirHostNet: a knowledge base for the management and the analysis of proteome-wide virus-host interaction networks. Nucleic Acids Res 37: pp. D661-D668 CrossRef
    25. Freund, Y, Schapire, RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55: pp. 119-139 CrossRef
    26. Vezhnevets, A, Vezhnevets, V (2005) Modest AdaBoost 鈥?Teaching AdaBoost to Generalize Better. Graphicon 12: pp. 987-997
    27. Boeckmann, B, Bairoch, A, Apweiler, R, Blatter, MC, Estreicher, A, Gasteiger, E, Martin, MJ, Michoud, K, O'Donovan, C, Phan, I, Pilbout, S, Schneider, M (2003) The SWISS-PROT protein knowledgebase and its supplement TrEMBL. Nucleic Acids Res 31: pp. 365-370 CrossRef
    28. Altschul, S, Madden, T, Schaffer, A, Zhang, J, Zhang, Z, Miller, W, Lipman, D (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: pp. 3389-3402 CrossRef
    29. Barrell, D, Dimmer, E, Huntley, RP, Binns, D, O'Donovan, C, Apweiler, R (2009) The GOA database in 2009鈥攁n integrated gene ontology annotation resource. Nucleic Acids Res 37: pp. D396-D403 CrossRef
    30. Meir, R, Ratsch, G (2003) An introduction to boosting and leveraging. Lect Notes Artif Int 2600: pp. 118-183
  • 刊物主题:Bioinformatics; Microarrays; Computational Biology/Bioinformatics; Computer Appl. in Life Sciences; Combinatorial Libraries; Algorithms;
  • 出版者:BioMed Central
  • ISSN:1471-2105
文摘
Background Human T-cell leukemia viruses (HTLV) tend to induce some fatal human diseases like Adult T-cell Leukemia (ATL) by targeting human T lymphocytes. To indentify the protein-protein interactions (PPI) between HTLV viruses and Homo sapiens is one of the significant approaches to reveal the underlying mechanism of HTLV infection and host defence. At present, as biological experiments are labor-intensive and expensive, the identified part of the HTLV-human PPI networks is rather small. Although recent years have witnessed much progress in computational modeling for reconstructing pathogen-host PPI networks, data scarcity and data unavailability are two major challenges to be effectively addressed. To our knowledge, no computational method for proteome-wide HTLV-human PPI networks reconstruction has been reported. Results In this work we develop Multi-instance Adaboost method to conduct homolog knowledge transfer for computationally reconstructing proteome-wide HTLV-human PPI networks. In this method, the homolog knowledge in the form of gene ontology (GO) is treated as auxiliary homolog instance to address the problems of data scarcity and data unavailability, while the potential negative knowledge transfer is automatically attenuated by AdaBoost instance reweighting. The cross validation experiments show that the homolog knowledge transfer in the form of independent homolog instances can effectively enrich the feature information and substitute for the missing GO information. Moreover, the independent tests show that the method can validate 70.3% of the recently curated interactions, significantly exceeding the 2.1% recognition rate by the HT-Y2H experiment. We have used the method to reconstruct the proteome-wide HTLV-human PPI networks and further conducted gene ontology based clustering of the predicted networks for further biomedical research. The gene ontology based clustering analysis of the predictions provides much biological insight into the pathogenesis of HTLV retroviruses. Conclusions The Multi-instance AdaBoost method can effectively address the problems of data scarcity and data unavailability for the proteome-wide HTLV-human PPI interaction networks reconstruction. The gene ontology based clustering analysis of the predictions reveals some important signaling pathways and biological modules that HTLV retroviruses are likely to target.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700