用户名: 密码: 验证码:
复合服务自恢复关键技术研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
面向服务的体系架构SOA为Internet环境下资源共享与业务集成提供了很好的应用模式。为了提高Web服务的可重用性,并分散和简化应用逻辑,单个Web服务通常只提供简单的业务功能,通过组合多个业务关联的服务形成复合服务流程来满足快速多变、需求多样化的应用需求。但是,复合服务应用中的web服务自主性和网络环境的复杂多变性给这类应用带来了新的问题,即复合服务在运行时易于出现服务失效异常。复合服务自恢复技术是保障复合服务可靠性和提高自动化管理的重要机制。自恢复技术可以减少用户对复合服务异常处理的负担,使得复合服务在有服务失效发生时仍然能够继续正常执行,实现复合服务的自动管理,降低管理成本。因此复合服务自恢复技术的研究对于推动面向服务架构的成功应用与实施有着重要的意义。
     本文针对复合服务自恢复中的若干关键问题展开下列研究:
     (1)基于容错的复合服务自恢复模型
     为了提供复合服务自动恢复的能力,本章提出了一种基于容错的复合服务自恢复模型,利用基于容错的恢复策略保障复合服务的可靠性。基于容错的恢复策略包括重试策略、顺序冗余策略和并行冗余策略。重试策略是指异常发生后对绑定服务进行重新调用的策略,顺序冗余策略是指异常发生后对绑定服务进行顺序的服务替换的策略,而并行冗余策略是指异常发生后对绑定服务进行并行的服务替换的策略。相比传统的恢复机制,所提出的基于容错的复合服务自恢复模型,能够更加有效的提高复合服务的可靠性。
     (2)收益驱动的自恢复策略选择方法
     针对复合服务执行时的业务特征,提出了一种收益驱动的复合服务恢复策略选择方法。我们的方法能准确计算在动态多变的Internet环境下复合服务执行的收益,并通过启发式算法有效保障复合服务执行的收益最大化。通过数值实验,验证了方法的有效性,并比较了不同的服务策略选择方法,收益驱动的恢复策略选择方法能有效选择出更优的恢复策略,而且保证在各种服务失效概率下获得的收益最高。
     (3)基于贝叶斯网络的异常诊断方法
     提出一种采用贝叶斯网络对复合服务异常根本原因进行诊断的方法。通过为复合服务建立基于贝叶斯网络的异常诊断模型,将复合服务中隐含的web服务之间异常关联的因果关系映射到贝叶斯网络中。利用Web服务出现失效的概率和复合服务历史运行的数据进行统计分析,获取贝叶斯网络中服务节点的先验概率以及服务输出节点的条件概率。在此基础上,提出了基于贝叶斯推理的复合服务异常诊断算法。与已有的研究相比,本文提出的复合服务异常诊断方法考虑了异常诊断推理过程中的不确定性,而且在服务监控信息不完整的情况下也能进行异常的诊断。实验表明提出的方法能快速有效地识别导致复合服务运行异常的服务节点。
     (4)面向BPEL的复合服务自恢复系统实现
     最后,在上述研究的基础之上,对面向BPEL的复合服务自恢复实现技术进行研究,给出了自恢复系统的整体框架,基于软件所开发的OnceBPEL引擎进行了系统扩展实现,重点介绍了自恢复策略到BPEL代码的转换算法。
In the service-oriented environment, a single Web service can hardly satisfy the given request, so the composition of multiple Web services is required to fulfill the goal. Because the reliability and effectiveness of remote Web services are unclear, and the performance of Internet is also unpredictable, it is difficult to guarantee the performance of these service-oriented applications, which are developed on Web services. As web services in a business process are loosely coupled and autonomous, failures occur frequently during the execution of the business process. Therefore how to reduce failure occurrence and recover service compositions from failures automatically becomes a significant problem in the area of SOC.
     To address the above problems, we propose several self-recovery techniques for composite services in this thesis. Our techniques take into account the autonomy and heterogeneity of web services, and thus can improve the reliability of composite services in the service environment. The main research works and contributions of this thesis are described as follows:
     1. Redundancy-based fault tolerance strategies are proposed for building reliable service composition applications, which are usually developed on the unpredictable remote web services. We propose a self-recovery service composition framework. Based on this framework composite services can automatically generate their optimal fault tolerance strategy to achieve good service reliability as well as good overall performance.
     2. We propose a utility-driven recovery strategy selection algorithm which explicitly balances the cost of provisioning with the expected value of the service composition. We propose a business model for composite services, and analyze the relationship between the utility of a business process and the requirement of customers. According to the SLA analysis, we give a selection algorithm which chooses optimal fault tolerance strategy. We conduct various experiments and the results show our selection algorithm performs well.
     3. In this research, we study the model of fault diagnosis to detect and defuse the real cause of a problem when service errors (such as incorrect result or SLA violation) occur in a service composition. Our approach leverages Bayesian networks to identify the most likely problematic services in a process and selectively inspect those services. The performance study shows that our proposed mechanism is effective on identifying the root cause of problems and can achieve significant cost savings.
     4. A prototype system that support self-recovery service composition is implemented based on above research works. The design and implementation of the core components are exprained in details. Self-recovery strategies are automatically transformed into standard WS-BPEL language and integrated into normal business logic.
引文
[Aalst 2000] Aalst W M, Jablonski S. Dealing with workflow change identification of issues and solutions [J]. International Jouranal of Computer System, Science and Engineering,2000, 15(5):267-276.
    [Aaron 2001] Aaron B. Brown and David A. Patterson. Embracing Failure:A Case for Recovery-Oriented Computing [C]. In High Performance Transaction Systems Workshop, 2001.
    [Alan 2004] Alan Ganek, IBM initiative in autonomic computing and policy [C]. IEEE 5th International Workshop on Policies for Distributed Systems and Networks,2004
    [Alexander 2001] Alexander Bronstein, Joydip Das, Marsha Duro, Rich Friedrich, Gary Kleyner, Martin Mueller, Sharad Singhal, Ira Cohen, Self-Aware Services:Using Bayesian Networks for Detecting Anomalies in Internet-based Services[J], Technical Report HPL-2001-23(R.1), HP Laboratories Palo Alto, "www.hpl.hp.com/techreports/2001/HPL2001-23Rl.ps",2001
    [Andreas 2004] Andreas Hanemann, Martin Sailer, David Schmitz, Assured Service Quality by Improved Fault Management-Service-oriented Event Correlation[C], Proceedings of the 2nd International conference on Service oriented computing, November2004
    [Andrew 1990] Andrew S. Tanenbaum, Robbert van Renesse, Hans van Staveren, Gregory J. Sharp, Sape J. Mullender, Jack Jansen, and Guido van Rossum. Experiences with the Amoeba Distributed Operating System. Communications of the ACM,33:46-63, December 1990.
    [Andrews 2003] Andrews T, Curbera F, Dholakia H, Goland Y, Klein J, Leymann F, Liu K, Roller D, Smith D, Thatte S, Trickovic I, Weerawarana S. Business Process Execution Language for Web Services Versionl.1[S].2003. http://wvw-128.ibm.com/developerworks/library/specification/ws-bpel/.
    [Baresi 2007] L. Baresi, and S. Guinea. Dynamo and Self-Healing BPEL Compositions[C]. In ICSE Companion, pages 69-70. IEEE Computer Society,2007.
    [Brugnoni 1993] S.Brugnoni and GBruno, An expert system of real time fault diagnosis of the Italian telecommunications network[J], Integrated Network Management, Pages:617-628, North-Holland,1993.Elsevler.
    [Casati 1998] Casati F. Models, Semantics, and Formal Methods for the Design of Workflows and their Exceptions [D]. Milano:1998.
    [Cheng 2007] Cheng S, Chang CK, Zhang L-J, Kim T-H.Towards Competitive Web Service Market [C]. In Proeeedings of IEEE International Workshop on Future Trends of Distributed Computing Systems,2007.213-219.
    [Chen 2008]Chen W, Wei J, Wu GQ and Qiao XQ. Developing a Concurrent Service Orchestration Engine Based on Event-driven Architecture[C]. DOA'08
    [Chiu 1999] Chiu D K, LI Q, Karlapalem K. Exception handling with workflow evolution in ADOME-WFMS:a taxonomy and resolution techniques [J]. SIGGROUP Bull.1999, 20(3):8-8
    [Cnythia 1997] Cnythia S.Hood and Chuanyi Ji. Proactive network fault detection[C]. In Proceedings of the IEEE NIFOCOM, Kobe, Japan, April 1997
    [Cortellessa 2007] Cortellessa V and Grassi V.Reliability Modeling and Analysis of service-Oriented Architectures[M]. In:Baresi L, Di Nitto E.(eds.) Test and Analysis of Web Services, Springer, Heidelberg,2007.339-362.
    [Curbera 2002] Curbera F, Duftler M J, Khalaf R, Nagy W, Mukhi N, Weerawarana S. Unraveling the Web Services Web:An Introduction to SOAP, WSDL, and UDDI [J]. IEEE Internet Computing,2002.6(2):86-93
    [Darst 1999] Darst, C; Ramanathan, S., Measurement and Management of Internet Services, Integrated Network Management [C],Proceedings of the Sixth IFIP/IEEE International Symposium Distributed Management for the Networked Millennium,24-28 May 1999, Pages:125-140
    [David 2003]David Oppenheimer, Archana Ganapathi, David A.Patternson, "Why do Internet Services fail, and what can be done about it?"[C],4th Usenix Symposium Internet Technologies and Systems, USITS'03,2003
    [Davis 2006] Davis D, Karmarkar A, Pilz G, Winkler S, Yalcinalp U.Web Services Reliable Messaging (WS-ReliableMessaging)[S].2006. http://docs.oasis-open.Org/ws-rx/wsrm/200608/wsrm-l.1.spec-cd-04.html
    [Deborah 2000] Deborah Caswell, Srinivas Ramanathan, Using Service Models for Management of Internet Services[J], IEEE Journal on Selected Areas in Communications, Volume:18, Issue:5, Pages:686-701, May 2000
    [Erradi 2006] Erradi, A., Maheshwari, P., and Tosic, V. Recovery policies for enhancing web services reliability [C]. In Proceedings of the IEEE International Conference on Web Services (ICWS'06)
    [Georgakopoulos 2002] Georgakopoulos, D., Schuster, H., Cichocki, A. and Baker, D. Process-based e-service composition for modeling and automating zero latency supply chains[M], Information System Frontiers, Kluwer Academic Publishers,2002 Vol.4, No.1, pp.33-54.
    [George 2001] George Candea and Armando Fox. Recursive Restartability:Turning the Reboot Sledgehammer into a ScalPel [C]. Proc.8th Workshop on Hot Topics in Operating Systems. Schloss Elmau, Germany, May 2001
    [Hanemann 2005] Hanemann, A., Sailer, M., Schmitz, D., Towards a Framework for IT Service Fault Management[C], In Proceedings of the European University Information Systems Conference (EUNIS 2005), EUNIS, Manchester, England, June,2005.
    [Hansen 2002] Hansen, M., Madnick, S. and Siegel, M.Bussler, C. et al. (Eds.):Process Aggregation Using Web Services[J], WES, LNCS 2512, Springer-Verlag Berlin Heidelberg, pp.12-27.
    [Huang 2005]Xiaohui Huang, Shihong Zou, Wendong Wang, Shiduan Cheng, MDFM: Multi-domain Fault Management for Internet Services[C], IFIP/IEEE 8th International Conference on Management of Multimedia Networks and Services, MMNS2005, NLCS3754, pp.121-132
    [IBM 2004]IBM Corporation, An architectural blue print for autonomic computing[P], Autonomic Computing White Paper,2004 http://www-3.ibm.com/autonomic/pdfs/ACBP22004-10-04.pdf
    [Irwin 2004] Irwin, D. E., Grit, L. E., and Chase, J. S.Balancing risk and reward in a market-based task service[C]. In Proceedings of the 13th IEEE International Symposium on High Performance Distributed Computing (HPDC-13'04), Honolulu, USA, pages 160-169. IEEE Computer Society.
    [Jakobson 1993] GJakobson and M.Weissman, Alarm Correlation[M], IEEE Network,7(6), November,1993.
    [Jaeger 2004] Jaeger, M., Rojec-Goldmann, G, and M'uhl, G. QoS aggregation for web service composition using workflow patterns[C]. EDOC 2004
    [Jeffrey 2003] Jeffrey O. Kephart, David M. Chess. The Vision of Autonomic Computing[C], IEEE Computer Society, Jan 2003
    [Johann 1995]Johann Eder W L. The Workflow Activity Model WAMO [C]. In:Int. Conf. on Cooperative Information Systems. Vienna, Austria:1995.
    [Jordan 2007] Jordan D and Evdemon J.2007. Web Serivces Business Process Execution Language Version2.0[S], OASIS Standard. http://docs.oasis-open.org/wsbpel/2.0/serviceref
    [Karunamurthy 2006] Karunamurthy R, Khendek F, and Glitho R H. A Novel Business Model for Web Service Composition[C]. In Proceedings of IEEE International Conference on Services Computing,2006.431-437.
    [Katzela 1995]I.Katzela and M.Sehwartz, Schemes for Fault Identification in Communication Networks[J].IEEE/ACM Transactions on Networking, vol.3, December 1995
    [Kim 2004]Kim S.M., Rosu M.C. A Survey of Public web services[C]. In Proc. of the 13th International WWW Conference.2004
    [Klein 2000] Klein M, Dellarocas C. A Knowledge-based Approach to Handling Exceptions in Workflow Systems [J]. ComPuter Supported Cooperative Work,2000,9(3-4):399-412
    [Kuo 2004]Kuo D. Consistency in Loosely Coupled Distributed Systems. http://www3.ict.csiro.au/ict/content/displav/0.,al6254_b88969_d89556.00.html.2004.
    [Koopman 2003] P. Koopman. Elements of the self-healing system problem space[C]. In ICSE WADS03 Proc.,2003.
    [Kong 1998]Qinzheng Kong, Chen,G.; Hussain, R.Y. A Management Framework for Internet Services, Network Operations and Management Symposium[C].NOMS98., IEEE, Volumerl, 15-20 Feb.1998 Pages:21-30 vol.1
    [Liliana 2005]Liliana Ardissono, Luca Console, Anna Goy, Daniele Theseider Dupre, Towards self-diagnosing Web Services [C], IFIP/IEEE International Workshop on Self-Managed Systems & Services, SelfMan2005, Nice, France
    [Lewis 1993] L.Lewis, A Case-based Reasoning Approach for the Resolution of Faults in Communication Networks[C], Proceedings of the 3dr IFIP/IEEE Symposium on Integrated Network Management, San Francisco, California, USA, April,1993.
    [Lazovik 2006] Lazovik A, Aiello M, Papazoglou M. Planning and monitoring the execution of web service requests [J], International Journal on Digital Libraries,2006,6(3):235-246.
    [Lemos 2002]R.de Lemos, J.L.Fiadeiro, An Architectural Support forSelf-adaptive Software for Treating Faults[C],Proceedings of the 1st ACM SIGSOFT Workshop on Self-Healing Systems(WOSS'02),2002
    [Li 2005]Li, W., He, J., Ma, Q., Yen, I.-L., Bastani, F., and Paul, R. A framework to support survivable web services[C]. In Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05), Denver, USA, page 93.2. IEEE Computer Society.2005
    [Luo 2000]Luo Z W, Sheth A, Kochut K, et al. Exception handling in workflow systems [J]. APPLIED INTELLIGENCE,2000,13(2):125-147
    [Luo 2001]Luo Z. Knowledge Sharing, Coordinated Exception Handling, and Intelligent Problem Solving for Cross-Organizational Business Processes[C];2001.
    [Maximilien 2004]Maximilien E M and Singh M P. A Framework and Ontology for Dynamic Web Services Selection [J]. IEEE Internet Computing,2004.8(5):84-93.
    [Menasce 2002] Menasce D A.QoS Issues in Web Services [J]. IEEE Internet Computing, 2002.6(6):72-75.
    [Moser 2006] Moser, L.E. Melliar-Smith, P.M. Zhao,W., Making Web Services dependable,Availability,Reliability and Security[C]. ARES2006. April 2006
    [Naftaly 2003] Naftaly H. Minsky, On Conditions for Self-Healing in Distributed Software Systems[C], Autonomic Computing Workshop Fifth Annual International Workshop on Active Middleware Services(AMS'03),2003
    [Nilsson 1982] NJ Nilsson, Principles of artificial intelligence[M].1982.
    [Paul 1993]Paul Dagum, Michael Luby, Approximating Probabilistic inference in Bayesian belief networks is NP-hard[M], Artificial Intelligence, v.60.n.l, p.141-153, March 1993
    [Paul 2001] Paul Horn, Autonomic Computing:IBM's perspective on the State of Information Technology[P],2001 http://www.research.ibm.com/autonomic/manifesto/autonomic computing.pdf
    [Peter2000] Peter J K, Gregory A B, Richard N T, et al. Techniques for Supporting Dynamic and Adaptive Workflow [J]. Computer Supported Cooperative Work,2000,9(3-4):269-292.
    [Philip 2003] Philip Koopman, Elements of the self-healing system problem space[C], Workshop on Architecting Dependable Systems(WADS'03), May 2003
    [Piccinelli 2003]Piccinelli, G and Williams, S.C. van der Aalst, W.M.P. et al. (Eds.):Workflow:A Language for Composing Web Services[J], BPM, LNCS 2678, Springer-Verlag Berlin Heidelberg, pp.13-24.
    [Preuner 2002] Preuner, G. and Schrefl, M. Integration of web services into workflows through a multilevel schema architecture[C], Proceedings of the 4th IEEE Int'l Workshop on Advanced Issues of E-commerce and Web-based Information Systems (WECWIS 2002).
    [Rainer 2001]Rainer Hauck, Igor Radisic, Service oriented Application Management-Do Current Techniques meet the Requirements[C],New Development in Distributed Applications and Interoperable Systems:3rd IFIPI, International Working Conference(DAIS2001),2001
    [Rinderle 2004] Rinderle S, Reichert M, Dadam P. Correctness criteria for dynamie changes in workflow systems-a survey[J]. DATA & KNOWLEDGE ENGINEERING,2004,50(1):9-3
    [Rogerio 2003]Rogerio de Lemos, ICSE 2003 WADS Panel:Fault Tolerance and Self-Healing[C], ICSE 2003 Workshop on Software Architectures for dependable Systems, May 2003
    [Saastamoinen 1995] Saastamoinen H. On the Handling of Exceptions in Information Systems [D]. University of Jyvaskyla,1995.
    [Sadiq 2000]Sadiq S W, Orlowska M E. On capturing exceptions in workflow process models [C]. In Proceedings of the 4th International Conference on Business Information Systems.2000.3-19
    [Sape 1990]Sape J. Mullender, Guido van. Rossum, Andrew S. Tanenbaum, Robbert van Renesse, and Hans van Staveren. Amoeba:A Distributed Operating System for the 1990s. Computer, 23(5):44-53, May 1990.
    [Sirin 2003]Sirin E, Parsia B, Wu Dan, et al. HTN Planning for Web Service Composition Using SHOP2[C]. In Proceedings of 2nd International Semantic Web Conference(ISWC2003), Sanibel Island, Florida, USA.2003:20-23
    [Sirin 2005]Sirin, E., Parsia, B., and Hendler, J. Template-based composition of semantic web services[J]. AAAI Press.2005.
    [Srinivas 1999]Srinivas Ramanathan, Deborah Caswell, Scott Neal, Auto-Discovery Capabilities for Service Management:An ISP Case Study[P], Technical Report HPL-1999-68, HP Laboratories Palo Alto
    [Steinder 2004]M.Steinder and A.S.Sethi, Probabilistic Fault Localization in Communication Systems Using Belief Networks [J],IEEE/ACM Transactions on Networking, Vol.12, No.5, October 2004
    [Steinder 2003]M.Steinder and A.S.Sethi, Probabilistic event-driven fault diagnosis through incremental hypothesis updating[C],IFIP/IEEE Symposium on Integrated Network Management, Colorado Springs,CO,2003.
    [Setinder 2002]M.Setinder and A.S.Sethi, Increasing robustness of fault localization through analysis of lost spurious and positive symptoms.[C].21st Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOM), New York, NY,2002
    [Stein 2009] S Stein et al. Flexible Provisioning of Web Service[J]. ACM Transactions on Internet Technology vol.9, no.l, Feb.2009.
    [Stepniewska 2004]Stepniewska I. The pulvinar complex, in the Primate Visual System [M], ed. Kaas JH and Collins CE, CRC Press.
    [Steve 2004]Steve R. White, James E. Hanson, Ian Whalley, David M. Chess, Jeffrey O. Kephart, An Architectural Approach to Autonomic Computing[C], International Conference on Autonomic Computing(ICAC'04),2004
    [Stephenson 2000]Stephenson T A. An Introduction to Bayesian Network Theory and Usage[C]. ID1AP-RR00-03, Feb,2000
    [Tonti 2003]G Tonti, et al., Semantic web languages for policy representation and reasoning:A comparison of kaos, rei, and Ponder[C], Proceedings of International Semantic Web Conference 03,2003.
    [Vojevodina 2005]Vojevodina D, Kulvietis G, Bindokas P. The Method for E-business Exception Handling [C]. In Proceedings of the 5th International Conference on Intelligent Systems Design and Applications (ISDA'05). Washington DC, USA:IEEE Computer Society,2005.203-208
    [Wang 1993] C.Wang and M.Sehwartz, Fault Detection with multiple observers [J], IEEE Transaction on Networking, Vol.1, PP48-55, Feb.1993
    [Wei 2006]Wei Zhao, Rainer Hauser, et al.,Compiling business processes:untangling unstructured loops in irreducible flow graphs[J]. International Journal of Web and Grid Services, Volumn 2, Issue 1, pages:68-91,2006
    [Wil 2007]P. Wil M, Rosemann M, Dumas M. Deadline-based escalation in Process-aware information systems [J]. Decision Support Systems,2007.43(2):492-511
    [WSRM 2004] Bilorusets R, et al. Web service reliable messaging protocol[S]. IBM/BEA/ Microsoft/TIBCO Specification.2004
    [WS-ReliableMessaing 2007]OASIS, "Web services reliable messaging (WS-ReliableMessaging) version 1.1[S],2007.
    [Xia 2007]Xia Yonglin and Wei Jun. Automated Resource Management Framework for Adjusting Business Service Capability[C]. In proceedings of the 2007 IEEE Asia-Pacific Services Computing Conference.
    [Xia 2008]Xia Yonglin and Wei Jun. Context-driven Business Process Adaptation for ad hoc changes[C]. ICEBE 2008
    [Yemini 1996]Yemini, S.A., Kliger, S., Mozes, E., Yemini, Y., Ohsie, D., High speed and robust event correlation[M], Communications Magazine, IEEEVolume34, Issues, May 1996.Page(s):82-90
    [Zeng 2004]Zeng, L., Benatallah, B., Ngu, A. H., Dumas, M., Kalagnanam, J., and Chang, H. QoS aware middleware for web services composition[J]. IEEE Trans. Softw.2004.
    [Zhou 2005]Zhou C, Chia L-T, and Lee B-S. Web Services Discovery with DAML-QoS Ontology [J]. International Journal of Web Services Research,2005.2(2):43-66
    [Zheng 2008]Zibin Zheng and Michael R. Lyu. A Distributed Replication Strategy Evaluation and Selection Framework for Fault Tolerant Web Services[C]. In ICWS 08 Proc.,2008.
    [Zo 2007]Zo H, Nazareth D L, and Jain H K.2007. Measuring Reliability of Applications Composed of Web Services[C]. In Proceedings of the Annual Hawaii International Conference on System Sciences,278.
    [Xu 2005]徐伟,金蓓弘,李京,曹建农.一种基于移动Agent的复合Web服务模型[J].计算机学报,2005,28(4):558-567.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700