Client Software Design for a Multi-Channel Audio and Video Server Based on the H.264 Standard
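Abstract
A network video server is a dedicated device that encodes audio and video data and transmits it over a network. Video servers have broad and important applications in network video services and video surveillance, and can be used for multimedia video dispatch and command, network video surveillance and conferencing, live multimedia webcasting, online teaching, telemedicine, and similar areas.
     The core of this thesis is the design and implementation of the client software system. Following a modular software design approach, the author divides the client software system into modules and refines them, then describes in detail the design of each submodule and of its main functions.
     This thesis designs and implements the "Client Software Design for a Multi-Channel Video Server Based on the H.264 Standard". The client software is developed primarily on the Windows platform. An ActiveX control was built using the DirectShow multimedia development toolkit and the COM specification, with code for receiving, decompressing, and playing H.264 compressed video data. Network transmission is based on the Real-time Transport Protocol (RTP), SIP is used as the carrier for media sessions, and IPv6 networks are supported. Picture-quality error detection and audio-video synchronization are also implemented. The client interface is built as an MFC dialog, and the PELCO-D protocol is used for pan-tilt control of the camera lens.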
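     The PELCO-D control mentioned above can be illustrated with a small sketch. It assumes the commonly documented 7-byte Pelco-D frame (sync byte 0xFF, camera address, two command bytes, two data bytes, and a modulo-256 checksum over bytes 2 to 6). The function and constant names are hypothetical and are not taken from the thesis, whose own client is written in C++.

```python
# Bit flags in the second command byte, per the commonly documented Pelco-D layout.
PAN_RIGHT = 0x02
PAN_LEFT = 0x04
TILT_UP = 0x08
TILT_DOWN = 0x10
ZOOM_TELE = 0x20
ZOOM_WIDE = 0x40


def pelco_d_frame(address, command1, command2, data1=0, data2=0):
    """Build one 7-byte Pelco-D frame (hypothetical helper).

    data1 carries the pan speed and data2 the tilt speed.
    """
    payload = [address, command1, command2, data1, data2]
    for value in payload:
        if not 0 <= value <= 0xFF:
            raise ValueError("each field must fit in one byte")
    checksum = sum(payload) & 0xFF  # modulo-256 sum of bytes 2..6
    return bytes([0xFF] + payload + [checksum])


def pan_right(address, speed):
    """Frame that pans the camera at `address` to the right."""
    return pelco_d_frame(address, 0x00, PAN_RIGHT, data1=speed)


def stop(address):
    """Frame with no command bits set, which stops all motion."""
    return pelco_d_frame(address, 0x00, 0x00)
```

     For example, `pan_right(1, 0x20)` yields the bytes `FF 01 00 02 20 00 23`, which the server side would forward to the pan-tilt decoder.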
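     Test results show that the terminal can establish connections with other devices using SIP in IPv4/IPv6 network environments, and that it provides the basic functions of a network video server client: real-time decoding and playback of H.264 and other video formats together with multiple audio formats, network communication, and hard-disk storage. It can be applied widely in video surveillance, VOD, and similar fields.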
A network video server is a digital, networked device with several advantages. A surveillance system built on it is easy to deploy as a network and covers long distances. It also offers strong noise immunity, good picture quality, and easy querying. It can therefore be applied extensively in telecommunications, power, transportation, banking, water conservancy, intelligent buildings, and similar sectors. A video server system reflects the monitored objects in real time, visually, and faithfully, and it obtains information about them, which increases management efficiency and the level of office automation.
     The multi-channel digital video server system is a new technology product that combines electronics, computer, communication, and automation technologies, and it is developing quickly.
     Because a video sequence carries a huge amount of data while the transmission network has limited bandwidth, video coding plays a key role in a network video server system. H.264 is the latest ITU-T video coding standard and is now widely used because of its strong adaptability, high compression efficiency, and excellent video quality. We therefore chose it as the video compression standard for the video server system.
     To implement the client of the digital video server system, we built the remote computer's Filter Graph on COM (Component Object Model) and DirectShow filters. It receives video from the network and decodes, displays, records, and plays it back. We modified the Intel DirectShow RTP framework and used it to implement the "H.263 over RTP" transmission mode. Because the client is built on the DirectShow architecture, its functions can be extended by adding or removing DirectShow filters without changing the overall system framework. The system we developed therefore has good compatibility and extensibility. The H.264 standard and the RTP protocol used in the client ensure high-performance video transmission over low-bit-rate networks.
     For pan-tilt drive and lens control, we used XML and SIP to implement direct communication between the front-end computer and the pan-tilt decoder. Using Winsock, the Windows network application programming interface, we implemented network communication of control instructions between the front-end embedded device and the client computer.
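     As a rough illustration of carrying an XML control instruction in a SIP request, the sketch below builds the bytes of one request. The abstract does not say which SIP method or XML schema is used, so the MESSAGE method, the element names (`ptz_control`, `action`, `speed`), the tag and branch values, and both function names are assumptions made only for this example.

```python
import xml.etree.ElementTree as ET


def build_ptz_xml(action, speed):
    """Serialize a pan-tilt instruction as XML bytes (hypothetical schema)."""
    root = ET.Element("ptz_control")
    ET.SubElement(root, "action").text = action
    ET.SubElement(root, "speed").text = str(speed)
    return ET.tostring(root, encoding="utf-8")


def build_sip_message(from_uri, to_uri, via_host, call_id, cseq, body):
    """Wrap `body` in a SIP MESSAGE request to be sent over TCP.

    Only the headers needed to show the framing are included; a real
    SIP stack would also manage dialogs, retransmission, and responses.
    """
    headers = [
        f"MESSAGE {to_uri} SIP/2.0",
        f"Via: SIP/2.0/TCP {via_host};branch=z9hG4bK-example",
        "Max-Forwards: 70",
        f"From: <{from_uri}>;tag=example",
        f"To: <{to_uri}>",
        f"Call-ID: {call_id}",
        f"CSeq: {cseq} MESSAGE",
        "Content-Type: application/xml",
        f"Content-Length: {len(body)}",  # length of the body in bytes
    ]
    # A blank line (CRLF CRLF) separates the headers from the body.
    return "\r\n".join(headers).encode("ascii") + b"\r\n\r\n" + body


request = build_sip_message(
    "sip:client@192.0.2.10",
    "sip:server@192.0.2.20",
    "192.0.2.10:5060",
    "example-call-id@192.0.2.10",
    1,
    build_ptz_xml("pan_right", 32),
)
```

     The resulting bytes could then be written to a connected TCP socket, which corresponds to the Winsock connection described above.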
     Motion detection is a valuable technique in video surveillance systems. By analyzing digital video sequences, the system can determine whether something is moving within the monitored scene. Based on the result, it can take actions such as tracking the object, raising an alarm, or starting recording or transmission.
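     The abstract does not name the detection algorithm. One simple and widely used approach is frame differencing, sketched below under that assumption: two grayscale frames are compared pixel by pixel, and motion is reported when enough pixels change by more than a threshold. The function name, the frame representation (lists of rows of 0..255 integers), and both threshold values are illustrative choices.

```python
def motion_detected(prev_frame, curr_frame, pixel_threshold=25, area_ratio=0.01):
    """Return True if the fraction of changed pixels reaches `area_ratio`.

    A pixel counts as changed when its absolute grayscale difference
    between the two frames exceeds `pixel_threshold`.
    """
    if len(prev_frame) != len(curr_frame):
        raise ValueError("frames must have the same number of rows")
    changed = 0
    total = 0
    for prev_row, curr_row in zip(prev_frame, curr_frame):
        if len(prev_row) != len(curr_row):
            raise ValueError("rows must have the same length")
        for before, after in zip(prev_row, curr_row):
            total += 1
            if abs(before - after) > pixel_threshold:
                changed += 1
    return total > 0 and changed / total >= area_ratio


# An 8x8 dark background, and the same scene with a bright 2x2 object.
background = [[0] * 8 for _ in range(8)]
with_object = [row[:] for row in background]
for y in (3, 4):
    for x in (3, 4):
        with_object[y][x] = 200
```

     Here `motion_detected(background, with_object)` is true because 4 of 64 pixels changed strongly, while comparing the background with itself reports no motion. The pixel threshold keeps small sensor noise from triggering an alarm.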
     The scheme adopts a C/S (client/server) network model and is well suited to the strict real-time requirements of network video streaming. This thesis mainly analyzes and studies the techniques used in the client of a multi-channel digital video server system, and it implements a "Software Design for PC Client of Multi-Channel Digital Video Server Based on H.264".
     The design and implementation of the client software for the digital video server can be divided into the following steps.
     (1) The next-generation video compression standard H.264 is designed to compress source images and video with lower distortion, lower bit rates, and higher compression ratios, with the aim of serving many application scenarios. H.264 adopts many advanced algorithms, with which it strikes a good balance between computational complexity and coding performance. Compared with other techniques, H.264 offers high compression performance and is well suited to mobile and IP networks. For generality and extensibility of the client software, the architecture is based on the DirectX standard. Within the DirectShow framework, the system implements H.264 video decoding, playback, display, and network communication and transmission. Microsoft's DirectShow is built for streaming media on Windows operating systems. It makes multimedia playback flexible and supports almost all media formats, especially the capture and playback of video streams.
     (2) The client/server model is a common communication model based on TCP/IP. Programs run on the client terminal (remote) and the server terminal (local): the client terminal sends a request to the server terminal, and the server terminal replies accordingly. This step studies the control principles and functions of pan/tilt and lens control and of the alarm system. Signaling for pan/tilt and lens control, alarms, and parameter control is sent from the client terminal to the server terminal in the same way: SIP is the application protocol, the signal is encapsulated in XML, and the XML is sent to the server terminal as the content of a SIP message. For communication, we use Winsock to implement the connection-oriented (TCP) C/S model.
     (3) A SIP/SDP protocol stack for session initiation and media format description and an RTP/RTCP protocol stack for real-time multimedia stream transmission are also implemented.
     (4) MFC (Microsoft Foundation Classes) and its SDK are used for the client terminal's interface. The program was developed in the VC environment with C++ as the programming language. The user can click the ActiveX button to connect to the server terminal, change the video format, and switch between one-channel and four-channel display. The client software also supports local storage, snapshots, zooming in and out, and so on. The client interface is suitable and easy to use, and it meets many of the intended goals.
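     To make the RTP part of step (3) more concrete, the following sketch parses the fixed 12-byte RTP header defined in RFC 3550 and returns the payload that a decoder would receive. It is not the thesis implementation; the function name and the returned dictionary layout are assumptions made for illustration.

```python
import struct


def parse_rtp(packet):
    """Parse one RTP packet (RFC 3550) into header fields and payload."""
    if len(packet) < 12:
        raise ValueError("packet shorter than the fixed RTP header")
    b0, b1, sequence, timestamp, ssrc = struct.unpack("!BBHII", packet[:12])
    if b0 >> 6 != 2:
        raise ValueError("unsupported RTP version")
    csrc_count = b0 & 0x0F
    offset = 12 + 4 * csrc_count  # skip the CSRC list
    if b0 & 0x10:  # header extension present
        if len(packet) < offset + 4:
            raise ValueError("truncated header extension")
        (ext_words,) = struct.unpack("!H", packet[offset + 2:offset + 4])
        offset += 4 + 4 * ext_words
    end = len(packet)
    if b0 & 0x20:  # padding present: the last byte holds the padding length
        end -= packet[-1]
    if offset > end:
        raise ValueError("truncated packet")
    return {
        "marker": bool(b1 >> 7),
        "payload_type": b1 & 0x7F,
        "sequence": sequence,
        "timestamp": timestamp,
        "ssrc": ssrc,
        "payload": packet[offset:end],
    }


# Example packet: version 2, marker set, dynamic payload type 96.
example = struct.pack("!BBHII", 0x80, 0x80 | 96, 1000, 90000, 0x12345678) + b"\x65\x88"
fields = parse_rtp(example)
```

     The sequence number lets a receiver detect loss and reordering, and the timestamp is the value a client would typically use when synchronizing audio and video.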
     Function testing was carried out in both an IPv6/IPv4 LAN and an IPv4 public Internet environment. The results indicate that the terminal can receive, decode, and play back video and audio streams continuously. Video plays smoothly and audio is well synchronized. Pan/tilt and lens control and alarms are implemented in software. Because the software follows the COM specification, it can be extended and used as a codec component for secondary development.
