用户名: 密码: 验证码:
H.264/AVC视频编码标准研究及其编码器的优化
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着21世纪的到来,人类进入了一个全新的多媒体时代。作为多媒体中最重要、最具表现力和最复杂的数字视频处理,也随着时代的发展取得了长足的进步。ITU—T和ISO/IEC制定的H.26x和MPEG系列视频编码标准极大的改变了人们的生活。在新世纪,ITU—T和ISO/IEC又一次联合制定和发布了新一代的视频编码国际标准:H.264/AVC。本文深入研究了H.264/AVC的编码原理,并在此基础上对其参考软件编码器进行了优化。
     绪论部分简要介绍了视频压缩编码的基本方法,基于块匹配的混合编码框架的构成以及主要视频编码标准的特点。
     第二章介绍了H.264/AVC的编码原理,对其中的主要功能模块:帧内预测、多种块模式的帧间预测、1/4和1/8像素的运动估计、多参考帧、整数变换、CABAC和环路去块效应滤波器及其包含的新技术进行了详细论述。
     第三章首先分析了H.264/AVC参考软件编码器JM61e的性能,仿真实验显示由于采用了一系列新技术,使得H.264/AVC的压缩性能明显超过了现有其它标准。在相同的主观视频质量下,码率减少50%左右。然后分析了基于Baseline Profile编码器的计算瓶颈和热点,确定了下一步编码器优化的重点。
     最后根据H.264/AVC的特点,采用基于边缘方向信息的帧内预测模式选择快速算法、不对称十字和多六角形网格混合运动搜索快速算法、基于全零块检测的整数变换和运动估计快速算法以及帧间模式选择快速算法对Baseline的编码器进行了优化。有效的提高了编码器的编码效率。
With the coming of the 21st century, man is entering into a brand-new multimedia age. Digital Video Processing, as the most important, expressive and complex one in multimedia, has progressed quite a lot. H.26x and MPEG-series video coding standards made by ITU-T and ISO/IEC have changed the people's lives significantly. In this new age, ITU-T and ISO/IEC join again to establish and release a new generation international video coding standard: H.264/AVC. This dissertation tries to probe the coding principle of H.264/AVC, and optimize the reference codec on its basis.
    The Introduction briefly presents the basic methods of video coding, the structure of block-based hybrid coding frame and the chief characteristics of video coding standards.
    Chapter II introduces the encoding principle of H.264/AVC and elaborates on its main function modules of H.264/AVC and new technologies being adopted: intra prediction, multi-block inter prediction, 1/4 and 1/8 fractional pixel motion estimation, multiple reference frames, integer transform, CABAC and deblock filter.
    Chapter III first analyses the performance of reference software encoder JM61e. Simulation results reveal that the adoption of a series of new technologies makes H.264/AVC achieve much higher coompression efficiency compared to contemporary video coding stanards. Under the same visual quality, bit rate reduces about 50%. Then analyses the bottle neck and hot spot based on Baseline Profile, and determine which emphasis should be placed on next-step optimization of encoder.
    Finally, according to the characteristics of H.264/AVC, apply the Baseline encoder to optimization through fast mode decision for intra prediction based on edge direction information, Unsymmetrical-Cross Multi-Hexagon-Grid motion estimation fast algorithm, integer transform and motion estimation fast algorithm based on all zero coefficient detection, inter prediction mode selection fast algorithm, therefore greatly improve the efficiency of enoder.
引文
[1] A. M. Tekalp, "Digital Video Processing, "Englewood Cliffs: Printice Hall, 1995.
    [2] K. R. Castleman, "Digital image processing," Englewood Cliffs: Printice Hall,1996.
    [3] 沈兰荪,卓力等,“视频编码与低速率传输,”北京:电子工业出版社,2001.
    [4] R. J. Clark, "Transform Coding of Images, "London: Academic Press, 1985.
    [5] 章毓晋,图象工程(上册)——图象处理和分析,北京:清华大学出版社,1999
    [6] N. Ahmed, T. Nataraj an and K. R. Rao, "Discrete Cosine Transform", IEEE Trans.Signal Computer, vol. 23, 1974, pp.90-93.
    [7] E. Feig,and S.Winograd, "Fast algorithms for the discrete cosine transform," IEEE Trans. Signal Processing, vol. 40(9), Sept. 1992, pp. 2174-2193.
    [8] B. Lee, "A New Algorithm to Compute the Discrete Cosine Transform," IEEE Trans. Signal Processing, vol. 32(6), Dee. 1984, pp. 1243-1245.
    [9] J. W. Woods and S. D. O'Neil, "Subband coding of Images," IEEE Trans. Acoust. Speech and Sign., Vol. 34, pp.1278-1288, Oct. 1986
    [10] M. Vetterli, "Multidimensional sub-band coding: Some theory and algorithms," Signal Processing, Vol. 6, pp. 97-112, 1984.
    [11] P. G. Howard and J. S. Vitter, "Arithmetic coding for datacompression," Proceedings of the IEEE, Vol 82(6), pp. 857-865, 1994.
    [12] I. H. Witten, R. M. Neal, and J. G. Cleary, "Arithmetic coding for datacompression.," Communications of the ACM, Vol 30(6), pp.520-540, June 1987.
    [13] J. Jiang, "Novel design of arithmetic coding for data compression, "IEE Proc.Comput. Dig. Tech., Vol 142(6), pp. 419-424, Nov, 1995.
    [14] A. Moffat, R. M. Neal and I. H. Witten, "Arithmetic coding revisited," ACM Trans. on Inf. Systems Vol 16(3), pp. 256-294, July 1998.
    [15] H. Li, M. Novak, and R. Forchheimer, "Fractal-based image sequence compression scheme," Opt. Eng., vol. 32, no. 7, pp. 1588-1595, July 1993.
    [16] M. S. Lazar and L. T. Breton, "Fractal-based image sequence compression scheme," IEEE Trans. Circuits Syst. Video Technol., Vol. 4, pp. 297-308, June 1994.
    [17] J.D. Gibson, T. Berger, T. Looabaugh and et al., "Digital compression for multimedia principles & standards," Copyright1997 by Morgan Kaufmann Publishers, Inc.
    [18] T. S. Huang, S. C. Reddy, and K. Aizawa, "Human facial modeling, analysis, and synthesis for video compression," Proc. SPIE Conf Visual Communications and Image Processing, Vol. 1605, pp. 234-241, Nov. 1991.
    [19] D. Pearson, "Developments in model-based video coding," Proc. IEEE, vol. 83,pp. 892-906, June 1995.
    [20] Yao Wang, Jorn Ostermann and Ya-Qin Zhang "Video Processing and Communications," Pearson Education: Printice Hall, 2002.
    
    
    [21] 陈国斌,“视频编码中质量、复杂度和码率控制,”[博士毕业论文],杭州:浙江大学,2003.
    [22] Borko Furht, Joshua Greenberg, Raymond Westwater, "Motion Estimation Algorithms for Video Compression," Copyright 2000, Kluwer Academic.
    [23] J.Koga, K. Iiunuma, A. Hirani and et al., "Motion compensated interframe coding for video conferencing," in Porc. National Telecommunications Conference, 1981, pp. G5.3.1-5.3.5
    [24] R. Li, B. Zeng and M.L. Liou, "A new three-step search algorithm for block motion estimation," IEEE Trans Circuits Syst. Video Technol., vol. 4(4), Aug. 1994,pp. 438-442
    [25] J. Jain and A. Jain, "Displacement Measurement and its Application in interframe Image Coding," IEEE Trans. Communications. vol. 29(12), Dec. 1981, pp.1799-1808
    [26] R. Srinivasan, K. Rao, "Predictive Coding Based on Efficient Motion Estimation," IEEE Trans. Communications, vol. 33(8), Aug 1985, pp. 888-896
    [27] B. Furth, J. Greeberg, R. Westwater, "Motion estimation algorithms for video compression," Copyright 1997 by Kluwer Academic Publishers
    [28] ISO/IEC JTC1/SC29/WG11, N3324, "Optimization Model: Version 1.0,"Noordwijkerout, 2000.
    [29] ISO/IEC JTC1/SC29/WG11, N3675, "Optimization Model: Version 2.0," La Baule, Oct 2000.
    [30] A.M. Tourapis and M.L. Liou, "New Results on Zonal Based Motion Estimation Algorithms-Advanced Predictive Diamond Zonal Search," IEEE Int. Symp. Circuit Syst. (ISCAS 2001), vol. 5, 2001, pp.183-186.
    [31] ITU-T Recommendation H.261, "Video codec for audiovisual services at p×64 kbit/s," in Proc. COM15R 16-E, March, 1993
    [32] ISO/IEC CD 11172, "Coding of moving pictures and associated audio for digital storage media at up to 1.5Mbits/sec-Part 2: Coding of moving pictures information," Dec. 1991
    [33] ISO/IEC 13818-2, "Information technology-Generic coding of moving pictures and associated audio Part 2: Video," 1995
    [34] ITU-T Recommendation H.263, "Video coding for low bitrate communication," May, 1996
    [35] B. Girod, U.Horn and b. Belzer, "Scalable video with multiscale motion compensation and unequal error protection," Multimedia Communications and Video Coding. New York: Plenum Press, 1996
    [36] ITU-T Recommendation H.263 Version 2, "Video coding for low bitrate communication," Jan. 1998.
    [37] ITU-T Draft for "H. 263++" Annexes U, V, and W to Recommendation H. 263.Nov. 2000.
    [38] ISO/IEC FDIS 14496-2, "Information technology-Generic coding of audio-visual objects Part 2: Visual," Oct. 1998
    
    
    [39] 钟玉琢,王琪,贺玉文.基于对象的多媒体数据压缩编码国际标准——MPEG-4及其校验模型.北京:科学出版社.2000年10月
    [40] ITU-T TSG16. Draft call for proposal for H.26L Video coding[R]. ITU-T VCEG 16,1998
    [41] ITU-T Rec. H. 264 |ISO/IEC 14496-10 AVC, Document JVT-G050, 7th Meeting: Pattaya, Thailand, March, 2003
    [42] Thomas Wiegand, Gary J. Sullivan, Gisle Bjφntegaard, and Ajay Luthra,"Overview of the H.264/AVC Video Coding Standard," IEEE Trans. Circuits Syst. Video Technol., Vol. 13(7), pp. 657-673, July 2003.
    [43] Ralf Schfer, Thomas Wiegand, Heiko Schwarz, "The emerging H.264/AVC Standard" EBU Technical Review, Jan. 2003
    [44] Ye-Kui Wang, Miska M. Hannuksela, Viktor Varsa, "The Error Concealment Feature in the H.26L Test Model" Proc. IEEE International Conference on Image Processing(ICIP'02), Vol. Ⅱ, pp. 729-732, Sept. 2002.
    [45] S.Wenger, T. Stockhammer, "An Overview on the H.264 NAL Concept" Doc.JVT-B028, Geneva, Switzerland, January 2002
    [46] VideoLocus Inc. "AVC Real-Time SD Encoder Demo" Doc. JVT-D023,Klagenfurt, Austria, July 2002
    [47] Anthony Joch, "Demostration of Main Profile Decoder on TI C64x", Doc.JVT-F075, Awaji, JP, Dec, 2002
    [48] Gary Sullivan, "JVT IPR Status Report" Doc. JVT-C110, Fairfax, Virginia, USA, May, 2002
    [49] Thomas Stockhammer, Miska M. Hannuksela, and Thomas Wiegand, "H.264/AVC in Wireless Environments," IEEE Trans. Circuits Syst. Video Technol.,Vol. 13(7),pp. 657-673, July 2003.
    [50] Stephan Wenger, "H.264/AVC Over IP," IEEE Trans. Circuits Syst. Video Technol., Vol. 13(7), pp. 645-656, July 2003.
    [51] Tu-Chih Wang, Hung-Chi Fang, and Liang-Gee Chen, "Low-Delay and Error-Robust Wireless Video Transmission for Video Communications," IEEE Trans. Circuits Syst. Video Technol., Vol. 12(12), pp. 1049-1058, Dec. 2002.
    [52] Yan Lu, Wen Gao, Feng Wu. "Sprite generation for frame-based video coding".In Proc. 2002, IEEE Int. Conf. Image Processing, vol. 1, pp. 437-476. Sep. 2001,
    [53] Yan Lu, Wen Gao, Feng Wu, "High Efficient Sprite Coding With Directional Spatial Prediction," In Proc. 2002, IEEE Int. Conf Image Processing, vol. 1, pp. 201-204. Sep. 2002,
    [54] VideoLocus Inc. "H.264/MPEG-4 AVC Compression Tutorial."http://www.videolocus.com/
    [55] Henrique S Malvar, Antti Hallapuro, Marta Karczewicz, Louis Kerofsky."Low-Complexity Transform and Quantization in H.264/AVC" IEEE Trans. Circuits Syst. Video Technol. Vol. 13(7), pp. 598-603, July2003.
    [56] Detlev Marpe, Heiko Schwarz, and Thomas Wiegand. "Context-based adaptive binary arithmetic coding in the H.264 AVC video compression standard. pdf" IEEE
    
    Trans. Circuits Syst. Video Technol. Vol. 13(7), pp. 598-603, July 2003.
    [57] M.-C. Hong and H.-S. Hahn, "A Loop/Post Filter to Suppress Blocking and Ringing Artifacts for H.26L VideoCodec" In Proc. 2002, IEEE Int. Conf. Image Processing, vol. 1, pp. 940 -947, Jan. 2001.
    [58] JVT Reference Software Version 61e http://bs.hhi.de/~suehring/tml/download
    [59] Sergio Saponara, Carolina Blanch, Kristof Denolf, "Data Transfer and Storage Complexity Analysis of the AVC/JVT Codec on a Tool-by-Tool Basis" Doc. JVT-d138, Klagenfurt, Austria, July, 2002.
    [60] Christophe Clerc, Massimo Ravasi, Marco Mattavelli, "Complexity Evaluation of Different Configurations of JVT Codec" Doc. JVT-G038, Pattaya, Thailand, March,2003.
    [61] Kristof Denolf, Carolina Blanch, "Initial Memory Complexity Analysis of the AVC Codec," Fairfax, Virginia, USA, May, 2002.
    [62] Thomas Wiegand, Bernd Girod, "Lagrange Multiplier Selection in Hybrid Video Coder Control, "In Proc. IEEE International Conference on Image Processing, Thessaloniki, Greece, Sep. 2001.
    [63] Anthony Joch, Faouzi Kossentini, Heiko Schwarz, Thomas Wiegand, Gary J. Sullivan "Performance Comparison of Video Coding Standards using Lagrangian Coder Control" In Proc. IEEE International Conference on Image Processing (ICIP 2002), NY, USA, Sep. 2002.
    [64] JN Zhang, YW He, SQ Yang, YZ Zhong, "Performance and Complexity Joint Optimization for H.264 Video Coding", In ISCAS'2003, May 25-28, 2003,
    [65] Hye-Yeon Cheong, Alexis Tourapis, "Fast Motion Estimation within the H.264 Codec", In Proc. IEEE International Conference on Multimedia & Expo(ICME 2003), Baltimore, USA, July. 2003
    [66] Pen Yin, Hye-Yeon Cheong Tourapis, Alexis Michael Tourapis and Jill Boyce "Fast Mode Decision And Motion Estimation For JVT/H.264", In Proc. IEEE International Conference on Image Processing(ICIP 2003), Barcelona, spain, Sep.2003
    [67] Patrick Ndjiki-Nya, Bela Makai, Aljoscha Smoli, Heiko Schwarz, and Thomas Wiegand, "Improved H.264/AVC Coding using Texture Analysis and Synthesis", In Proc. IEEE International Conference on Image Processing(ICIP 2003), Barcelona, Spain, September 14-17, 2003
    [68] Aljoscha Smoli, Yuriy Vatis, Heiko Schwarz, and Thomas Wiegand, "Improved H.264/AVC Coding Using Long-Term Global Motion Compensation", In Proc. VCIP 2004, SPIE Visual Communications & Image Processing, San Jose, CA, USA, January 2004.
    [69] Bojun Meng, Oscar C, Chi-Wah Wong, Hong-Kwai Lam, "Efficient Intra-Prediction Mode Selection for 4x4 Blocks in H. 264", In Proc. IEEE International Conference on Multimedia & Expo(ICME 2003), Baltimore, USA, July.2003
    [70] Gary Sullivan, "JVT IPR Status Report" Doc. JVT-C 110, Fairfax, Virginia, USA,
    
    May, 2002.
    [71] Tom McMahon, Thomas Wiegand, Gary Sullivan, "Draft Prof. Ext Arnendment," Doc. JVT-I047, San Diego, CA, USA, Sep., 2003.
    [72] Richard Gerber, "The Software Optimization Cookbook, "Copyright@Intel Corporation. 2002.
    [73] Ville Lappalainenl and Timo D. H, "Unified Method for Optimization of Several Video Coding Algorithms on General-Purpose Processors, "In Proc. 2002, IEEE Int. Conf. Information Technology, vol. 1, pp. 431-439, Jan. 2002.
    [74] Bojun Meng, Oscar C. Au, Chi-Wah Wong, "Efficient Intra-Prediction Algorithm in H.264" In Proc. IEEE International Conference on Image Processing(ICIP 2003),Barcelona, spain, Sep. 2003
    [75] Feng PAN, Xiao LIN, Rahardja, "Fast Mode Decision for Intra Prediction" Doc.JVT-G013, Pattaya, Thailand, March, 2003.
    [76] Zhibo Chen, Peng Zhou, Yun He, "Fast Integer Pel and Fractional Pel Motion Estimation for JVT," Doc. JVT-F017, Awaji, JP, Dec. 2002
    [77] Cheng Du, Yun He, Junli Zheng, "PPHPS A Parabolic Prediction-Based, Fast, Half Pixel Search Algorithm for Very Low Bit-rate Moving Picture Coding", IEEE Transaction on CSVT, VOL. 13, NO. 6, pp.514-518, June, 2003
    [78] Yu-Wen Huang, Bing-Yu Hsieh, Tu-Chin Wang, "Analysis and Reduction of Reference Frames for Motion Estimation in MPEG-4 AVC/JVT/H.264," In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing(ICASSP 2003), HK, April. 2003
    [79] Andy Chang, Oscar C.Au, Y. M. Yeung, "A Novel Approach to Fast Multi-Block Motion Estimation for H.264 Video Coding," In Proc. IEEE International Conference on Multimedia & Expo (ICME 2003), Baltimore, USA, July. 2003
    [80] Yu Alice, Lee R, Flynn M. Early detection of all-zero coefficients in H.263.PCS'97, Berlin, Germany, 1997
    [81] 周璇,谭径微,余松煜.“H.263中预先判别全零系数的新方法”上海交通大学学报,vol.32(9),pp.107-109 Sep.1998,
    [82] 王熹微,栗强,崔慧娟,唐昆,“最佳零块判决准则下的运动搜索算法,”清华大学学报(自然科学版).,vol.43(7),pp.938-941 2003.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700