用户名: 密码: 验证码:
基于内容的视频图像非线性缩放技术研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
近年来,提升LED显示屏的播放质量已经成为业内的研究热点。用户在定制LED显示屏时,拥有很高的灵活度,可以自由选择屏幕的尺寸和宽高比。如果屏幕不符合标准比例,则全屏播放视频时,就会产生失真现象。在这种情况下,如何提高显示质量就成为急需解决的问题。
     本文的设计方法,可以在一定程度上解决这一难题。
     作者首先寻找图像中的视觉焦点区域。提出了一种改进的Canny自适应边缘检测算法。改进后的算法,不仅不需要人工输入参数,而且在处理含有噪声的图像时,也能取得良好效果。接下来,又对提取的边缘图进行消除杂散边处理。这样做,可以大幅降低后续图像分割的压力。
     非线性缩放可以看做是等比例的缩放以及非等比例拉伸的组合。而拉伸方向一般是一维空间的。根据这个特点,作者提出了一种简单有效的图像分割算法。该方法能够根据边缘点沿拉伸方向的投影分布情况,迅速把图像分解为背景区域和含目标区域。实验证明,该算方法不仅有效,而且执行时间在整个算法实现过程中,可以忽略不计。
     由于背景区域的放大倍数会大于目标区域,为了避免由于拉伸比例过大,而出现的视觉抽丝现象。作者提出了基于放大倍数的虚拟与插值相结合的背景放大方法。实验表明,该方法在特定图像中,可以取得良好的效果。
     最后,作者采用GPU并行加速技术以满足实时性要求,采用DirectShow视频处理框架实现对视频当前帧的提取。
In recent years, to enhance the display quality of LED panel has become a research hotspot in the industry. The screen size and aspect ratio of LED panel is free to choose by users. It will be distortion, if the screen does not meet the standard ratios in a full-screen play. In this case, how to improve the display quality has become urgent to be solved. The methods in this paper can solve this problem to some extent.
     First, the author finds the visual focus region of image. It is proposed an improved adaptive Canny algorithm. The improved algorithm, not only do not need to manually input parameters, but also can achieve good results when dealing with noisy images. Then, it has suppressed the stray edges from edge image. To do this, it can significantly reduce the pressure of the following image segmentation.
     Non-linear scaling can be seen as a proportional scaling and non-proportional tensile combination. And the tensile direction is always along with one-dimensional space. Based on this characteristic, the author has proposed a simple and effective algorithm for image segmentation. This method is edge points projection based, and it can rapidly segment the image as backgrounds and foregrounds. Experiments have proved that this method is not only effective but also very fast.
     The amplification of background region would be greater than the foreground area. In order to avoid the phenomenon of visual snag when the stretch ratio is too large, the author has proposed a background scaling method which is based on the combination of background virtualization and interpolation amplification. Experiments have proved that it can achieve good results in some images.
     Finally, the author has used a parallel GPU acceleration technology to meet the real-time requirements, and used DirectShow framework to process the video frames.
引文
[1]杨清德.LED及其工程应用[M].北京:人民邮电出版社.2007.
    [2]王宇.LED显示屏的扫描算法[D]:[硕士学位论文].东南大学电路与系统专业.2005.
    [3]李熹霖.谈LED全彩色显示屏的应用和技术指标[J].现代显示.2003.6:15-19.
    [4]郝金刚,梁春军,刘淡宁等.LED产业分析报告[J].现代显示.2006.3:8-15.
    [1]王慧燕.图像边缘检测和图像匹配研究及应用[D]:[博士学位论文].杭州:浙江大学电气工程专业,2003.
    [2]Nalwa V S. A Guided Tour of Computer Vision[M]. Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc,1993
    [3]Robert L G. Machine perception of three-dimensional solids[C]. In:Optical and Electro-Optical Information Processing, Tippett J, et al, eds,1965.159-197
    [4]Sobel I. Camera Models and Machine Perception Stanford AI,1970, Memo 121.
    [5]胡文锦.图像边缘检测方法的研究[D]:[硕士学位论文].北京:北京交通大学计算数学,2009
    [6]Kwon Oh Kyu, Sim Dong Gyu, Park Rae Hong. Robust Hausdorff distance matching algorithoms using pyramidal structures[J]. Pattern Recognition,2001, 34(7):2005-2013
    [7]Prewitt J M S. Object enhancement and extraction[C]. In:Lipkin B. S. and Rosenfeld A.eds. Picture Processing and Psychohistories. NewYork:Academic Press,1970.
    [8]Gonzales R C, Woods R E. Digital Image Processing[M]. Beijing:Publishing House of Electronics Industry,2003
    [9]Kirsch R. Computer determination of the constituent structure of biological image[J]. Computer Biomedical Research,1971(4):315-328
    [10]Jain R, Kasturi R, Schuck B G, Machine Vision[M]. New York:McGraw-Hill, 1995.
    [11]Marr D, Hildreth E C. Theory of Edge Detection[J]. Proc. Royal Soc. London B, 1980(207):187-217.
    [12]Wang X. Laplacian operator-based edge detectors[J]. IEEE Trans,Pattern Analysis and Machine Intelligence,2007(5):886-890.
    [13]Buxton B. Early Image Processing Structural Techniques Motivated by Human Visual Response. University of Surrey,1984
    [14]Clark J J. Authenticating edges produced by zero-crossing algorithm [J]. IEEE Trans on PAMI,1998,11(1):43-57.
    [15]Haralick R M. Digital step edges from zero crossing of second directional derivatives[J]. IEEE Trans on PAMI,1984,6(1):58-68.
    [16]Lu Y, Jain R C. Pattern Analysis and Machine Intelligence[J]. IEEE Transactions, 1989,11(4):337-356.
    [17]周德龙,潘泉.图像模糊边缘检测的改进算法[J].中国图像图形学报,2001,6(4):353-358
    [18]刘宛予,唐志健,朱跃敏,等.新的超声立体视觉数据增强技术[J].光学精密工程,2008,16(9)
    [19]王宁.图像边缘检测及柔性形态学在其中的应用[D]:[硕士学位论文].鞍山:辽宁科技大学,2008
    [20]周灵翔.计算机视觉若干关键问题研究[D]:[博士学位论文].杭州:浙江大学,1997
    [21]Hueckel M. A local visual operator which recognizes edges and lines[J]. Journal of the Association for Computing Machinery,1973,20(4):634-647
    [22]Hueckel M. An operator which locates edges in digitized pictures[J]. Journal of the Association for Computing Machinery,1971,18(1):113-125
    [23]Mallat S, Zhong S. Characterization of signals from multiscale edges[J]. IEEE Trans. PAMI,1992,14(7)
    [24]Wang Yu-ping, Cai.Yuan-long. Construction and properties of B-spline wavelet filters for multiscale edge detection[J]. Proceedings of International Conference on Image Processing. Washington, USA,1995(2):145-148
    [25]Pierre Marie, Dubuisson, Jain Anil K. A Modified Hausdorff Distance for Object Matching[J]. proc. of 12th Int. Conf. on Pattern Recognition. Jerusalem, Israel, 1994:566-568
    [26]Mallat S G. Multifrequency channel decomposition of images and wavelet models[J]. IEEE-ASSP,1989,37(12):2091-2110
    [27]Mallat S, Hwang L. Singularity detection and processing with wavelets[J]. IEEE Trans. IT,1992,38(2)
    [28]杨福生.小波变换的工程分析与应用[M].北京:科学出版社,2000
    [29]宫武鹏,王永仲,黎全.用B样条小波进行图像的多尺度边缘检测[J].红外技术,2000,22(4):15-22
    [30]肖旺新,张雪,黄卫.视频交通图像自适应阈值边缘检测[J].交通运输工程学报,2003,3(4):104-107
    [31]王晖,张基宏.多尺度图像边界提取的小波算法与最优准则[J].深圳大学学报(理工版),1997,14(2):21-25
    [32]凌旭峰,杨杰,吕永.基于小波多尺度分析的X-线头影特征点提取[J].上海交通大学学报,2001,35(9):1350-1354
    [33]Wang X. Moving Window-Based Double Haar Wavelet Transform for Image Processing[J]. IEEE Trans. Image Processing,2006,Vol. IP-15:2771-2779,
    [34]杨选,梁德群,杨万海.边缘检测中Canny算子、二次样条小波算子性能分析[J].通信学报,1999,20(8):47-51
    [35]景晓军,李剑峰,熊玉庆.静止图像的一种自适应平滑滤波算法[J].通信学报,2002,23(10):6-14
    [36]Chao Xu. Analysis of morphological quasi-circular structure elements and skeleton[J]. Acta Electronica Sinica,1999,27(8):78-81
    [37]Haralick R, Zhuang X. Image analysis using mathematical morphology[J]. IEEE Trans. Pattern Analysis and Machine Intelligence,1987,9(4):532-550
    [38]Li Xiang-ji. Mathematical morphological edge detectors for noisy image corrupted by impulses[J]. Journal of Image and Graphics.1998,3(11):903-906
    [39]Soille P. Morphological image analysis:Principles and Applications[M]. Berlin: Springer,1999
    [40]杨述斌,彭复员,张增常.基于形态学的遥感图像全方位边缘检测算法研究[J].遥感信息,2003
    [41]蔡超,刘买利,丁明跃,等.一种基于偏微分方程的形态学边缘检测新方法[J].华中科技大学学报(自然科学版),2003,31(10):1-3
    [42]Moulin P, Liu J. Analysis of Multiresolution image denoising schemes using generalized Gaussian and complexity priors[J]. IEEE Trans., Information Theory, 1999(41):909-919
    [43]Wang X. Laplacian operator-based edge detectors[J]. IEEE Trans. Pattern Analysis and Machine Intelligence,2007(5):886-890
    [44]Donoho D. De-noising by Soft Thresholding[J]. IEEE Trans. Information Theory, 1995(41):613-627
    [45]Arceand G. R. Forst RE. On Detail Preserving Rank-Order Filters for Image Processing[J]. IEEE Trans. Acoustic, Speech, and Signal Processing,1989(37): 83-98
    [46]王爽,黄友锐,李冬.基于阈值优化的图像模糊边缘检测快速算法[J].安徽理工大学学报(自然科学版),2008,28(1):49-53
    [47]冯斌,王璋,孙俊.基于量子粒子群算法的Ostu图像阈值分割[J].计算机工程与设计,2008,29(13):3429-3434
    [48]张颖,陈雪波,王宁.粒子群优化算法在柔性边缘检测方法选取中的应用[J].清华大学学报(自然科学版),2007,47(S2):1751-1755
    [49]谭东宁,谭东汉.小样本机器学习理论:统计学习理论[J].南京理工大学学报,2001,25(1)
    [50]侯志强,韩崇昭,郑林,等.一种基于视觉模型的边缘检测阈值选择策略[J].光电工程,2004,31(2):59-62
    [51]Canny.[EB/OL].http://zh.wikipedia.org/wiki/Canny%E7%AE%97%E5%AD%90
    [52]Canny John. A Computational Approach to Edge Detection[J]. IEEE Trans. on PAML,1988,8(6)
    [53]王文娟,韩峰,崔桐.一种基于模糊增强的Canny边缘检测方法[J].内蒙古工业大学学报,2008,27(1):65-69
    [54]何文浩,原魁,邹伟.自适应阈值的边缘检测算法及其硬件实现[J].系统工程与电子技术,2009,31(1):233-237
    [55]雒涛,郑喜凤,丁铁夫,改进的自适应阈值Canny边缘检测算法[J],光电工程,2009.11
    [56]Luo Tao, Zheng Xifeng, Ding Tiefu, Self-adaptive Threshold Canny Operator in Color Image Edge Detection, The 2nd International Congress on Image and Signal Processing(CISP'09), Tianjin. China. Oct.17-19,2009.
    [57]林玉池,崔彦平,黄银国.复杂背景下边缘提取与目标识别方法研究[J].光学精密工程,2006,14(3):509-514
    [58]Sheikh H R, Wang Z, Cormack L, Bovik A C. LIVE Image Quality Assessment Database Release 2[EB/OL]. http://live.ece.utexas.edu/research/quality,
    [59]Koschan A. A Comparative Study on Color Edge Detection[J]. Reprint from Proceedings 2nd Asian Conference on Computer Vision ACCV'95.1995(Ⅲ): 574-578
    [60]刘明艳.边缘的检测及细化研究[D]:[硕士学位论文].曲阜:曲阜师范大学,2007
    [61]Suzuki K, Horiba I, Sugie N. Neural edge enhancer for supervised edge enhancement from noisy images[J]. IEEE Trans. Pattern Analysis and Machine Intelligence,2003,25(12):1582-1596
    [62]王小鹏,郑玉甫.一种图像噪声的形态学多尺度去除方法[J].计算机工程,2006,32(4):208-2 10
    [63]吕俊白.一种有效的二值图像细化算法[J].计算机工程,2003,29(18):147-148
    [64]崔凤奎,王晓强,张丰收,等.二值图像细化算法的比较与改进[J].洛阳工学院学报,1997,18(4):48-52
    [65]王家隆,郭成安.一种改进的图像模板细化算法[J].中国图象图形学报,2004,9(3):297-301
    [66]林春蔚,C环境下地图图像矢量化及图形编辑技术与实例[M].北京:海洋出版社,1993
    [67]Chin R T., Wan H K, Stover D L, et al. A one pass thinning algorithm and its parallel implementation[J]. Computer Vision Graphics Image Processing,1987, 40(1):30-40
    [68]马建波.C语言图像处理程序集[M].北京:海洋出版社,1992
    [69]Zhang T Y., Suen C Y. A fast thinning algorithm for Thinning Digital patterns[J]. Communications of ACM,1984,27(3):236-239
    [70]Hall R W. Fast Parallel Thinning Algorithms:Parallel Speed and Connective Preservation[J]. Communication of the ACM,1989,32(1):124-131
    [71]贾瑜,饶建辉.一种对文字图像细化的改进Hilditch算法研究[J].武汉工业学院学报,2006,25(3):37-39
    [72]Chin R T., Wan H K, Stover D L, et. al. A one pass thinning algorithm and its parallel implementation[J]. Computer Vision Graphics Image Processing,1987, 40(1):30-40
    [1]林媛.图像缩放算法研究及其FPGA实现[D]:[硕士学位论文].厦门:厦门大学.2006.
    [2]万睿.LED全彩色大屏幕关键技术的应用研究[D]:[硕士学位论文].重庆:重庆大学仪器科学与技术.2008
    [3]袁小燕.浅谈LED大屏幕及其发展[J].中小企业管理与科技.2009.6:295-296.
    [4]Li Xin, Orchard M T. New edge-directed interpolation[J]. IEEE Transactions on Image Processing.2001.10(10):1521-1527.
    [5]张道兵,陆见微,张惠等.结合线性变换和非线性变换的放大算法研究[J].光子学报.2006.35(6):957-960.
    [6]胡敏,檀结庆.保持轮廓清晰的有理-线性彩色图像内插放大方法[J].系统仿真学报.2004.12:2857-2863.
    [7]杨健君.数字图像放大技术的研究与电视墙图像分配器的研制[D]:[博士学位论文].成都:电子科技大学.2002.
    [8]张阿珍,刘政林,邹雪城等.基于双三次插值算法的图像缩放引擎的设计[J].微电子学与计算机.2007.24(1):49-51.
    [9]Erik Meijering, Michael Unser. A note on cubic convolution interpolation[J]. IEEE Transactions on Image Processing.2003. Vol.12(4):477-479.
    [10]Wilipedia. Lanczos algorithm [EB/OL]. http://en.wikipedia.org/wiki/Lanczos_algorithm.
    [11]Jane K. Cullum, Ralph A. Willoughby. Lanczos Algorithms for Large Symmetric Eigenvalue Computations[M]. Classics in Applied Mathematics(41).2002.
    [12]Wu Kesheng, Horst Simon. Thick-Restart Lanczos Method for Large Symmetric Eigenvalue Problems(M). SIAM.2000.
    [13]龚奕刚.图像放大算法的研究[D]:[硕士学位论文].无锡: 江南大学计算机应用专业.2008.
    [14]孙培建.基于加权平均插值曲面的图像放大方法研究[D]:[硕士学位论文].济南:山东大学计算软件与理论专业.2006.
    [15]PixelWorks. DNX[EB/OL]. http://www.pixelworks.com/dnx/flash.html
    [16]窦延平,张同珍,姜丽红等.数据结构与算法(C++)[M].上海:上海交通大学出版社.2005.
    [17]长春希达电子技术有限公司.LED[EB/OL]. http://www.ccxida.com.
    ■[1]薛一波,王建中.并行处理中加速比的研究[J].计算机工程与设计.1994.
    16(1):11-16
    [2]胡越明.计算机系统结构[M].北京:北京航空航天大学出版社.2007.1-25.[3]谢超,麦联叨,都志辉等.关于并行计算系统中加速比的研究与分析[J].计
    算机工程与应用.2003.39(26):66-68.[4]廖晓群,蔺丽华.DSP并行处理系统设计中的几个关键问题[J].西安科技学
    院学报.2001.21(3):227-230.[5]都志辉.高性能计算并行编程技术-MPI并行程序设计[M].北京:清华大学
    出版社.2001.35-75.[6]彭绍鑫.基于GPU的硬件加速方法及其在积分方程数值解中的应用[D]:[硕
    [7]Whitepaper. NVIDIA's Next Generation CUDATM Compute Architecture:士学位论文].成都:电子科技大学电磁场与微波技术专业.2008.FermiTM[R]. NVIDIA Corporation.2009.1-21.
    [8]Guobin Shen, Guang-Ping Gao, Shipeng Li, et al. Accelerate Video Decoding With Generic GPU[J]. IEEE Transactions on Circuits and System for Video Technology.2005. Vol.15:685-693.
    [9]Strengert, M., Kraus, M., Ertl, T. Pyramid methods in gpu-based image processing. In:Proceedings Vision, Modeling, and Visualization 2006.169-176.
    [10]Cg Toolkit User's Manual A Developer's Guide to Programmable Graphics. Release 1.4, Sept.2005.
    [11]NVIDIA. NVIDIA CUDA计算统一设备架构(第二版)[M].上海.2008.
    [12]K. Moreland, E. Angel, The FFT on a GPU[J]. In SIGGRAPH/Eurographics Workshop on Graphics Hardware 2003 Proceedings,2003 July:112-119
    [1]M. Chen. Design of a Virtual Auditorium[C]. Proceedings of ACM Multimedia 2001, Ottawa, Canada, Sept.2001,109-111.
    [2]David Oboril, Miroslav Balik, Jiri Schimmel, et al. Modeling Digital Musical Effects For Signal Processors, Based on Real Effect Manifestation Analysis[C]. Proceedings of the COST G-6 Conference on Digital Audio Effects(DAFX-00), Verona, Italy, Dec.7-9,2009.
    [3]Pesce, Mark D. Programming Microsoft DirectShow for Digital Video and Television[C]. Microsoft Press.2003. ISBN 0-7356-1821-6.
    [4]陆其明,DirectShow开发指南[M].北京:清华大学出版社.2003
    [5]袁红亮.基于DirectShow的流媒体实时传输的研究与实现[D]:[硕士学位论文].大连:大连理工大学,2006.
    [6]吴潜禄,基于集群的大屏幕高清晰度视频播放系统的研究与实现[D]:[硕士学位论文].浙江:浙江大学,2006.
    [7]Luo Tao, Ding Tiefu, DirectShow Application in LED Panel Display System[C], International Conference on Information Technolgy and Computer Science(ITCS'09), Ukraine,July 25-26,2009.
    [8]张明华,谢琦,梅海彬.DirectShow中过滤器图的定制方法及应用[J].计算机工程.2004.30(7).141-143.
    [9]徐燃,潘接林.DirectShow技术在音频格式转换及音视频分离方面的应用[J].计算机应用.2005.25.387-390.
    [10]吴潜禄.基于集群的大屏幕高清晰度视频播放系统的研究与实现[D]:[硕士学位论文].浙江:浙江大学,2006
    [11]陆其明.DirectShow实务精选[M].北京:科学出版社.2004.
    [12]Wikipedia. DirectShow. [EB/OL]. http://en.wikipedia.org/wiki/DirectShow

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700