CLC number: TN919.8
On-line Access: 2024-08-27
Received: 2023-10-17
Revision Accepted: 2024-05-08
Crosschecked: 2009-11-09
Cited: 4
Clicked: 9286
Lu YU, Jian-peng WANG. Review of the current and future technologies for video compression[J]. Journal of Zhejiang University Science C, 2010, 11(1): 1-13.
@article{title="Review of the current and future technologies for video compression",
author="Lu YU, Jian-peng WANG",
journal="Journal of Zhejiang University Science C",
volume="11",
number="1",
pages="1-13",
year="2010",
publisher="Zhejiang University Press & Springer",
doi="10.1631/jzus.C0910684"
}
%0 Journal Article
%T Review of the current and future technologies for video compression
%A Lu YU
%A Jian-peng WANG
%J Journal of Zhejiang University SCIENCE C
%V 11
%N 1
%P 1-13
%@ 1869-1951
%D 2010
%I Zhejiang University Press & Springer
%DOI 10.1631/jzus.C0910684
TY - JOUR
T1 - Review of the current and future technologies for video compression
A1 - Lu YU
A1 - Jian-peng WANG
J0 - Journal of Zhejiang University Science C
VL - 11
IS - 1
SP - 1
EP - 13
%@ 1869-1951
Y1 - 2010
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/jzus.C0910684
Abstract: Many important developments in video compression technologies have occurred during the past two decades. The block-based discrete cosine transform with motion compensation hybrid coding scheme has been widely employed by most available video coding standards, notably the ITU-T H.26x and ISO/IEC MPEG-x families and video part of China audio video coding standard (AVS). The objective of this paper is to provide a review of the developments of the four basic building blocks of hybrid coding scheme, namely predictive coding, transform coding, quantization and entropy coding, and give theoretical analyses and summaries of the technological advancements. We further analyze the development trends and perspectives of video compression, highlighting problems and research directions.
[1] Ahmed, N., Natarajan, T., Rao, K.R., 1974. Discrete cosine transform. IEEE Trans. Comput., C-23(1):90-93.
[2] Boyce, J.M., 2004. Weighted Prediction in the H.264/MPEG AVC Video Coding Standard. Proc. Int. Symp. on Circuits and Systems, 3:789-792.
[3] Chen, P., Ye, Y., Karczewicz, M., 2008. Video Coding Using Extended Block Sizes. ITU-T Q.6/SG16 VCEG, VCEG-AJ23, San Diego, USA.
[4] Cover, T.M., Thomas, J.A., 2003. Elements of Information Theory. Tsinghua University Press, Beijing, China, p.234-237.
[5] Girod, B., 1987. The efficiency of motion-compensating prediction for hybrid coding of video sequences. IEEE J. Sel. Areas Commun., 5(7):1140-1154.
[6] Girod, B., Flierl, M., 2002. Multi-Frame Motion-Compensated Video Compression for the Digital Set-Top Box. Int. Conf. on Image Processing, 2:1-4.
[7] Guo, X., Huang, Y., Lei, S., 2009. Ordered Entropy Slices for Parallel CABAC. ITU-T Q.6/SG16 VCEG, VCEG-AK25, Yokohama, Japan.
[8] Guo, Y., Wang, Y., Li, H., 2008. Priority-Based Template Matching Intra Prediction. IEEE Int. Conf. on Multimedia and Expo, p.1117-1120.
[9] Hinds, A.T., Reznik, Y.A., Yu, L., Ni, Z., Zhang, C., 2007. Drift analysis for integer IDCT. SPIE, 6696: Article 14, p.1-16.
[10] Huffman, D., 1952. A method for the construction of minimum redundancy codes. Proc. IRE, 40(9):1098-1101.
[11] ISO/IEC JTC 1, 1993. Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to About 1.5 Mbits/s-Part 2: Video. ISO/IEC 11172-2 (MPEG-1 Part 2). Geneva, Switzerland.
[12] ISO/IEC JTC 1, 1999. Coding of Audio-Visual Objects-Part 2: Visual. ISO/IEC 14496-2 (MPEG-4 Part 2). Geneva, Switzerland.
[13] ISO/IEC JTC 1, 2008. Fixed-Point 8×8 Inverse Discrete Cosine Transform and Discrete Cosine Transform. Information Technology-MPEG Video Technologies-Part 2: ISO/IEC 23002-2. Geneva, Switzerland.
[14] ITU-T, 1993. Video Codec for Audiovisual Services at px64 kbits/s. ITU-T Rec. H.261. Geneva, Switzerland.
[15] ITU-T, 2000. Video Coding for Low Bit Rate Communication. ITU-T Rec. H.263. Geneva, Switzerland.
[16] ITU-T and ISO/IEC, 1992. Digital Compression and Coding of Continuous-Tone Still Images. ITU-T Rec. T.81 and ISO/IEC 10918-1. Geneva, Switzerland.
[17] ITU-T and ISO/IEC JTC 1, 1994. Generic Coding of Moving Pictures and Associated Audio Information-Part 2: Video. ITU-T Rec. H.262 and ISO/IEC 13818-2 (MPEG-2 Part 2). Geneva, Switzerland.
[18] ITU-T and ISO/IEC JTC 1, 2000. JPEG2000 Image Coding System. ITU-T Rec. T.800 and ISO/IEC 15444-1. Geneva, Switzerland.
[19] Jain, J.R., Jain, A.K., 1981. Displacement measurement and its application in interframe image coding. IEEE Trans. Commun., 29(12):1799-1808.
[20] Jayant, N.S., Noll, P., 1984. Digital Coding of Waveforms. Prentice-Hall, Englewood Cliffs, New Jersey, p.62-64, 524-546.
[21] Jiang, W., Wang, J., Sun, J., 2005. Rate-distortion based quantization level adjustment for H.264. Electron. Lett., 41(16):903.
[22] Kamp, S., Evertz, M., Wien, M., 2008. Decoder Side Motion Vector Derivation for Inter Frame Video Coding. 15th IEEE Int. Conf. on Image Processing, p.1120-1123.
[23] Kamp, S., Bross, B., Wien, M., 2009. Fast Decoder Side Motion Vector Derivation for Inter Frame Video Coding. Picture Coding Symp., p.1-4.
[24] Karczewicz, M., Nieweglowski, J., Lainema, J., Kalevo, O., 1996. Video Coding Using Motion Compensation with Polynomial Motion Vector Fields. 1st Int. Workshop on Wireless Image/Video Communications, p.26-31.
[25] Karczewicz, M., Ye, Y., Chong, I., 2008. Rate Distortion Optimized Quantization. ITU-T Q.6/SG16 VCEG, VCEG-AH21, Antalya, Turkey.
[26] Kauff, P., Makai, B., Rauthenberg, S., Golz, U., de Lameillieure, J.L.P., Sikora, T., 1997. Functional coding of video using a shape-adaptive DCT algorithm and object-based motion prediction toolbox. IEEE Trans. Circ. Syst. Video Technol., 7(1):181-196.
[27] Kim, J., Na, T., Kim, C., Lee, B., Kim, M., 2008. Enlarging MB Size for High Fidelity Video Coding Beyond HD. ITU-T Q.6/SG16 VCEG, VCEG-AJ21, San Diego, USA.
[28] Lee, D.T., 2005. JPEG 2000: retrospective and new developments. Proc. IEEE, 93(1):32-41.
[29] Malvar, H.S., Hallapuro, A., Karczewicz, M., Kerofsky, L., 2003. Low-complexity transform and quantization in H.264/AVC. IEEE Trans. Circ. Syst. Video Technol., 13(7):598-603.
[30] Marpe, D., Schwarz, H., Wiegand, T., 2003. Context-based adaptive binary arithmetic coding in the H.264/AVC video compression standard. IEEE Trans. Circ. Syst. Video Technol., 13(7):620-636.
[31] Narroschke, M., 2006. Extending H.264/AVC by an Adaptive Coding of the Prediction Error. The 25th Picture Coding Symp., O5-3.
[32] Ortega, A., Ramchandran, K., 1998. Rate-distortion methods for image and video compression. IEEE Signal Process. Mag., 15(6):23-50.
[33] Ostermann, J., Narroschke, M., 2006. Motion Compensated Prediction with 1/8-Pel Displacement Vector Resolution. ITU-T Q.6/SG16 VCEG, VCEG-AD09. Hangzhou, China.
[34] Ray, W., Driver, R.M., 1970. Further decomposition of the Karhunen-Loève series representation of a stationary random process. IEEE Trans. Inf. Theory, 16(6):663-668.
[35] Rusanovskyy, D., Ugur, K., Gabbouj, M., Lainema, J., 2008. Video Coding with Pixel-Aligned Directional Adaptive Interpolation Filters. IEEE Int. Symp. on Circuits and Systems, p.704-707.
[36] Rusanovskyy, D., Ugur, K., Hallapuro, A., Lainema, J., Gabbouj, M., 2009. Video coding with low-complexity directional adaptive interpolation filters. IEEE Trans. Circ. Syst. Video Technol., 19(8):1239-1243.
[37] Schwarz, H., Marpe, D., Wiegand, T., 2006. Analysis of Hierarchical B Pictures and MCTF. IEEE Int. Conf. on Multimedia and Expo, p.1929-1932.
[38] Segall, A., Zhao, J., 2008. Entropy Slices for Parallel Entropy Decoding. ITU-T SGI 6/Q.6 Doc. COM16-C405. Geneva, Switzerland.
[39] Shannon, C.E., 1948. A mathematical theory of communication. Bell Syst. Techn. J., 27:379-423, 623-656.
[40] Shannon, C.E., 1959. Coding Theorems for a Discrete Source with a Fidelity Criterion. IRE National Convention Record, Part 4, p.142-163.
[41] Shiodera, T., Tanizawa, A., Chujoh, T., 2007. Block Based Extra/Inter-Polating Prediction for Intra Coding. IEEE Int. Conf. on Image Processing, 6:445-448.
[42] Smolic, A., Makai, B., Sikora, T., 1999a. Real-time estimation of long-term 3-D motion parameters for SNHC face animation and model-based coding applications. IEEE Trans. Circ. Syst. Video Technol., 9(2):255-263.
[43] Smolic, A., Sikora, T., Ohm, J.R., 1999b. Long-term global motion estimation and its application for sprite coding, content description, and segmentation. IEEE Trans. Circ. Syst. Video Technol., 9(8):1227-1242.
[44] Sullivan, G.J., Sun, S., 2005. On dead-zone plus uniform threshold scalar quantization. SPIE, 5960: Article 33, p.1-14.
[45] Sullivan, G.J., Wiegand, T., 1998. Rate-distortion optimization for video compression. IEEE Signal Process. Mag., 15(6):74-90.
[46] Sze, V., Demircin, M.U., Budagavi, M., 2008. CABAC Throughput Requirements for Real-Time Decoding. ITU-T Q.6/SG16 Doc. VCEG-AJ31. San Diego, USA.
[47] Tan, K.T., Ghanbari, M., 2000. A multi-metric objective picture-quality measurement model for MPEG video. IEEE Trans. Circ. Syst. Video Technol., 10(7):1208-1213.
[48] Tan, T.K., Boon, C.S., Suzuki, Y., 2006. Intra Prediction by Template Matching. IEEE Int. Conf. on Image Processing, p.1693-1696.
[49] Tsukuba, T., Yamamoto T., Tokumo Y., Aono T., 2007. Adaptive Multidirectional Intra Prediction. ITU-T Q.6/SG16 VCEG, VCEG-AG05, Shenzhen, China.
[50] Ugur, K., Lainema, J., Gabbouj, M., 2007. Adaptive Interpolation Filter with Flexible Symmetry for Coding High Resolution High Quality Video. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, p.1013-1016.
[51] van den Branden Lambrecht, C.J., Verscheure, O., 1996. Perceptual quality measure using a spatiotemporal model of the human visual system. SPIE, 2668:450-461.
[52] Vatis, Y., Ostermann, J., 2006. Locally Adaptive Non-Separable Interpolation Filter for H.264-AVC. IEEE Int. Conf. on Image Processing, p.33-36.
[53] Vatis, Y., Ostermann, J., 2009. Adaptive interpolation filter for H.264/AVC. IEEE Trans. Circ. Syst. Video Technol., 19(2):179-192.
[54] Vatis, Y., Edler, B., Nguyen, D.T., Ostermann, J., 2005. Motion- and Aliasing-Compensated Prediction Using a Two-Dimensional Non-Separable Adaptive Wiener Interpolation Filter. ICIP IEEE Int. Conf. on Image Processing, 2:894-897.
[55] Vetterli, M., Kovacevic, J., 1995. Wavelets and Subband Coding. Prentice-Hall, Englewood Cliffs, New Jersey, p.414-464.
[56] Wedi, T., 2002. Adaptive Interpolation Filter for Motion Compensated Prediction. Int. Conf. on Image Processing, p.509-512.
[57] Wedi, T., 2006. Adaptive interpolation filters and high-resolution displacements for video coding. IEEE Trans. Circ. Syst. Video Technol., 16(4):484-491.
[58] Wen, J., Luttrell, M., Villasenor, J., 2000. Trellis-based R-D optimal quantization in H.263+. IEEE Trans. Image Process., 9(8):1431-1434.
[59] Wiegand, T., Zhang, X., Girod, B., 1999. Long-term memory motion-compensated prediction. IEEE Trans. Circ. Syst. Video Technol., 9(1):70-84.
[60] Wiegand, T., Sullivan, G.J., Bjontegaard, G., Luthra, A., 2003a. Overview of the H.264/AVC video coding standard. IEEE Trans. Circ. Syst. Video Technol., 13(7):560-576.
[61] Wiegand, T., Schwarz, H., Joch, A., Kossentini, F., Sullivan, G.J., 2003b. Rate-constrained coder control and comparison of video coding standards. IEEE Trans. Circ. Syst. Video Technol., 13(7):688-703.
[62] Wien, M., 2003. Variable block-size transforms for H.264/AVC. IEEE Trans. Circ. Syst. Video Technol., 13(7):604-613.
[63] Wittmann, S., Wedi, T., 2008. Separable Adaptive Interpolation Filter for Video Coding. 15th IEEE Int. Conf. on Image Processing, p.2500-2503.
[64] Won, K., Yang, J., Jeon, B., 2009. Motion Vector Coding Using Decoder-Side Estimation of Motion Vector. IEEE Int. Symp. on Broadband Multimedia Systems and Broadcasting, p.1-4.
[65] Wu, H., Yu, Z., Winkler, S., Chen, T., 2001. Impairment Metrics for MC/DPCM/DCT Encoded Digital Video. 22nd Picture Coding Symp., p.129-131.
[66] Ye, Y., Karczewicz, M., 2008. Improved H.264 Intra Coding Based on Bi-Directional Intra Prediction, Directional Transform, and Adaptive Coefficient Scanning. 15th IEEE Int. Conf. on Image Processing, p.2116-2119.
[67] Yu, L., Chen, S., Wang, J., 2009. Overview of AVS video coding standards. Signal Process.: Image Commun., 24(4):247-262.
[68] Zhang, C., Yu, L., Lou, J., Cham, W., Dong, J., 2008. The technique of prescaled integer transform: concept, design and applications. IEEE Trans. Circ. Syst. Video Technol., 18(1):84-97.
[69] Zheng, Y., Yin, P., Escoda, O.D., Li, X., Gomila, C., 2008. Intra Prediction Using Template Matching with Adaptive Illumination Compensation. 15th IEEE Int. Conf. on Image Processing, p.125-128.
Open peer comments: Debate/Discuss/Question/Opinion
<1>
Lee<toanny@126.com>
2010-01-29 10:33:11
What a good review on video coding, multimedia communication!Worth reading very much!