CLC number: TP391.41
On-line Access: 2024-08-27
Received: 2023-10-17
Revision Accepted: 2024-05-08
Crosschecked: 2009-10-18
Cited: 13
Clicked: 6914
Kai LUO, Dong-xiao LI, Ya-mei FENG, Ming ZHANG. Depth-aided inpainting for disocclusion restoration of multi-view images using depth-image-based rendering[J]. Journal of Zhejiang University Science A, 2009, 10(12): 1738-1749.
@article{title="Depth-aided inpainting for disocclusion restoration of multi-view images using depth-image-based rendering",
author="Kai LUO, Dong-xiao LI, Ya-mei FENG, Ming ZHANG",
journal="Journal of Zhejiang University Science A",
volume="10",
number="12",
pages="1738-1749",
year="2009",
publisher="Zhejiang University Press & Springer",
doi="10.1631/jzus.A0820806"
}
%0 Journal Article
%T Depth-aided inpainting for disocclusion restoration of multi-view images using depth-image-based rendering
%A Kai LUO
%A Dong-xiao LI
%A Ya-mei FENG
%A Ming ZHANG
%J Journal of Zhejiang University SCIENCE A
%V 10
%N 12
%P 1738-1749
%@ 1673-565X
%D 2009
%I Zhejiang University Press & Springer
%DOI 10.1631/jzus.A0820806
TY - JOUR
T1 - Depth-aided inpainting for disocclusion restoration of multi-view images using depth-image-based rendering
A1 - Kai LUO
A1 - Dong-xiao LI
A1 - Ya-mei FENG
A1 - Ming ZHANG
J0 - Journal of Zhejiang University Science A
VL - 10
IS - 12
SP - 1738
EP - 1749
%@ 1673-565X
Y1 - 2009
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/jzus.A0820806
Abstract: A new algorithm is proposed for restoring disocclusion regions in depth-image-based rendering (DIBR) warped images. Current solutions include layered depth image (LDI), pre-filtering methods, and post-processing methods. The LDI is complicated, and pre-filtering of depth images causes noticeable geometrical distortions in cases of large baseline warping. This paper presents a depth-aided inpainting method which inherits merits from Criminisi’s inpainting algorithm. The proposed method features incorporation of a depth cue into texture estimation. The algorithm efficiently handles depth ambiguity by penalizing larger Lagrange multipliers of filling points closer to the warping position compared with the surrounding existing points. We perform morphological operations on depth images to accelerate the algorithm convergence, and adopt a luma-first strategy to adapt to various color sampling formats. Experiments on test multi-view sequence showed that our method has superiority in depth differentiation and geometrical loyalty in the restoration of warped images. Also, peak signal-to-noise ratio (PSNR) statistics on non-hole regions and whole image comparisons both compare favorably to those obtained by state of the art techniques.
[1] Akar, G.B., Tekalp, A.M., Fehn, C., Civanlar, M.R., 2007. Transport methods in 3DTV: a survey. IEEE Trans. Circ. Syst. Video Technol., 17(11):1622-1630.
[2] Chen, W.Y., Chang, Y.L., Lin, S.F., Ding, L.F., Chen, L.G., 2005. Efficient Depth Image Based Rendering with Edge Dependent Depth Filter and Interpolation. IEEE Int. Conf. on Multimedia and Expo, p.1314-1317.
[3] Criminisi, A., Perez, P., Toyama, K., 2004. Region filling and object removal by exemplar-based image inpainting. IEEE Trans. Image Process., 13(9):1200-1212.
[4] Daribo, I., Tillier, C., Pesquet-Popescu, B., 2007. Distance Dependent Depth Filtering in 3D Warping for 3DTV. IEEE 9th Workshop on Multimedia Signal Processing, p.312-315.
[5] Fehn, C., 2004. Depth-Image-Based Rendering (DIBR), Compression, and Transmission for a New Approach on 3D-TV. SPIE, 5291:93-104.
[6] Fehn, C., Cooke, E., Schreer, O., Kauff, P., 2002. 3D analysis and image-based rendering for immersive TV applications. Signal Process. Image Commun., 17(9):705-715.
[7] Fehn, C., de la Barre, R., Pastoor, S., 2006. Interactive 3-DTV: concepts and key technologies. Proc. IEEE, 94(3):524-538.
[8] Flierl, M., Mavlankar, A., Girod, B., 2007. Motion and disparity compensated coding, for multiview video. IEEE Trans. Circ. Syst. Video Technol., 17(11):1474-1484.
[9] Gonzalez, R.C., Woods, R.E., 2002. Digital Image Processing (2nd Ed.). Prentice Hall, Upper Saddle River, New Jersey, USA.
[10] Gonzalez, R.C., Woods, R.E., Eddins, S.L., 2005. Digital Image Processing Using Matlab. Prentice Hall, Upper Saddle River, New Jersey, USA.
[11] Hartley, R.I., Zisserman, A., 2000. Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge, UK.
[12] Kauff, P., Atzpadin, N., Fehn, C., Muller, M., Schreer, O., Smolic, A., Tanger, R., 2007. Depth map creation and image-based rendering for advanced 3DTV services providing interoperability and scalability. Signal Process. Image Commun., 22(2):217-234.
[13] Lee, J., Kim, C., 2007. Removing Foreground Objects by Using Depth Information from Multi-view Images. SPIE, 6696:669627-9.
[14] McMillan, L., 1997. An Image-based Approach to Three-dimensional Computer Graphics. PhD Thesis, University of North Carolina at Chapel Hill, USA.
[15] Merkle, P., Smolic, A., Muller, K., Wiegand, T., 2007. Efficient prediction structures for multiview video coding. IEEE Trans. Circ. Syst. Video Technol., 17(11):1461-1473.
[16] Shade, J., Gortler, S., He, L.W., Szeliski, R., 1998. Layered Depth Images. 25th Annual Conf. on Computer Graphics and Interactive Techniques, p.231-242.
[17] Smolic, A., Mueller, K., Stefanoski, N., Ostermann, J., Gotchev, A., Akar, G.B., Triantafyllidis, G., Koz, A., 2007. Coding algorithms for 3DTV: a survey. IEEE Trans. Circ. Syst. Video Technol., 17(11):1606-1621.
[18] Tauber, Z., Li, Z.N., Drew, M.S., 2007. Review and preview: disocclusion by inpainting for image-based rendering. IEEE Trans. Syst. Man Cybern. Part C-Appl. Rev., 37(4):527-540.
[19] Vázquez, C., Tam, W.J., Speranza, F., 2006. Stereoscopic Imaging: Filling Disoccluded Areas in Depth Image-based Rendering. SPIE, 6392:63920D-12.
[20] Wang, L., Jin, H., Yang, R., Gong, M., 2008. Stereoscopic Inpainting: Joint Color and Depth Completion from Stereo Images. IEEE Conf. on Computer Vision and Pattern Recognition, p.1-8.
[21] Wang, W., Huo, L., Zeng, W., Huang, Q., Gao, W., 2007. Depth Image Segmentation for Improved Virtual View Image Quality in 3-DTV. Int. Symp. on Intelligent Signal Processing and Communication Systems, p.300-303.
[22] Xu, G., Zhang, Z., 1996. Epipolar Geometry in Stereo Motion and Object Recognition. Kluver Academic Publishers, Dordrecht, The Netherlands.
[23] Yoon, S.U., Ho, Y.S., 2007. Multiple color and depth video coding using a hierarchical representation. IEEE Trans. Circ. Syst. Video Technol., 17(11):1450-1460.
[24] Zhang, L., Tam, W.J., 2005. Stereoscopic image generation based on depth images for 3D TV. IEEE Trans. Broadcast., 51(2):191-199.
[25] Zitnick, C.L., Kang, S.B., Uyttendaele, M., Winder, S., Szeliski, R., 2004. High-quality video view interpolation using a layered representation. ACM Trans. Graph., 23(3):600-608.
Open peer comments: Debate/Discuss/Question/Opinion
<1>