Publishing Service

Polishing & Checking

Frontiers of Information Technology & Electronic Engineering

ISSN 2095-9184 (print), ISSN 2095-9230 (online)

Depth estimation using an improved stereo network

Abstract: Self-supervised depth estimation approaches present excellent results that are comparable to those of the fully supervised approaches, by employing view synthesis between the target and reference images in the training data. ResNet, which serves as a backbone network, has some structural deficiencies when applied to downstream fields, because its original purpose was to cope with classification problems. The low-texture area also deteriorates the performance. To address these problems, we propose a set of improvements that lead to superior predictions. First, we boost the information flow in the network and improve the ability to learn spatial structures by improving the network structures. Second, we use a binary mask to remove the pixels in low-texture areas between the target and reference images to more accurately reconstruct the image. Finally, we input the target and reference images randomly to expand the dataset and pre-train it on ImageNet, so that the model obtains a favorable general feature representation. We demonstrate state-of-the-art performance on an Eigen split of the KITTI driving dataset using stereo pairs.

Key words: Monocular depth estimation; Self-supervised; Image reconstruction

Chinese Summary  <19> 基于改进立体网络的深度估计

徐万朋1,邹玲3,吴玲达1,齐越2,钱昭勇1
1航天工程大学复杂电子系统仿真科学与技术实验室,中国北京市,101416
2鹏城实验室,中国深圳市,518055
3北京电影学院数字媒体学院,中国北京市,100088
摘要:自监督深度估计方法通过在训练数据中利用目标图像和参考图像之间的视角合成,呈现了可以与全监督方法相媲美的结果。然而,作为主干网络的ResNet最初是为了应对分类问题而设计的,在应用于下游领域时存在一些结构上的缺陷。图像中的低纹理区域也使深度估计的效果受到很大影响。为了解决这些问题,本文提出一系列改进,以实现更加有效的深度预测。首先,我们通过改进网络结构来促进网络中的信息流通,并提高学习空间结构的能力。其次,使用二值蒙版去除目标图像和参考图像之间低纹理区域中的像素,以更准确地重建图像。最后,我们随机输入目标图像和参考图像对数据集进行扩充,并在ImageNet上进行预训练,从而使模型获得良好的通用特征表示。我们使用立体图像对作为输入,在KITTI自动驾驶数据集的特征分割上验证了最先进的性能。

关键词组:单目深度估计;自监督;图像重建


Share this article to: More

Go to Contents

References:

<Show All>

Open peer comments: Debate/Discuss/Question/Opinion

<1>

Please provide your name, email address and a comment





DOI:

10.1631/FITEE.2000676

CLC number:

TP391.4

Download Full Text:

Click Here

Downloaded:

4122

Download summary:

<Click Here> 

Downloaded:

319

Clicked:

4716

Cited:

0

On-line Access:

2022-05-19

Received:

2020-12-04

Revision Accepted:

2022-05-19

Crosschecked:

2021-05-26

Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952276; Fax: +86-571-87952331; E-mail: jzus@zju.edu.cn
Copyright © 2000~ Journal of Zhejiang University-SCIENCE