CLC number: TP391.4
On-line Access: 2024-08-27
Received: 2023-10-17
Revision Accepted: 2024-05-08
Crosschecked: 2017-07-11
Cited: 0
Clicked: 6400
Jin Zhang, Zhao-hui Tang, Wei-hua Gui, Qing Chen, Jin-ping Liu. Interactive image segmentation with a regression based ensemble learning paradigm[J]. Frontiers of Information Technology & Electronic Engineering, 2017, 18(7): 1002-1020.
@article{title="Interactive image segmentation with a regression based ensemble learning paradigm",
author="Jin Zhang, Zhao-hui Tang, Wei-hua Gui, Qing Chen, Jin-ping Liu",
journal="Frontiers of Information Technology & Electronic Engineering",
volume="18",
number="7",
pages="1002-1020",
year="2017",
publisher="Zhejiang University Press & Springer",
doi="10.1631/FITEE.1601401"
}
%0 Journal Article
%T Interactive image segmentation with a regression based ensemble learning paradigm
%A Jin Zhang
%A Zhao-hui Tang
%A Wei-hua Gui
%A Qing Chen
%A Jin-ping Liu
%J Frontiers of Information Technology & Electronic Engineering
%V 18
%N 7
%P 1002-1020
%@ 2095-9184
%D 2017
%I Zhejiang University Press & Springer
%DOI 10.1631/FITEE.1601401
TY - JOUR
T1 - Interactive image segmentation with a regression based ensemble learning paradigm
A1 - Jin Zhang
A1 - Zhao-hui Tang
A1 - Wei-hua Gui
A1 - Qing Chen
A1 - Jin-ping Liu
J0 - Frontiers of Information Technology & Electronic Engineering
VL - 18
IS - 7
SP - 1002
EP - 1020
%@ 2095-9184
Y1 - 2017
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/FITEE.1601401
Abstract: To achieve fine segmentation of complex natural images, people often resort to an interactive segmentation paradigm, since fully automatic methods often fail to obtain a result consistent with the ground truth. However, when the foreground and background share some similar areas in color, the fine segmentation result of conventional interactive methods usually relies on the increase of manual labels. This paper presents a novel interactive image segmentation method via a regression-based ensemble model with semi-supervised learning. The task is formulated as a non-linear problem integrating two complementary spline regressors and strengthening the robustness of each regressor via semi-supervised learning. First, two spline regressors with a complementary nature are constructed based on multivariate adaptive regression splines (MARS) and smooth thin plate spline regression (TPSR). Then, a regressor boosting method based on a clustering hypothesis and semi-supervised learning is proposed to assist the training of MARS and TPSR by using the region segmentation information contained in unlabeled pixels. Next, a support vector regression (SVR) based decision fusion model is adopted to integrate the results of MARS and TPSR. Finally, the GraphCut is introduced and combined with the SVR ensemble results to achieve image segmentation. Extensive experimental results on benchmark datasets of BSDS500 and Pascal VOC have demonstrated the effectiveness of our method, and the comparison with experiment results has validated that the proposed method is comparable with the state-of-the-art methods for interactive natural image segmentation.
[1]Adamowski, J., Chan, H.F., Prasher, S.O., et al., 2012. Comparison of multivariate adaptive regression splines with coupled wavelet transform artificial neural networks for runoff forecasting in Himalayan micro-watersheds with limited data. J. Hydroinform., 14(3):731-744.
[2]Balcan, M.F., Blum, A., Yang, K., 2004. Co-training and expansion: towards bridging theory and practice. 17th Int. Conf. on Neural Information Processing Systems, p.89-96.
[3]Blum, A., Mitchell, T., 1998. Combining labeled and unlabeled data with co-training. 11th Annual Conf. on Computational Learning Theory, p.92-100.
[4]Boykov, Y.Y., Jolly, M.P., 2001. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images. 8th IEEE Int. Conf. on Computer Vision, p.105-112.
[5]Boykov, Y.Y., Veksler, O., Zabih, R., 2001. Fast approximate energy minimization via graph cuts. IEEE Trans. Patt. Anal. Mach. Intell., 23(11):1222-1239.
[6]Ding, J.J., Lin, C.J., Lu, I.F., et al., 2015. Real-time interactive image segmentation using improved superpixels. IEEE Int. Conf. on Digital Signal Processing, p.740-744.
[7]Everingham, M., van Gool, L., Williams, C.K., et al., 2010. The Pascal Visual Object Classes (VOC) challenge. Int. J. Comput. Vis., 88(2):303-338.
[8]Friedman, J.H., 1991. Multivariate adaptive regression splines. Ann. Statist., 19(1):1-67.
[9]Fu, Z., Wang, L., Zhang, D., 2014. An improved multi-label classification ensemble learning algorithm. In: Li, S., Liu, C., Wang, Y. (Eds.), Pattern Recognition. Springer Berlin Heidelberg, p.243-252.
[10]Galar, M., Fernandez, A., Barrenechea, E., et al., 2012. A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. IEEE Trans. Syst. Man Cybern. C, 42(4):463-484.
[11]Ge, L., Ju, R., Ren, T., et al., 2015. Interactive RGB-D image segmentation using hierarchical graph cut and geodesic distance. In: Ho, Y.S., Sang, J., Ro, Y.M., et al. (Eds.), Advances in Multimedia Information Processing. Springer International Publishing, p.114-124.
[12]Gulshan, V., Rother, C., Criminisi, A., et al., 2010. Geodesic star convexity for interactive image segmentation. IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, p.3129-3136.
[13]Jian, M., Jung, C., 2016. Interactive image segmentation using adaptive constraint propagation. IEEE Trans. Image Process., 25(3):1301-1311.
[14]Jobst, A.M., Kingston, D.G., Cullen, N.J., et al., 2016. Combining thin-plate spline interpolation with a lapse rate model to produce daily air temperature estimates in a data-sparse alpine catchment. Int. J. Climatol., 37(1):214-229.
[15]Jung, C., Jian, M., Liu, J., et al., 2014. Interactive image segmentation via kernel propagation. Patt. Recogn., 47(8): 2745-2755.
[16]Kolmogorov, V., Zabih, R., 2004. What energy functions can be minimized via graph cuts IEEE Trans. Patt. Anal. Mach. Intell., 26(2):147-159.
[17]Lazaridis, A., Mporas, I., Ganchev, T., et al., 2011. Support vector regression fusion scheme in phone duration modeling. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, p.4732-4735.
[18]Lee, Y.S., Cho, S.B., 2014. Activity recognition with Android phone using mixture-of-experts co-trained with labeled and unlabeled data. Neurocomputing, 126:106-115.
[19]Li, Y., Sun, J., Tang, C.K., et al., 2004. Lazy snapping. ACM Trans. Graph., 23(3):303-308.
[20]Liu, Y., Yu, Y., 2012. Interactive image segmentation based on level sets of probabilities. IEEE Trans. Visual. Comput. Graph., 18(2):202-213.
[21]Martin, D., Fowlkes, C., Tal, D., et al., 2001. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. 8th IEEE Int. Conf. on Computer Vision, p.416-423.
[22]Menon, R., Bhat, G., Saade, G.R., et al., 2014. Multivariate adaptive regression splines analysis to predict biomarkers of spontaneous preterm birth. Acta Obstetr. Gynecol. Scandinav., 93(4):382-391.
[23]Nguyen, T.N.A., Cai, J., Zhang, J., et al., 2012. Robust interactive image segmentation using convex active contours. IEEE Trans. Image Process., 21(8):3734-3743.
[24]Ning, J., Zhang, L., Zhang, D., et al., 2010. Interactive image segmentation by maximal similarity based region merging. Patt. Recogn., 43(2):445-456.
[25]Opitz, D., Maclin, R., 1999. Popular ensemble methods: an empirical study. J. Artif. Intell. Res., 11:169-198.
[26]Pauchard, Y., Fitze, T., Browarnik, D., et al., 2016. Interactive graph-cut segmentation for fast creation of finite element models from clinical CT data for hip fracture prediction. Comput. Methods Biomech. Biomed. Eng., 19(16):1693-1703.
[27]Peng, B., Zhang, L., Zhang, D., 2013. A survey of graph theoretical approaches to image segmentation. Patt. Recogn., 46(3):1020-1038.
[28]Qin, C., Zhang, G., Zhou, Y., et al., 2014. Integration of the saliency-based seed extraction and random walks for image segmentation. Neurocomputing, 129:378-391.
[29]Rother, C., Kolmogorov, V., Blake, A., 2004. GrabCut: interactive foreground extraction using iterated graph cuts. ACM Trans. Graph., 23(3):309-314.
[30]Shahshahani, B.M., Landgrebe, D.A., 1994. The effect of unlabeled samples in reducing the small sample size problem and mitigating the Hughes phenomenon. IEEE Trans. Geosci. Remote Sens., 32(5):1087-1095.
[31]Tang, M., Gorelick, L., Veksler, O., et al., 2013. GrabCut in one cut. IEEE Int. Conf. on Computer Vision, p.1769-1776.
[32]Wang, T., Sun, Q., Ji, Z., et al., 2016. Multi-layer graph constraints for interactive image segmentation via game theory. Patt. Recogn., 55:28-44.
[33]Wang, X.Y., Wang, Q.Y., Yang, H.Y., et al., 2011a. Color image segmentation using automatic pixel classification with support vector machine. Neurocomputing, 74(18): 3898-3911.
[34]Wang, X.Y., Wang, T., Bu, J., 2011b. Color image segmentation using pixel wise support vector machine classification. Patt. Recogn., 44(4):777-787.
[35]Wu, J., Zhao, Y., Zhu, J.Y., et al., 2014. MILCut: a sweeping line multiple instance learning paradigm for interactive image segmentation. IEEE Conf. on Computer Vision and Pattern Recognition, p.256-263.
[36]Xiang, S., Nie, F., Zhang, C., et al., 2009. Interactive natural image segmentation via spline regression. IEEE Trans. Image Process., 18(7):1623-1632.
[37]Xiang, S., Nie, F., Zhang, C., 2010. Semi-supervised classification via local spline regression. IEEE Trans. Patt. Anal. Mach. Intell., 32(11):2039-2053.
[38]Yang, W., Cai, J., Zheng, J., et al., 2010. User-friendly interactive image segmentation through unified combinatorial user inputs. IEEE Trans. Image Process., 19(9):2470-2479.
[39]Zhang, J., Tang, Z., Liu, J., et al., 2016. Recognition of flotation working conditions through froth image statistical modeling for performance monitoring. Miner. Eng., 86: 116-129.
[40]Zhang, W., Goh, A.T., 2016. Evaluating seismic liquefaction potential using multivariate adaptive regression splines and logistic regression. Geomech. Eng., 10(3):269-284.
[41]Zhang, Y., Song, H., Gu, J., et al., 2010. Interactive object extraction using hierarchical graph cuts. Int. Conf. on Audio Language and Image Processing, p.851-858.
[42]Zhang, Y., Wen, J., Wang, X., et al., 2014. Semi-supervised learning combining co-training with active learning. Expert Syst. Appl., 41(5):2372-2378.
[43]Zhou, W., Garcia, E.V., 2016. Nuclear image-guided approaches for cardiac resynchronization therapy (CRT). Curr. Cardiol. Rep., 18(1):1-11.
[44]Zhou, W., Hou, X., Piccinelli, M., et al., 2014. 3D fusion of LV venous anatomy on fluoroscopy venograms with epi-cardial surface on SPECT myocardial perfusion images for guiding CRT LV lead placement. JACC Cardiov. Imag., 7(12):1239-1248.
[45]Zhou, Z.H., 2011. When semi-supervised learning meets ensemble learning. Front. Electr. Electron. Eng. China, 6(1): 6-16.
[46]Zhou, Z.H., Li, M., 2005. Semi-supervised regression with co-training. 19th Int. Joint Conf. on Artificial Intelligence, p.908-913.
[47]Zhou, Z.H., Li, M., 2007. Semisupervised regression with cotraining-style algorithms. IEEE Trans. Knowl. Data Eng., 19(11):1479-1493.
Open peer comments: Debate/Discuss/Question/Opinion
<1>