JZUS - Journal of Zhejiang University SCIENCE

Frontiers of Information Technology & Electronic Engineering

ISSN 2095-9184 (print), ISSN 2095-9230 (online)

2020 Vol.21 No.7 P.1005-1018

Representation learning via a semi-supervised stacked distance autoencoder for image classification

Liang Hou, Xiao-yi Luo, Zi-yang Wang, Jun Liang

College of Control Science and Engineering, Zhejiang University, Hangzhou 310027, China

jliang@zju.edu.cn

Abstract: Image classification is an important application of deep learning. In a typical classification task, the classification accuracy is strongly related to the features that are extracted via deep learning methods. An autoencoder is a special type of neural network, often used for dimensionality reduction and feature extraction. The proposed method is based on the traditional autoencoder, incorporating the “distance” information between samples from different categories. The model is called a semi-supervised distance autoencoder. Each layer is first pre-trained in an unsupervised manner. In the subsequent supervised training, the optimized parameters are set as the initial values. To obtain more suitable features, we use a stacked model to replace the basic autoencoder structure with a single hidden layer. A series of experiments are carried out to test the performance of different models on several datasets, including the MNIST dataset, street view house numbers (SVHN) dataset, German traffic sign recognition benchmark (GTSRB), and CIFAR-10 dataset. The proposed semi-supervised distance autoencoder method is compared with the traditional autoencoder, sparse autoencoder, and supervised autoencoder. Experimental results verify the effectiveness of the proposed model.

Key words: Autoencoder, Image classification, Semi-supervised learning, Neural network

Chinese Summary <51> 半监督堆叠距离自动编码器的表征学习在图像分类上的应用

侯亮，罗潇逸，汪子扬，梁军
浙江大学控制科学与工程学院，中国杭州市，310027

摘要：图像分类是深度学习的重要应用。在典型分类任务中，分类精度与通过深度学习方法提取的特征密切相关。自动编码器是一种特殊神经网络，常用于降维和特征提取。本文所提方法基于传统的自动编码器，将不同类别样本之间的"距离"信息纳入其中。该模型被称为半监督距离自动编码器。首先以无监督方式对每一层进行预训练。在随后的监督训练中，将优化的参数设置为初始值。为获得更好性能，使用堆叠式模型代替具有单一隐含层的传统自动编码器结构。开展一系列实验测试不同模型在几个数据集上的性能，包括MNIST数据集、街景门牌号码（SVHN）数据集、德国交通标志识别基准（GTSRB）和CIFAR-10数据集。将所提半监督距离自动编码器方法分别与传统自动编码器、稀疏自动编码器和监督自动编码器比较，实验结果证明该模型有效。

关键词组：自动编码器；图像分类；半监督学习；神经网络

Share this article to： More

Go to Contents

References:

Open peer comments: Debate/Discuss/Question/Opinion

<1>

DOI:

10.1631/FITEE.1900116

CLC number:

TP391.9

Download Full Text:

Click Here

Downloaded:

9200

Download summary:

Downloaded:

1970

Clicked:

8859

Cited:

On-line Access:

2024-08-27

Received:

2023-10-17

Revision Accepted:

2024-05-08

Crosschecked:

2020-06-10

Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952276; Fax: +86-571-87952331; E-mail: jzus@zju.edu.cn
Copyright © 2000~ Journal of Zhejiang University-SCIENCE

CONTENTS

INSTR. FOR AUTHOR

FOR REVIEWER

ABOUT JZUS

Publishing Service