Full Text:   <2679>

Summary:  <2141>

CLC number: TP391.4

On-line Access: 2024-08-27

Received: 2023-10-17

Revision Accepted: 2024-05-08

Crosschecked: 2015-09-21

Cited: 2

Clicked: 7122

Citations:  Bibtex RefMan EndNote GB/T7714

 ORCID:

Fu-xiang Lu

http://orcid.org/0000-0002-5810-7631

-   Go to

Article info.
Open peer comments

Frontiers of Information Technology & Electronic Engineering  2015 Vol.16 No.10 P.817-828

http://doi.org/10.1631/FITEE.1500070


Beyond bag of latent topics: spatial pyramid matching for scene category recognition


Author(s):  Fu-xiang Lu, Jun Huang

Affiliation(s):  School of Information Science & Engineering, Lanzhou University, Lanzhou 730000, China; more

Corresponding email(s):   lufux@lzu.edu.cn, huangj@sari.ac.cn

Key Words:  Scene category recognition, Probabilistic latent semantic analysis, Bag-of-words, Adaptive boosting



Abstract: 
We propose a heterogeneous, mid-level feature based method for recognizing natural scene categories. The proposed feature introduces spatial information among the latent topics by means of spatial pyramid, while the latent topics are obtained by using probabilistic latent semantic analysis (pLSA) based on the bag-of-words representation. The proposed feature always performs better than standard pLSA because the performance of pLSA is adversely affected in many cases due to the loss of spatial information. By combining various interest point detectors and local region descriptors used in the bag-of-words model, the proposed feature can make further improvement for diverse scene category recognition tasks. We also propose a two-stage framework for multi-class classification. In the first stage, for each of possible detector/descriptor pairs, adaptive boosting classifiers are employed to select the most discriminative topics and further compute posterior probabilities of an unknown image from those selected topics. The second stage uses the prod-max rule to combine information coming from multiple sources and assigns the unknown image to the scene category with the highest ‘final’ posterior probability. Experimental results on three benchmark scene datasets show that the proposed method exceeds most state-of-the-art methods.

Open peer comments: Debate/Discuss/Question/Opinion

<1>

Please provide your name, email address and a comment





Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - 2025 Journal of Zhejiang University-SCIENCE