Multi-Scale Multi-Level Generative Model in Scene Classification
スポンサーリンク
概要
- 論文の詳細を見る
Previous works show that the probabilistic Latent Semantic Analysis (pLSA) model is one of the best generative models for scene categorization and can obtain an acceptable classification accuracy. However, this method uses a certain number of topics to construct the final image representation. In such a way, it restricts the image description to one level of visual detail and cannot generate a higher accuracy rate. In order to solve this problem, we propose a novel generative model, which is referred to as multi-scale multi-level probabilistic Latent Semantic Analysis model (msml-pLSA). This method consists of two parts: multi-scale part, which extracts visual details from the image of diverse resolutions, and multi-level part, which concentrates multiple levels of topic representation to model scene. The msml-pLSA model allows for the description of fine and coarse local image detail in one framework. The proposed method is evaluated on the well-known scene classification dataset with 15 scene categories, and experimental results show that the proposed msml-pLSA model can improve the classification accuracy compared with the typical classification methods.
著者
-
XU De
Institute of Computer Science and Engineering, Beijing Jiaotong University
-
XIE Wenjie
Institute of Computer Science and Engineering, Beijing Jiaotong University
-
TANG Yingjun
Institute of Computer Science and Engineering, Beijing Jiaotong University
-
CUI Geng
Institute of Orthopedics, General Hospital of PLA
関連論文
- Adaptively Combining Local with Global Information for Natural Scenes Categorization
- Multi-Scale Multi-Level Generative Model in Scene Classification
- Color Constancy Based on Image Similarity
- How the Number of Interest Points Affect Scene Classification
- Natural Scene Classification Based on Integrated Topic Simplex
- Edge-Based Color Constancy via Support Vector Regression
- Combining Attention Model with Hierarchical Graph Representation for Region-Based Image Retrieval
- A Multi-Scale Adaptive Grey World Algorithm(Image Recognition, Computer Vision)
- A Novel Tone Mapping Based on Double-Anchoring Theory for Displaying HDR Images
- Adaptively Combining Local with Global Information for Natural Scenes Categorization
- Action Recognition Using Visual-Neuron Feature
- 2D Log-Gabor Wavelet Based Action Recognition
- Adaptive Non-linear Intensity Mapping Based Salient Region Extraction
- Modeling Bottom-Up Visual Attention for Color Images
- A Visual Inpainting Method Based on the Compressed Domain(Image Processing and Video Processing)
- Moving Object Completion on the Compressed Domain
- Color Constancy Based on Effective Regions
- Predicting DataSpace Retrieval Using Probabilistic Hidden Information
- Multi-Scale Multi-Level Generation Model in Scene Classification