Journals & Magazines >IEEE Transactions on Multimedia >Volume: 12 Issue: 4

Image Classification With Kernelized Spatial-Context

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

The goal of image classification is to classify a collection of unlabeled images into a set of semantic classes. Many methods have been proposed to approach this goal by ...Show More

Metadata

Abstract:

The goal of image classification is to classify a collection of unlabeled images into a set of semantic classes. Many methods have been proposed to approach this goal by leveraging visual appearances of local patches in images. However, the spatial context between these local patches also provides significant information to improve the classification accuracy. Traditional spatial contextual models, such as two-dimensional hidden Markov model, attempt to construct one common model for each image category to depict the spatial structures of the images in this class. However due to large intra-class variances in an image category, one single model has difficulties in representing various spatial contexts in different images. In contrast, we propose to construct a prototype set of spatial contextual models by leveraging the kernel methods rather than only one model. Such an algorithm combines the advantages of rich representation ability of spatial contextual models as well as the powerful classification ability of kernel method. In particular, we propose a new distance measure between different spatial contextual models by integrating joint appearance-spatial image features. Such a distance measure can be efficiently computed in a recursive formulation that scales well to image size. Extensive experiments demonstrate that the proposed approach significantly outperforms the state-of-the-art approaches.

Published in: IEEE Transactions on Multimedia ( Volume: 12, Issue: 4, June 2010)

Page(s): 278 - 287

Date of Publication: 22 March 2010

ISSN Information:

DOI: 10.1109/TMM.2010.2046270

Contents

I. Introduction

Image categorization has attracted much attention in recent years. Its goal is to categorize a collection of unlabeled images into a set of predefined classes for semantic-level image retrieval. Among various image classification methods, many researchers have developed a set of sophisticated models, to represent the spatial context of the local patches in the images, e.g., hidden conditional random fields [2], constellation model [3], etc. Among them, 2-dimensional hidden Markov model (2-D HMM) has attracted much attention as a classic spatial contextual model [4]–[6]. This model can efficiently capture the spatial context among different patches in the images. In more detail, when using 2-D HMM for image categorization, a model is first learned from a training set of images for each image class. Then this learned model can be used to score the probability of an unlabeled image belonging to this class. However, the images in one class usually have large intra-class variance and this variance often leads to the difficulty in constructing a common spatial contextual model for this class. Fig. 1 illustrates an example of this difficulty. The images in the category “car” have many different views in this example, such as top view, side view, front view, and back view. Each view has a different spatial context of their local patches. These differences between the image spatial contexts bring large intra-class variance for this category. As stated above, the traditional 2-D HMM attempts to use a common model to generate all these images with different spatial structures. Therefore, the depictive ability of a single model is too limited to capture large intra-class variance perfectly. Actually, the above problem also exists in many other spatial-contextual models for image categorization, which attempt to use one common generative model to represent one class, such as HCRF [2] and constellation model [3]. Fig. 1.

Using one common 2-D HMM to model the category “car” with large intra-class variance. In this example, four different views in “car” make one model inadequate to capture such an intra-class variance.

References is not available for this document.

Image Classification With Kernelized Spatial-Context

Abstract:

Metadata

Abstract:

ISSN Information:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Image Classification With Kernelized Spatial-Context

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

I. Introduction

References