Approximate Image Matching using Strings of Bag-of-Visual Words Representation

Hong-Thinh Nguyen, Cécile Barat, Christophe Ducottet

2014

Abstract

The Spatial Pyramid Matching approach has become very popular to model images as sets of local bag-of words. The image comparison is then done region-by-region with an intersection kernel. Despite its success, this model presents some limitations: the grid partitioning is predefined and identical for all images and the matching is sensitive to intra- and inter-class variations. In this paper, we propose a novel approach based on approximate string matching to overcome these limitations and improve the results. First, we introduce a new image representation as strings of ordered bag-of-words. Second, we present a new edit distance specifically adapted to strings of histograms in the context of image comparison. This distance identifies local alignments between subregions and allows to remove sequences of similar subregions to better match two images. Experiments on 15 Scenes and Caltech 101 show that the proposed approach outperforms the classical spatial pyramid representation and most existing concurrent methods for classification presented in recent years.

Download


Paper Citation


in Harvard Style

Nguyen H., Barat C. and Ducottet C. (2014). Approximate Image Matching using Strings of Bag-of-Visual Words Representation . In Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2014) ISBN 978-989-758-004-8, pages 345-353. DOI: 10.5220/0004676803450353

in Bibtex Style

@conference{visapp14,
author={Hong-Thinh Nguyen and Cécile Barat and Christophe Ducottet},
title={Approximate Image Matching using Strings of Bag-of-Visual Words Representation},
booktitle={Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2014)},
year={2014},
pages={345-353},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004676803450353},
isbn={978-989-758-004-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2014)
TI - Approximate Image Matching using Strings of Bag-of-Visual Words Representation
SN - 978-989-758-004-8
AU - Nguyen H.
AU - Barat C.
AU - Ducottet C.
PY - 2014
SP - 345
EP - 353
DO - 10.5220/0004676803450353