Quantifying the Specificity of Near-duplicate Image Classification Functions

Richard Connor, Franco Alberto Cardillo

2016

Abstract

There are many published methods for detecting similar and near-duplicate images. Here, we consider their use in the context of unsupervised near-duplicate detection, where the task is to find a (relatively small) near- duplicate intersection of two large candidate sets. Such scenarios are of particular importance in forensic near-duplicate detection. The essential properties of a such a function are: performance, sensitivity, and specificity. We show that, as collection sizes increase, then specificity becomes the most important of these, as without very high specificity huge numbers of false positive matches will be identified. This makes even very fast, highly sensitive methods completely useless. Until now, to our knowledge, no attempt has been made to measure the specificity of near-duplicate finders, or even to compare them with each other. Recently, a benchmark set of near-duplicate images has been established which allows such assessment by giving a near-duplicate ground truth over a large general image collection. Using this we establish a methodology for calculating specificity. A number of the most likely candidate functions are compared with each other and accurate measurement of sensitivity vs. specificity are given. We believe these are the first such figures be to calculated for any such function.

Download


Paper Citation


in Harvard Style

Connor R. and Cardillo F. (2016). Quantifying the Specificity of Near-duplicate Image Classification Functions . In Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2016) ISBN 978-989-758-175-5, pages 647-654. DOI: 10.5220/0005785406470654

in Bibtex Style

@conference{visapp16,
author={Richard Connor and Franco Alberto Cardillo},
title={Quantifying the Specificity of Near-duplicate Image Classification Functions},
booktitle={Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2016)},
year={2016},
pages={647-654},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005785406470654},
isbn={978-989-758-175-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2016)
TI - Quantifying the Specificity of Near-duplicate Image Classification Functions
SN - 978-989-758-175-5
AU - Connor R.
AU - Cardillo F.
PY - 2016
SP - 647
EP - 654
DO - 10.5220/0005785406470654