HyperSAX: Fast Approximate Search of Multidimensional Data

Jens Emil Gydesen, Henrik Haxholm, Niels Sonnich Poulsen, Sebastian Wahl, Bo Thiesson

2015

Abstract

The increasing amount and size of data makes indexing and searching more difficult. It is especially challenging for multidimensional data such as images, videos, etc. In this paper we introduce a new indexable symbolic data representation that allows us to efficiently index and retrieve from a large amount of data that may appear in multiple dimensions. We use an approximate lower bounding distance measure to compute the distance between multidimensional arrays, which allows us to perform fast similarity searches. We present two search methods, exact and approximate, which can quickly retrieve data using our representation. Our approach is very general and works for many types of multidimensional data, including different types of image representations. Even for millions of multidimensional arrays, the approximate search will find a result in a few milliseconds, and will in many cases return a result similar to the best match.

Download


Paper Citation


in Harvard Style

Gydesen J., Haxholm H., Poulsen N., Wahl S. and Thiesson B. (2015). HyperSAX: Fast Approximate Search of Multidimensional Data . In Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-076-5, pages 190-198. DOI: 10.5220/0005185201900198

in Bibtex Style

@conference{icpram15,
author={Jens Emil Gydesen and Henrik Haxholm and Niels Sonnich Poulsen and Sebastian Wahl and Bo Thiesson},
title={HyperSAX: Fast Approximate Search of Multidimensional Data},
booktitle={Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2015},
pages={190-198},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005185201900198},
isbn={978-989-758-076-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - HyperSAX: Fast Approximate Search of Multidimensional Data
SN - 978-989-758-076-5
AU - Gydesen J.
AU - Haxholm H.
AU - Poulsen N.
AU - Wahl S.
AU - Thiesson B.
PY - 2015
SP - 190
EP - 198
DO - 10.5220/0005185201900198