Schema-based Parallel Compression and Decompression of XML Data

Stefan Böttcher, Matthias Feldotto, Rita Hartel

2013

Abstract

Whenever huge amounts of XML data have to be transferred from a web server to multiple clients, the transferred data volumes can be reduced significantly by sending compressed XML instead of plain XML. Whenever applications require querying a compressed XML format and XML compression or decompression time is a bottleneck, parallel XML compression and parallel decompression may be of significant advantage. We choose the XML compressor XSDS as starting point for our new approach to parallel compression and parallel decompression of XML documents for the following reasons. First, XSDS generally reaches stronger compression ratios than other compressors like gzip, bzip2, and XMill. Second, in contrast to these compressors, XSDS not only supports XPath queries on compressed XML data, but also XPath queries can be evaluated on XSDS compressed data even faster than on uncompressed XML. We propose a String-search-based parsing approach to parallelize XML compression with XSDS, and we show that we can speed-up the compression of XML documents by a factor of 1.4 and that we can speed-up the decompression time even by a factor of up to 7 on a quad-core processor.

Download


Paper Citation


in Harvard Style

Böttcher S., Feldotto M. and Hartel R. (2013). Schema-based Parallel Compression and Decompression of XML Data . In Proceedings of the 9th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-8565-54-9, pages 77-86. DOI: 10.5220/0004366300770086

in Bibtex Style

@conference{webist13,
author={Stefan Böttcher and Matthias Feldotto and Rita Hartel},
title={Schema-based Parallel Compression and Decompression of XML Data},
booktitle={Proceedings of the 9th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2013},
pages={77-86},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004366300770086},
isbn={978-989-8565-54-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - Schema-based Parallel Compression and Decompression of XML Data
SN - 978-989-8565-54-9
AU - Böttcher S.
AU - Feldotto M.
AU - Hartel R.
PY - 2013
SP - 77
EP - 86
DO - 10.5220/0004366300770086