Bootstrapping a Semantic Lexicon on Verb Similarities

Shaheen Syed, Marco Spruit, Melania Borit

2016

Abstract

We present a bootstrapping algorithm to create a semantic lexicon from a list of seed words and a corpus that was mined from the web. We exploit extraction patterns to bootstrap the lexicon and use collocation statistics to dynamically score new lexicon entries. Extraction patterns are subsequently scored by calculating the conditional probability in relation to a non-related text corpus. We find that verbs that are highly domain related achieved the highest accuracy and collocation statistics affect the accuracy positively and negatively during the bootstrapping runs.

Download


Paper Citation


in Harvard Style

Syed S., Spruit M. and Borit M. (2016). Bootstrapping a Semantic Lexicon on Verb Similarities . In Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016) ISBN 978-989-758-203-5, pages 189-196. DOI: 10.5220/0006036901890196

in Bibtex Style

@conference{kdir16,
author={Shaheen Syed and Marco Spruit and Melania Borit},
title={Bootstrapping a Semantic Lexicon on Verb Similarities},
booktitle={Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016)},
year={2016},
pages={189-196},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006036901890196},
isbn={978-989-758-203-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016)
TI - Bootstrapping a Semantic Lexicon on Verb Similarities
SN - 978-989-758-203-5
AU - Syed S.
AU - Spruit M.
AU - Borit M.
PY - 2016
SP - 189
EP - 196
DO - 10.5220/0006036901890196