AdaptIE - Using Domain Language Concept to Enable Domain Experts in Modeling of Information Extraction Plans

Wojciech M. Barczyński, Felix Förster, Falk Brauer, Daniel Schuster

2010

Abstract

Implementing domain-specific Information Extraction (IE) technologies to retrieve structured information from unstructured data is a challenging and complex task. It requires both IE expertise (e.g., in linguistics) and domain knowledge, provided by a domain expert who is aware of, for example, the specifics of the text corpus and the entities of interest. While the IE expert role is addressed by several approaches, less has been done to involve domain experts in the process of IE development. Our approach targets this issue. We provide a base platform for collaboration between experts through IE plan modeling languages, which are used to compose basic IE operators into complex IE flows. We provide each expert with a language adapted to their expertise: IE experts work with a fine-grained view and domain experts with a coarse-grained view of IE execution. We use the Model Driven Architecture concept to enable transitions among the languages and the operators provided by an algebraic IE framework. To demonstrate the applicability of our approach, we implemented an Eclipse-based tool, AdaptIE, and demonstrate it in a real-world scenario for the SAP Community Network.
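
The idea of composing basic IE operators into a larger flow, and exposing that flow to a domain expert as a single coarse-grained step, can be sketched as follows. This is a minimal illustration and not AdaptIE's actual languages or API: the `Annotation` type, the `regex_extract` and `compose` helpers, the patterns, and the sample sentence are all hypothetical names chosen for this example.

```python
import re
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Annotation:
    """A typed text span found in a document (hypothetical structure)."""
    kind: str
    text: str
    start: int
    end: int


# An operator takes the document text and the annotations found so far,
# and returns an updated annotation list.
Operator = Callable[[str, List[Annotation]], List[Annotation]]


def regex_extract(kind: str, pattern: str) -> Operator:
    """Fine-grained operator: annotate every match of a regular expression."""
    compiled = re.compile(pattern)

    def op(doc: str, anns: List[Annotation]) -> List[Annotation]:
        found = [
            Annotation(kind, m.group(0), m.start(), m.end())
            for m in compiled.finditer(doc)
        ]
        return anns + found

    return op


def compose(*ops: Operator) -> Operator:
    """Chain operators into one flow that itself behaves like an operator."""

    def flow(doc: str, anns: List[Annotation]) -> List[Annotation]:
        for op in ops:
            anns = op(doc, anns)
        return anns

    return flow


# Fine-grained view: the IE expert wires up individual operators.
# Coarse-grained view: the domain expert only sees `find_product_mentions`.
find_product_mentions = compose(
    regex_extract("Product", r"SAP \w+"),      # assumed pattern, illustration only
    regex_extract("Version", r"\b\d+\.\d+\b"),  # assumed pattern, illustration only
)

if __name__ == "__main__":
    for ann in find_product_mentions("Upgrade of SAP NetWeaver 7.0 fails", []):
        print(ann)
```

Because `compose` returns an ordinary operator, a composed flow can be nested inside another flow, which is one simple way to offer the same plan at different levels of granularity.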

Paper Citation


in Harvard Style

Barczyński W. M., Förster F., Brauer F. and Schuster D. (2010). AdaptIE - Using Domain Language Concept to Enable Domain Experts in Modeling of Information Extraction Plans. In Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-8425-04-1, pages 249-256. DOI: 10.5220/0002902602490256

in Bibtex Style

@conference{iceis10,
author={Wojciech M. Barczyński and Felix Förster and Falk Brauer and Daniel Schuster},
title={AdaptIE - Using Domain Language Concept to Enable Domain Experts in Modeling of Information Extraction Plans},
booktitle={Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2010},
pages={249-256},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002902602490256},
isbn={978-989-8425-04-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - AdaptIE - Using Domain Language Concept to Enable Domain Experts in Modeling of Information Extraction Plans
SN - 978-989-8425-04-1
AU - Barczyński W. M.
AU - Förster F.
AU - Brauer F.
AU - Schuster D.
PY - 2010
SP - 249
EP - 256
DO - 10.5220/0002902602490256