Weighted Voting of Different Term Weighting Methods for Natural Language Call Routing

Roman Sergienko, Iuliia Kamshilova, Eugene Semenkin, Alexander Schmitt

2016

Abstract

The text classification problem for natural language call routing was considered in the paper. Seven different term weighting methods were applied. As dimensionality reduction methods, the combination of stop-word filtering and stemming and the feature transformation based on term belonging to classes were considered. kNN and SVM-FML were used as classification algorithms. In the paper the idea of voting with different term weighting methods was proposed. The majority vote of seven considered term weighting methods provides significant improvement of classification effectiveness. After that the weighted voting based on optimization with self-adjusting genetic algorithm was investigated. The numerical results showed that weighted voting provides additional improvement of classification effectiveness. Especially significant improvement of the classification effectiveness is observed with the feature transformation based on term belonging to classes that reduces the dimensionality radically; the dimensionality equals number of classes. Therefore, it can be useful for real-time systems as natural language call routing.

Download


Paper Citation


in Harvard Style

Sergienko R., Kamshilova I., Semenkin E. and Schmitt A. (2016). Weighted Voting of Different Term Weighting Methods for Natural Language Call Routing . In Proceedings of the 13th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO, ISBN 978-989-758-198-4, pages 38-46. DOI: 10.5220/0005956600380046

in Bibtex Style

@conference{icinco16,
author={Roman Sergienko and Iuliia Kamshilova and Eugene Semenkin and Alexander Schmitt},
title={Weighted Voting of Different Term Weighting Methods for Natural Language Call Routing},
booktitle={Proceedings of the 13th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO,},
year={2016},
pages={38-46},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005956600380046},
isbn={978-989-758-198-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 13th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO,
TI - Weighted Voting of Different Term Weighting Methods for Natural Language Call Routing
SN - 978-989-758-198-4
AU - Sergienko R.
AU - Kamshilova I.
AU - Semenkin E.
AU - Schmitt A.
PY - 2016
SP - 38
EP - 46
DO - 10.5220/0005956600380046