MULTI-LABELED PATENT DOCUMENT CLASSIFICATION USING TECHNICAL TERM THESAURUS

Yoshimi Suzuki, Fumiyo Fukumoto

2011

Abstract

This paper presents amethod for patent document classification by using an expanded technical term thesaurus. For classifying structural documents such as patent documents, structural information is very useful. However, if we use documents divided into several applicant tags, the number of words are limited. For example, ‘Title of invention’ tag is very important for patent document classification. However, the number of words in the tag is very few. Therefore, in order to deal with this problem, we employ two methods. One is to classify applicant tags into semantic tags, the other is word expansion using an expanded technical term thesaurus. For thesaurus expansion, our system integrates technical terms into a thesaurus using patent documents. The classification results showed the method using the expanded thesaurus was better than that without thesaurus. Although our method is very simple, it is comparable to other methods. These results suggest that thesaurus and our method to expand thesaurus can be useful for patent document classification.

Download


Paper Citation


in Harvard Style

Suzuki Y. and Fukumoto F. (2011). MULTI-LABELED PATENT DOCUMENT CLASSIFICATION USING TECHNICAL TERM THESAURUS . In Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2011) ISBN 978-989-8425-80-5, pages 425-428. DOI: 10.5220/0003658504250428

in Bibtex Style

@conference{keod11,
author={Yoshimi Suzuki and Fumiyo Fukumoto},
title={MULTI-LABELED PATENT DOCUMENT CLASSIFICATION USING TECHNICAL TERM THESAURUS},
booktitle={Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2011)},
year={2011},
pages={425-428},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003658504250428},
isbn={978-989-8425-80-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2011)
TI - MULTI-LABELED PATENT DOCUMENT CLASSIFICATION USING TECHNICAL TERM THESAURUS
SN - 978-989-8425-80-5
AU - Suzuki Y.
AU - Fukumoto F.
PY - 2011
SP - 425
EP - 428
DO - 10.5220/0003658504250428