Naïve Bayes Domain Adaptation for Biological Sequences

Nic Herndon, Doina Caragea

2013

Abstract

The increased volume of biological data requires automatic computation tools to analyze it. Although machine learning methods have been successfully used with biological sequences in a supervised framework, their accuracy usually suffers when a classifier is learned on a source domain and applied to a different, less studied domain, in a domain adaptation framework. To address this issue, we propose to use an algorithm that combines labeled sequences from a well studied organism, the source domain, with labeled and unlabeled sequences from a related, less studied organism, the target domain. Our experimental results show that this algorithm has high classifying accuracy on the target domain.

Download


Paper Citation


in Harvard Style

Herndon N. and Caragea D. (2013). Naïve Bayes Domain Adaptation for Biological Sequences . In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2013) ISBN 978-989-8565-35-8, pages 62-70. DOI: 10.5220/0004245500620070

in Bibtex Style

@conference{bioinformatics13,
author={Nic Herndon and Doina Caragea},
title={Naïve Bayes Domain Adaptation for Biological Sequences},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2013)},
year={2013},
pages={62-70},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004245500620070},
isbn={978-989-8565-35-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2013)
TI - Naïve Bayes Domain Adaptation for Biological Sequences
SN - 978-989-8565-35-8
AU - Herndon N.
AU - Caragea D.
PY - 2013
SP - 62
EP - 70
DO - 10.5220/0004245500620070