BIO-INSPIRED AUDITORY PROCESSING FOR SPEECH FEATURE ENHANCEMENT

HariKrishna Maganti, Marco Matassoni

2011

Abstract

Mel-frequency cepstrum based features have traditionally been used for speech recognition in a number of applications, as they provide high recognition accuracy. However, these features are not very robust in noisy acoustic conditions. In this article, we investigate the use of bio-inspired auditory features that emulate the processing performed by the cochlea to improve robustness, particularly to counter environmental reverberation. Our methodology first extracts robust, noise-resistant features by gammatone filtering, which emulates the frequency resolution of the cochlea, and then applies long-term modulation spectral processing, which preserves speech intelligibility in the signal. We compare and discuss the features based on their performance on the Aurora5 meeting recorder digit task, recorded with four different microphones in hands-free mode in a real meeting room. The experimental results show that the proposed features provide considerable improvements over state-of-the-art feature extraction techniques.
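The two stages named in the abstract (gammatone filtering followed by long-term modulation spectral processing) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the number of channels, the frame sizes, or the modulation filtering details, so the function names, the fourth-order filter, the 25 ms / 10 ms framing, and the plain FFT over band-energy trajectories are all assumptions chosen for illustration. The bandwidth formula is the commonly used Glasberg and Moore ERB approximation.

```python
import numpy as np

def erb(fc):
    """Equivalent rectangular bandwidth in Hz (Glasberg and Moore approximation)."""
    return 24.7 * (4.37 * fc / 1000.0 + 1.0)

def gammatone_ir(fc, fs, duration=0.025, order=4):
    """Impulse response of a gammatone filter centred at fc (Hz).

    duration and order are illustrative assumptions, not values from the paper.
    """
    t = np.arange(int(round(duration * fs))) / fs
    b = 1.019 * erb(fc)  # bandwidth parameter tied to the ERB
    g = t ** (order - 1) * np.exp(-2 * np.pi * b * t) * np.cos(2 * np.pi * fc * t)
    return g / np.sqrt(np.sum(g ** 2))  # normalise to unit energy

def gammatone_energies(signal, fs, centre_freqs, frame_len=0.025, hop=0.010):
    """Log frame energy in each gammatone band, shape (n_bands, n_frames)."""
    n = int(round(frame_len * fs))
    h = int(round(hop * fs))
    n_frames = 1 + (len(signal) - n) // h
    out = np.empty((len(centre_freqs), n_frames))
    for i, fc in enumerate(centre_freqs):
        # Filter the whole signal with this band's impulse response.
        y = np.convolve(signal, gammatone_ir(fc, fs), mode="same")
        for j in range(n_frames):
            frame = y[j * h : j * h + n]
            out[i, j] = np.log(np.sum(frame ** 2) + 1e-12)
    return out

def modulation_spectrum(band_energies, frame_rate):
    """Magnitude spectrum of each band's energy trajectory over time."""
    # Remove each band's mean so the DC bin does not dominate.
    centred = band_energies - band_energies.mean(axis=1, keepdims=True)
    spec = np.abs(np.fft.rfft(centred, axis=1))
    freqs = np.fft.rfftfreq(band_energies.shape[1], d=1.0 / frame_rate)
    return freqs, spec
```

With a 10 ms hop the frame rate is 100 Hz, so `modulation_spectrum(energies, frame_rate=100.0)` returns modulation frequencies up to 50 Hz. A robust front end would typically keep only the low modulation frequencies that carry most speech information; which band the paper retains is not stated in the abstract.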


Paper Citation


in Harvard Style

Maganti H. and Matassoni M. (2011). BIO-INSPIRED AUDITORY PROCESSING FOR SPEECH FEATURE ENHANCEMENT. In Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2011) ISBN 978-989-8425-35-5, pages 51-58. DOI: 10.5220/0003145800510058

in Bibtex Style

@conference{biosignals11,
author={HariKrishna Maganti and Marco Matassoni},
title={BIO-INSPIRED AUDITORY PROCESSING FOR SPEECH FEATURE ENHANCEMENT},
booktitle={Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2011)},
year={2011},
pages={51-58},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003145800510058},
isbn={978-989-8425-35-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2011)
TI - BIO-INSPIRED AUDITORY PROCESSING FOR SPEECH FEATURE ENHANCEMENT
SN - 978-989-8425-35-5
AU - Maganti H.
AU - Matassoni M.
PY - 2011
SP - 51
EP - 58
DO - 10.5220/0003145800510058