Book Description: Florian Müller: Invariant Features and Enhanced Speaker Normalization for Automatic Speech Recognition

Logos Verlag Berlin – Academic Publications in Science and Humanities


Invariant Features and Enhanced Speaker Normalization for Automatic Speech Recognition

Florian Müller

ISBN 978-3-8325-3319-9
247 pages, year of publication: 2013
price: 40.50 €

Automatic speech recognition systems must cope with several kinds of variability to achieve high recognition rates in practice. One variability with a major impact on performance is the vocal tract length of the speakers. Two methods are commonly used to handle it: normalization of the features and adaptation of the acoustic models. A third approach instead extracts features with transforms that are invariant to changes in vocal tract length.

This work presents several approaches for extracting invariant features for automatic speech recognition systems. It evaluates the robustness of these features under various training-test conditions and describes how their robustness to noise can be increased. Furthermore, it shows how the spectral effects of different vocal tract lengths can be estimated with a registration method and how this estimate can be used for speaker normalization.

Keywords:

Speech recognition
Invariant feature extraction
Normalization

BUYING OPTIONS

	38.00 €
in stock

	36.00 €
	48.00 €
	52.00 €

(D) = Within Germany
(W) = Abroad

You can purchase the eBook (PDF) alone or combined with the printed book (Bundle). In both cases payment is processed through PayPal; a PayPal account is not required. By purchasing the eBook or eBundle you accept our licence for eBooks.

For multi-user or campus licences (MyLibrary) please fill in the form or write an email to order@logos-verlag.de


Logos Verlag Berlin GmbH, Georg-Knorr-Str. 4, Geb. 10, D-12681 Berlin, Tel.: +49 (0)30 4285 1090, Fax: +49 (0)30 4285 1092
