An Investigation of Tandem MLP Features for ASR

TitleAn Investigation of Tandem MLP Features for ASR
Publication TypeTechnical Report
Year of Publication2007
AuthorsFaria, A.
Other Numbers2212
Abstract

This project explores speech feature representations produced by discriminatively trained multi-layer perceptrons. Previous research has demonstrated that such a tandem approach can be successfully exploited for large-vocabulary automatic speech recognition systems. The principal aim of this work is to empirically evaluate some variants of these features. While experimental results validate some of the design choices of the standard implementation, other evidence suggests alternatives that may improve performance. From this exploratory investigation, we hypothesize which of the various modifications are most promising; applied to a Mandarin broadcast news task, the new configuration demonstrates significant improvement. Along with the novel presentation of a “best-case scenario” and other cheating experiments, an interpretation of these results is discussed with the hope of guiding future directions of research.

URLhttp://www.icsi.berkeley.edu/pubs/techreports/faria_icsitr.pdf
Bibliographic Notes

ICSI Technical Report TR-07-003

Abbreviated Authors

A. Faria

ICSI Research Group

Speech

ICSI Publication Type

Technical Report