Efficient FPGA Implementation of a Knowledge-based Automatic Speech Classifier

Antonio Gentile, Sabato Marco Siniscalchi, Filippo Sorbello, Salvatore Vitabile, Sabato M. Siniscalchi, Salvatore Vitabile, Filippo Sorbello, Antonio Gentile, Francesca Gennaro

Risultato della ricerca: Chapter

2 Citazioni (Scopus)

Abstract

Speech recognition has become common in many application domains, from dictation systems for professional practices to vocal user interfaces for people with disabilities or hands-free system control. However, so far the performance of Automatic Speech Recognition (ASR) systems are comparable to Human Speech Recognition (HSR) only under very strict working conditions, and in general far lower. Incorporating acoustic-phonetic knowledge into ASR design has been proven a viable approach to rise ASR accuracy. Manner of articulation attributes such as vowel, stop, fricative, approximant, nasal, and silence are examples of such knowledge. Neural networks have already been used successfully as detectors for manner of articulation attributes starting from representations of speech signal frames. In this paper an optimized digital Knowledge-based Automatic Speech Classifier for real-time applications is implemented on FPGA using six attribute scoring Multi-Layer Perceptrons (MLP). Digital MLP key features are a virtual neuron architecture and use of sinusoidal activation functions for the hidden layer. Implementation results on FPGA show that use of sinusoidal activation functions decrease hardware resource usage of more than 50% for slices, FFs, LUTs and more than 35% for FPGA RAM blocks when compared with the standard sigmoid-based neuron implementation. Furthermore, neuron virtualization allows for a significant decrease of concurrent memory access, resulting in improved performance for the entire attribute scoring module.
Lingua originaleEnglish
Titolo della pubblicazione ospiteEmbedded Software and Systems : Second International Conference, ICESS 2005 : Xi'an, China, December 16-18, 2005 : proceedings
Pagine198-209
Numero di pagine12
Stato di pubblicazionePublished - 2005

Serie di pubblicazioni

NomeLECTURE NOTES IN COMPUTER SCIENCE

    Fingerprint

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Cita questo

Gentile, A., Siniscalchi, S. M., Sorbello, F., Vitabile, S., Siniscalchi, S. M., Vitabile, S., Sorbello, F., Gentile, A., & Gennaro, F. (2005). Efficient FPGA Implementation of a Knowledge-based Automatic Speech Classifier. In Embedded Software and Systems : Second International Conference, ICESS 2005 : Xi'an, China, December 16-18, 2005 : proceedings (pagg. 198-209). (LECTURE NOTES IN COMPUTER SCIENCE).