Abstract
This paper describes a method for building a neural-network-based continuous speech recognizer for Vietnamese. Through phonetic analysis of Vietnamese, we determine the context-dependent phonemes for a given vocabulary; that is, a phoneme is classified differently depending on the phonemes that surround it. Feature extraction uses the widely adopted mel-cepstrum model, in which the short-time spectrum is warped according to the mel scale and the log power spectrum is then transformed directly to the cepstral domain by an inverse discrete cosine transform. Neural networks are applied to estimate the context-dependent phoneme probabilities, producing output values in the range 0 to 1. The phoneme probabilities for successive frames are arranged in a matrix. We then use the Viterbi algorithm to find the legal string of phonemes through the matrix that gives the highest score, which identifies the target word. The experiments were programmed in Microsoft Visual C++ 6.0. The high recognition accuracy obtained confirms the applicability of neural networks to Vietnamese speech recognition.
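The decoding step described in the abstract can be illustrated with a minimal sketch. It is not the authors' Visual C++ implementation. It assumes that each word in the vocabulary is a left-to-right sequence of phoneme indices, that each frame either stays on the current phoneme or advances to the next one, and that path scores are sums of log probabilities. The names `viterbi_word_score`, `recognize`, `word_a` and `word_b`, and the toy probability matrix, are hypothetical.

```python
import math

NEG_INF = float("-inf")

def viterbi_word_score(log_probs, phoneme_seq):
    """Best log score of aligning all frames to phoneme_seq, left to right.

    log_probs[t][k] is the log probability of phoneme k at frame t
    (assumed to come from the network outputs).
    """
    n_states = len(phoneme_seq)
    # score[s]: best log score of any path that is in state s at the current frame
    score = [NEG_INF] * n_states
    score[0] = log_probs[0][phoneme_seq[0]]  # a path must start on the first phoneme
    for t in range(1, len(log_probs)):
        new_score = [NEG_INF] * n_states
        for s in range(n_states):
            stay = score[s]
            advance = score[s - 1] if s > 0 else NEG_INF
            best_prev = max(stay, advance)
            if best_prev > NEG_INF:
                new_score[s] = best_prev + log_probs[t][phoneme_seq[s]]
        score = new_score
    return score[-1]  # a legal path must end on the last phoneme

def recognize(probs, lexicon):
    """Return the highest-scoring word and the log score of every word."""
    # Floor the probabilities to avoid log(0); the floor value is an assumption.
    log_probs = [[math.log(max(p, 1e-12)) for p in frame] for frame in probs]
    scores = {w: viterbi_word_score(log_probs, seq) for w, seq in lexicon.items()}
    return max(scores, key=scores.get), scores

# Toy matrix: 4 frames (rows) by 3 phonemes (columns), values in the range 0 to 1.
probs = [
    [0.8, 0.1, 0.1],
    [0.7, 0.2, 0.1],
    [0.1, 0.8, 0.1],
    [0.1, 0.1, 0.8],
]
lexicon = {"word_a": [0, 1, 2], "word_b": [2, 1, 0]}  # hypothetical vocabulary
best, scores = recognize(probs, lexicon)
print(best)
```

In this toy case the best path for `word_a` assigns the first two frames to phoneme 0, the third to phoneme 1 and the fourth to phoneme 2, so it outscores `word_b`. Working in the log domain keeps the scores numerically stable when the number of frames is large.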
Issue: Vol 5 No 10 (2002)
Page No.: 13-21
Published: Oct 31, 2002
Section: Article
DOI: https://doi.org/10.32508/stdj.v5i10.3445
Most read articles by the same author(s)
- Le Tien Thuong, Le Ngoc Phu, VIETNAMESE SPEECH SYNTHESIS BASED ON PITCH PERIODS, Science and Technology Development Journal: Vol 8 No 4 (2005)