Restoring the missing features of the corrupted speech using linear interpolation methods

dc.contributor.author Rassem, Taha H.
dc.contributor.author Makbol, Nasrin M.
dc.contributor.author Hasan, Ali Muttaleb
dc.contributor.author Zaki, Siti Syazni Mohd
dc.contributor.author Girija, P. N.
dc.date.accessioned 2022-03-27T05:51:52Z
dc.date.available 2022-03-27T05:51:52Z
dc.date.issued 2017-10-03
dc.description.abstract One of the main challenges in the Automatic Speech Recognition (ASR) is the noise. The performance of the ASR system reduces significantly if the speech is corrupted by noise. In spectrogram representation of a speech signal, after deleting low Signal to Noise Ratio (SNR) elements, the incomplete spectrogram is obtained. In this case, the speech recognizer should make modifications to the spectrogram in order to restore the missing elements, which is one direction. In another direction, speech recognizer should be able to restore the missing elements due to deleting low SNR elements before performing the recognition. This is can be done using different spectrogram reconstruction methods. In this paper, the geometrical spectrogram reconstruction methods suggested by some researchers are implemented as a toolbox. In these geometrical reconstruction methods, the linear interpolation along time or frequency methods are used to predict the missing elements between adjacent observed elements in the spectrogram. Moreover, a new linear interpolation method using time and frequency together is presented. The CMU Sphinx III software is used in the experiments to test the performance of the linear interpolation reconstruction method. The experiments are done under different conditions such as different lengths of the window and different lengths of utterances. Speech corpus consists of 20 males and 20 females; each one has two different utterances are used in the experiments. As a result, 80% recognition accuracy is achieved with 25% SNR ratio.
dc.identifier.citation AIP Conference Proceedings. v.1891
dc.identifier.issn 0094243X
dc.identifier.uri 10.1063/1.5005452
dc.identifier.uri http://aip.scitation.org/doi/abs/10.1063/1.5005452
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8455
dc.title Restoring the missing features of the corrupted speech using linear interpolation methods
dc.type Conference Proceeding. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: