A SILENCE REMOVAL AND ENDPOINT DETECTION APPROACH FOR SPEECH PROCESSING

Authors

  • Muhammad Asadullah National University of Computer and Emerging Sciences, Peshawar
  • Shibli Nisar National University of Computer and Emerging Sciences, Peshawar

Abstract

In this paper a brief overview of silence removal and voice activity detection is discussed and a new method for silence removal is suggested. The objective of suggested method is to delete the silence and unvoiced segments from the speech signal which are very useful to increase the performance and accuracy of the system. Endpoint detection is used to remove the DC offset value from the signal after silence removal process. Silence removal and Endpoint detection are main part of many applications such as speaker and speech recognition. The proposed method uses Root Mean Square (RMS) to delete the unvoiced segments from the speech signal. This work showed better results for silence removal and endpoint detection than existing methods. The performance of this research work is evaluated using MATLAB tool and accuracy of 97.2% is achieved.

Author Biographies

Muhammad Asadullah, National University of Computer and Emerging Sciences, Peshawar

National University of Computer and Emerging Sciences, Peshawar

Shibli Nisar, National University of Computer and Emerging Sciences, Peshawar

National University of Computer and Emerging Sciences, Peshawar

References

A. M. Cordovilla, N.Ma, V. Sánchez, J. L. Carmona, A. M. Peinado, J. Barker, “A Pitch Based Noise Estimation Technique for Robust Speech Recognition with Missing Dataâ€, IEEE ICASSP, 2011 , pp. 4808 – 4811.

N. Soo Kim, W. Sung, “A statistical model-based voice activity detectionâ€, IEEE Signal Processing Letters, 1999, vol. 6, pp. 1 – 3.

D. G. Childers, J. M. Larar, M. Hand “Silent and Voiced/Unvoied/ Mixed Excitation (Four-Way), Classification of Speechâ€, IEEE Transaction on ASSP, IEEE, 1989, Vol.37, pp. 1771-1774

H. Dou, Z. Wu, Y. Feng, Y. Qian, “Voice Activity Detection Based on the Bi-spectrumâ€, IEEE 10th International conference on Signal Processing, IEEE, 2010, pp. 502-505.

D. Enqing, L. Guizhong, Z. Yatong, C. Yu “Voice activity detection based on short-time energy and noise spectrum adaptation†IEEE, 6th international conference on signal processing, 2002, vol. 1, pp. 464 – 467.

E. A. E-Sotelo, E. E-Hernandez, E. G-Rios, H. M. P-Meana “Endpoint Detector Algorithm for Speech Recognition Applicationâ€, 2012 22nd International Conference on ELECOMP, IEEE, 2012, pp. 252 - 256

Poonam Sharma, Abha Kiran, “Automatic Identification of Silence, Unvoiced and Voiced Chunks in Speech†Academy & Industry Research Collaboration Center (AIRCC), Computer Science & Information Technology, 2013, 3 (5), pp. 87-96.

G. Saha, S. Chakroborty, S. Senapati, “A New Silence Removal and Endpoint Detection Algorithm for Speech and Speaker Recognition Applicationsâ€, IJIGSP, December 2014, pp. 1-5.

In-Sung Han, Chan-Shik Ahn, “Voice Detection using Speech Energy Maximization and Silence Feature Normalizationâ€, Advanced Science and Technology Letters, Vol.49 (ICSS 2014), pp.25-29.

T.R Sahoo, S. Patra, “Silence Removal and Endpoint Detection of Speech Signal for Text Independent Speaker Identificationâ€, I.J. Image, Graphics and Signal Processing, 2014, vol. 6, pp. 27-35.

Andrew KInghorn and Mark Greenwood, “SUVing: Automatic Silence/Unvoiced/Voiced Classification of Speech'', Presented at the university of Sheffield.

Downloads

Published

2017-05-15