International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 123 | Views: 295

M.Tech / M.E / PhD Thesis | Electronics & Communication Engineering | India | Volume 5 Issue 6, June 2016 | Popularity: 6.8 / 10


     

Discriminating Speech and Nonspeech from Video Signals using SFF VAD

Avani S Babu, Amrutha V Nair


Abstract: An image processing approach is used for speech/nonspeech discrimination. The approach is based on single frequency filtering (SFF) and visual VAD. SFF is the amplitude envelope of the signal is obtained at each frequency with high temporal and spectral resolution where visual VAD is a classifier to determine whether a speaker is silent or not in a frame using the associated video signal. The high resolution property of SFF helps to exploit the resulting high signal-to-noise ratio (SNR) regions in time and frequency. But in SFF method, nonspeech is also considered as speech in the audio signal at particular situations. To avoid this issue, a technique is proposed with the combination of SFF and Visual VAD in which the speech is extracted from the video signals by the lip movement. In this method uses lip shape and degree of lip opening as visual features representing a subjects lip motion. After the lip movement analysis, the audio analyzed output and video analyzed output is combined together to distinguish the voiced/unvoiced region with a SVM classifier.


Keywords: Single Frequency Filtering SFF, Voice Activity Detection VAD, spectral resolution, lip motion, Support Vector Machine SVM


Edition: Volume 5 Issue 6, June 2016


Pages: 1669 - 1672


DOI: https://www.doi.org/10.21275/NOV164643



Make Sure to Disable the Pop-Up Blocker of Web Browser




Text copied to Clipboard!
Avani S Babu, Amrutha V Nair, "Discriminating Speech and Nonspeech from Video Signals using SFF VAD", International Journal of Science and Research (IJSR), Volume 5 Issue 6, June 2016, pp. 1669-1672, https://www.ijsr.net/getabstract.php?paperid=NOV164643, DOI: https://www.doi.org/10.21275/NOV164643



Similar Articles

Downloads: 112

Review Papers, Electronics & Communication Engineering, India, Volume 6 Issue 3, March 2017

Pages: 2281 - 2283

A Review on Pedestrian Detection Techniques

Roshan Baba Kawade

Share this Article

Downloads: 126

Survey Paper, Electronics & Communication Engineering, India, Volume 5 Issue 2, February 2016

Pages: 701 - 703

Survey on Car Detection Techniques Using Aerial Images

Azharoddin Inamdar, Zameer Farooqui

Share this Article

Downloads: 128

M.Tech / M.E / PhD Thesis, Electronics & Communication Engineering, India, Volume 4 Issue 8, August 2015

Pages: 782 - 786

Identification Classification and Monitoring of Traffic Sign Using HOG and Neural Networks

Karthik B, Hari Krishna Murthy, Mukul Manohar

Share this Article

Downloads: 139

M.Tech / M.E / PhD Thesis, Electronics & Communication Engineering, India, Volume 5 Issue 7, July 2016

Pages: 307 - 310

Implementation of CORDIC based SVM for Speaker Verification System

Pavithra R, Saritha N. R.

Share this Article

Downloads: 143 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1

Review Papers, Electronics & Communication Engineering, India, Volume 5 Issue 7, July 2016

Pages: 175 - 179

Automated Brain Tumor Detection and Brain MRI Classification Using Artificial Neural Network - A Review

Kalpana U. Rathod, Y. D. Kapse

Share this Article
Top