[HOME] [EDUCATION] [PUBLICATIONS] [RESEARCH] [SOFTWARE] [INTERNSHIPS] [PROJECTS]
TIME DELAY ESTIMATION USING EXCITATION SOURCE INFORMATION IN SPEECH
Speaker
Localization using excitation source information in speech Vikas
C. Raykar, B.Yegnanarayana, S.
R. Mahadeva Prasanna, and Ramani Duraiswami, IEEE Transactions on
Speech and Audio Processing, Volume 13, Issue 5, Part 2, pp. 751-761,
Sep. 2005.
We propose a novel method to estimate the time-delay
between the signals received by a pair of microphones in a noisy
reverberant room, using the excitation source information in speech.
The time-delay is computed by locating the peak in the
cross-correlation of the Hilbert envelope of the Linear Prediction
Residuals. Results show that our method gives better performance than
the GCC-PHAT, GCC-ML and Brandstein's pitch based methods.
[ Matlab code ] [ Source localization demo ] [ Demo Setup] [Face Detection Demo] [ Poster ]
Related publications
Tracking a moving speaker using
excitation source information
Vikas C. Raykar, Ramani
Duraiswami, B.Yegnanarayana, and S. R. Mahadeva Prasanna, In
Proceedings of the 8th Eur. Conf. Speech Communication Technology
(Eurospeech 2003), Geneva, September 2003, pp. 69-72.