TY - GEN
T1 - Speaker segmentation of interviews using integrated video and audio change detectors
AU - Lagrange, Mathieu
AU - Martins, Luis Gustavo
AU - Teixeira, Luis F.
AU - Tzanetakis, George
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2007
Y1 - 2007
N2 - In this paper, we study the use of audio and visual cues to perform speaker segmentation of audiovisual recordings of formal meetings such as interviews, lectures, or court-room sessions. The sole use of audio cues for such recordings can be ineffective due to low recording quality and high level of background noise. We propose to use additional cues from the video stream by exploiting the relative static locations of speakers among the scene. The experiments show that the combination of those multiple cues helps to identify more robustly the transitions among speakers.
AB - In this paper, we study the use of audio and visual cues to perform speaker segmentation of audiovisual recordings of formal meetings such as interviews, lectures, or court-room sessions. The sole use of audio cues for such recordings can be ineffective due to low recording quality and high level of background noise. We propose to use additional cues from the video stream by exploiting the relative static locations of speakers among the scene. The experiments show that the combination of those multiple cues helps to identify more robustly the transitions among speakers.
UR - http://www.scopus.com/inward/record.url?scp=46749117574&partnerID=8YFLogxK
U2 - 10.1109/CBMI.2007.385415
DO - 10.1109/CBMI.2007.385415
M3 - Conference contribution
AN - SCOPUS:46749117574
SN - 1424410118
SN - 9781424410118
T3 - CBMI'2007 - 2007 International Workshop on Content-Based Multimedia Indexing, Proceedings
SP - 219
EP - 226
BT - CBMI'2007 - 2007 International Workshop on Content-Based Multimedia Indexing, Proceedings
T2 - CBMI'2007 - 2007 International Workshop on Content-Based Multimedia Indexing
Y2 - 25 June 2007 through 27 June 2007
ER -