Real Time Recognition Of Speakers From Internet Audio Stream

In this paper we present an automatic speaker recognition technique with the use of the Internet radio lossy (encoded) speech signal streams. We show an influence of the audio encoder (e.g., bitrate) on the speaker model quality. The model of each speaker was calculated with the use of the Gaussian mixture model (GMM) approach. Both the speaker recognition and the further analysis were realized with the use of short utterances to facilitate real time processing. The neighborhoods of the speaker models were analyzed with the use of the ISOMAP algorithm. The experiments were based on four 1-hour public debates with 7–8 speakers (including the moderator), acquired from the Polish radio Internet services. The presented software was developed with the MATLAB environment.

eISSN:: 2300-3405
Language:: English

Publication timeframe:: 4 times per year
Journal Subjects:: Computer Sciences, Artificial Intelligence, Software Development

Journal RSS Feed

Real Time Recognition Of Speakers From Internet Audio Stream

Published Online: Sep 30, 2015

Page range: 223 - 233

DOI: https://doi.org/10.1515/fcds-2015-0014

Keywords
Speaker recognition, GMM, Internet radio, ISOMAP

© Radoslaw Weychan et al.

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.

Real Time Recognition Of Speakers From Internet Audio Stream

Published Online: Sep 30, 2015

Page range: 223 - 233

DOI: https://doi.org/10.1515/fcds-2015-0014

KeywordsSpeaker recognition, GMM, Internet radio, ISOMAP

© Radoslaw Weychan et al.

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.

Keywords
Speaker recognition, GMM, Internet radio, ISOMAP