講演抄録/キーワード |
講演名 |
2008-10-24 09:30
LOW-COMPLEXITY SPEAKER IDENTIFICATION IN AAC DOMAIN ○Ai Haojun(Wuhan Univ.)・Miki Haseyama(Hokudai) |
抄録 |
(和) |
(まだ登録されていません) |
(英) |
This paper presents an implementation of a low-complexity speaker identification algorithm working in the compressed audio domain. The goal is to perform speaker modeling and identification without decoding the AAC bitstream to extract speaker dependent features, thus saving important system resource. The silence detection and MFCC parameters are calculated from MDCT coefficient other than from the FFT spectrum. Each speaker is modeled by a GMM, which is trained using the EM algorithm to refine the weight and the parameters of each component. The recognition accuracies of our algorithm reach 97% for ARCTIC database with 16% CPU overload comparing to the algorithms based on the analysis of the decoded PCM signals. |
キーワード |
(和) |
/ / / / / / / |
(英) |
Speaker recognition / discrete cosine transforms / audio coding / / / / / |
文献情報 |
映情学技報, vol. 32, pp. 31-34, 2008年10月. |
資料番号 |
|
発行日 |
2008-10-16 (ME) |
ISSN |
Print edition: ISSN 1342-6893 |
PDFダウンロード |
|