DOAJ Open Access 2023

Joint short-time speaker recognition and tracking using sparsity-based source detection

Guo Yao Zhu Hongyan

Abstrak

A random finite set-based sequential Monte–Carlo tracking method is proposed to track multiple acoustic sources in indoor scenarios. The proposed method can improve tracking performance by introducing recognized speaker identities from the received signals. At the front-end, the degenerate unmixing estimation technique (DUET) is employed to separate the mixed signals, and the time delay of arrival (TDOA) is measured. In addition, a criterion to select the reliable microphone pair is designed to quickly obtain accurate speaker identities from the mixed signals, and the Gaussian mixture model universal background model (GMM-UBM) is employed to train the speaker model. In the tracking step, the update of the weight for each particle is derived after introducing the recognized speaker identities, which results in better association between the measurements and sources. Simulation results demonstrate that the proposed method can improve the accuracy of the filter states and discriminate the sources close to each other.

Topik & Kata Kunci

Acoustics in engineering. Acoustical engineering Acoustics. Sound

Penulis (2)

Guo Yao

Zhu Hongyan

Format Sitasi

APA MLA BibTeX

Yao, G., Hongyan, Z. (2023). Joint short-time speaker recognition and tracking using sparsity-based source detection. https://doi.org/10.1051/aacus/2023004

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →

Lihat di Sumber doi.org/10.1051/aacus/2023004

Informasi Jurnal

Tahun Terbit: 2023
Sumber Database: DOAJ
DOI: 10.1051/aacus/2023004
Akses: Open Access ✓