Repository logo
Log In(current)
  • Inicio
  • Personal de Investigación
  • Unidad Académica
  • Publicaciones
  • Colecciones
    Datos de Investigacion Divulgacion cientifica Personal de Investigacion Protecciones Proyectos Externos Proyectos Internos Publicaciones Tesis
  1. Home
  2. Universidad de Santiago de Chile
  3. Publicaciones
  4. Estimating the Number of Speakers by Novel Zig-Zag Nested Microphone Array Based on Wavelet Packet and Adaptive Gcc Method
Details

Estimating the Number of Speakers by Novel Zig-Zag Nested Microphone Array Based on Wavelet Packet and Adaptive Gcc Method

Journal
2022 8th International Conference on Signal Processing and Communication, Icsc 2022
Date Issued
2022
Author(s)
Adasme-Soto, P  
DOI
https://doi.org/10.1109/ICSC56524.2022.10009025
Abstract
In this paper, a new speaker counting algorithm is proposed by novel zig-zag nested array (ZZNA) combining with adaptive generalized cross-correlation (GCC) function (with phase transform (PHAT) and maximum likelihood (ML)) and wavelet packet transform (WPT) with an agglomerative classification method by Elbow decisioning criteria. The proper ZZNA is introduced for covering the acoustical environments and removing the spatial aliasing. Then, the WPT with different frequency resolution is considered for preparing the frequency subbands. The adaptive GCC function based on PHAT and ML weighting filters is done on the microphone pairs for each subbands. Finally, the unsupervised agglomerative classification method with Elbow criteria is considered for classifying the information and speakers counting. The proposed ZZNA-WAGC method is compared with Hilbert envelope, multi-channel correlational recurrent neural network by using of ambisonics features (AF-CRNN) and estimating the number of speakers by density-based classification and clustering decision (ENS-DCCD) algorithms to show the superiority of the method in undesirable scenarios. © 2022 IEEE.
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your Institution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Logo USACH

Universidad de Santiago de Chile
Avenida Libertador Bernardo O'Higgins nº 3363. Estación Central. Santiago Chile.
ciencia.abierta@usach.cl © 2023
The DSpace CRIS Project - Modificado por VRIIC USACH.

  • Accessibility settings
  • Privacy policy
  • End User Agreement
  • Send Feedback
Logo DSpace-CRIS
Repository logo COAR Notify