Agenda

PhD Thesis Defence

Multi-Microphone Signal Parameter Estimation in Various Acoustic Scenarios

Changheng Li

Many modern devices, such as mobile phones, hearing aids and (hands-free) acoustic humanmachine interfaces are equipped with microphone arrays that can be used for various applications. These applications include source separation, audio quality enhancement, speech intelligibility improvement and source localization. In an ideal anechoic chamber, the signals received by ideal microphones are just attenuated and delayed version of the original sound. However, in practice, obstacles such as the floor, the ceiling and the surrounding walls will reflect the sound to the microphones. Also, the microphone itself will generate noise, distorting the recorded signals. Lastly, it is possible that multiple point sources are active simultaneously. When we consider one point source as the target signal, the other sources could be considered interfering signals. These distortions make it difficult to get access to the target signal. Therefore, spatial filtering is often applied to the microphone signals.

To achieve satisfying performance, these spatial filters typically need to be adaptive to the (changing) scene. Specifically, the filter coefficients depend on the acoustic-scene related parameters that model the microphone signals. These parameters, such as the relative transfer functions (RTFs) of the sources, the power spectral densities (PSDs) of the sources, the late reverberation and the ambient noise, are typically unknown in practice. Therefore, estimation of these parameters is crucial and thus the main focus of the dissertation. While it is relatively straightforward to estimate these parameters in less complex acoustic scenes, these algorithms are usually not applicable and not extendable to more complex acoustic scenes. Therefore, the complexity of the estimation methods needed depends on the complexity of the acoustic scene.

In his thesis, the author considers to estimate the RTF under varying assumptions and conditions, resulting in the joint estimation of the RTF and the power spectral densities of the sources, the late reverberation, and the noise.

Additional information ...

Overview of PhD Thesis Defence