MAXIMUM-LIKELIHOOD BASED 3D ACOUSTICAL SIGNATURE ESTIMATION

2014-04-25
An audio recording made in a real environment carries an acoustical signature that changes with the acoustical characteristics of the environment and with the recording positions. This signature, which is similar to a 3D room impulse response, contains the directions, levels and arrival times of the direct source and the reflections. Although it is easy to obtain reverberant recordings by convolving clean recordings with the acoustical signature, estimating the signature from an arbitrary recording is a difficult inverse problem. Acoustical signature estimation is important in acoustical analysis, in audio forensics for authentication, in room size and shape estimation, and in improving speech intelligibility through dereverberation. In this work, the directions of intensity vectors obtained from compact microphone array recordings are modelled statistically. The resulting statistical distribution is used to reduce reverberation by means of maximum-likelihood estimation. The dereverberated sound then makes it possible to deconvolve the reverberant recordings and estimate the acoustical signature.
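
The forward and inverse relationship described in the abstract can be illustrated with a minimal single-channel sketch. It is not the method of this work: the sparse impulse response, the tap delays and gains, the regularisation constant `eps`, and the helper names `make_signature`, `reverberate` and `deconvolve` are all assumptions made for illustration. The true clean signal stands in for the dereverberated estimate, and the directional (3D) part of the signature is left out.

```python
import numpy as np

def make_signature(length, taps):
    """Build a sparse impulse response from (delay_in_samples, gain) pairs."""
    h = np.zeros(length)
    for delay, gain in taps:
        h[delay] = gain
    return h

def reverberate(clean, h):
    """Forward problem: convolve the clean signal with the signature."""
    return np.convolve(clean, h)

def deconvolve(reverberant, clean_estimate, length, eps=1e-8):
    """Inverse problem: regularised division in the frequency domain.

    eps is a hypothetical regularisation constant that keeps the
    division stable where the clean spectrum is close to zero.
    """
    n = len(reverberant)
    R = np.fft.rfft(reverberant, n)
    S = np.fft.rfft(clean_estimate, n)  # zero-padded to n samples
    H = R * np.conj(S) / (np.abs(S) ** 2 + eps)
    return np.fft.irfft(H, n)[:length]

# Hypothetical test data: white noise as the clean recording, and a
# direct path plus two reflections as the signature.
rng = np.random.default_rng(0)
clean = rng.standard_normal(4096)
h_true = make_signature(256, [(0, 1.0), (40, 0.6), (130, 0.3)])

reverberant = reverberate(clean, h_true)
h_est = deconvolve(reverberant, clean, len(h_true))

# Arrival times (in samples) of the three strongest components.
arrivals = sorted(np.argsort(np.abs(h_est))[-3:].tolist())
```

With an exact clean signal the deconvolution is nearly perfect. In practice the quality of the estimated signature depends on how well the reverberation has been reduced beforehand, which is why the dereverberation step matters.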

Suggestions

ACOUSTIC SOURCE SEPARATION USING RIGID SPHERICAL MICROPHONE ARRAYS VIA SPATIALLY WEIGHTED ORTHOGONAL MATCHING PURSUIT
Coteli, Mert Burkay; Hacıhabiboğlu, Hüseyin (2018-09-20)
Acoustic source separation refers to the extraction of individual source signals from microphone array recordings of multiple sources made in multipath environments such as rooms. The most straightforward approach to acoustic source separation involves spatial filtering via beamforming. While beamforming works well for a few sources and under low reverberation, its performance diminishes for a high number of sources and/or high reverberation. An informed acoustic source separation method based on the applic...
Spherical harmonics based acoustic scene analysis for object-based audio
Çöteli, Mert Burkay; Hacıhabiboğlu, Hüseyin; Department of Information Systems (2021-02-19)
Object-based audio relies on elemental audio signals from individual sound sources and their associated metadata to be reconstructed at the listener side. While defining audio objects in a production setting is straightforward, it is not trivial to extract audio objects from more realistic recording scenarios such as concerts. Thus, existing object-based audio standards also define scene-based formats alongside object-based representations that provide immersive audio, but without the flexibility provided by...
ON THE ACCURACY OF OPEN SPHERICAL MICROPHONE ARRAYS FOR MEASURING ACOUSTIC INTENSITY
Hacıhabiboğlu, Hüseyin (2013-10-23)
Acoustic intensity can be used for different purposes such as sound source localisation, source separation and spatial audio object coding. Three-dimensional measurement of the acoustic intensity requires the design of special microphone arrays. A theoretical analysis and numerical simulations of intensity measurements using open spherical microphone arrays are presented in this paper. The calculation of the acoustic intensity using signals from an open spherical microphone array is presented first. Error m...
Multiple Sound Source Localization with Rigid Spherical Microphone Arrays via Residual Energy Test
Coteli, Mert Burkay; Hacıhabiboğlu, Hüseyin (2019-05-01)
The estimation of the directions-of-arrival (DOAs) of multiple sound sources is a fundamental stage in acoustic scene analysis. Many application areas such as robot audition and object-based audio (OBA) broadcast require that DOA estimation is computationally efficient to allow real-time operation. We propose a new DOA estimation approach based on a sparse representation of recorded sound fields as a linear combination of spatially bandlimited impulses in this paper. The proposed algorithm operates on a tim...
Panoramic recording and reproduction of multichannel audio using a circular microphone array
Hacıhabiboğlu, Hüseyin (2009-10-18)
Multichannel audio reproduction generally suffers from one or both of the following problems: i) the recorded audio has to be artificially manipulated to provide the necessary spatial cues, which reduces the consistency of the reproduced sound field with the actual one, and ii) reproduction is not panoramic, which degrades realism when the listener is not seated in a desired ideal position facing the center channel. A recording method using a circularly symmetric array of differential microphones, and a rep...
Citation Formats
B. Günel Kılıç, “MAXIMUM-LIKELIHOOD BASED 3D ACOUSTICAL SIGNATURE ESTIMATION,” 2014, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/55577.