Ajin Tom

Technology Researcher | Audio Engineer | Composer | Pianist

Headphones recommended…

 Link to listening test : Chapter 4 (approach 1) - Spatialization relying on best panning practices using ERB-based sinusoidal filters:


Here is my paper associated to the above listening test, published at the 146th Audio Engineering Society Convention 2019 (DSP Track) - Dublin, Ireland:

An automatic mixing system for multitrack spatialization for stereo based on unmasking & panning practices

Designed and developed a novel automatic mixing algorithm to spatialize sound sources of a multitrack recording for masking minimization.

Utilized time-frequency analysis techniques: spectral decomposition, sub-grouping, frequency based spreading, particle swarm optimization based on MPEG Psychoacoustic Masking Model.

Link to publication: https://secure.aes.org/forum/pubs/conventions/?elib=20311

AES: http://www.aes.org/events/146/presenters/?ID=8018

Violin excerpt played twice:

1) original track,
2) output track of listener at (0,20) and source rotating about the origin (0,0). Angle of the source with respect to listener evolves from 0 degrees to 180 degrees.

At 0 degrees (direction of maximum power), we observe a flat response and at 90 degrees we perceive the directivity effect (low-pass structures of the template directivity equation 5.10 in section 5.2.2).

This example can be thought of as a scenario in which a violin player rotating abut their axis while performing. (0 degrees - facing the listener, 90 degrees - facing away from the listener).

Audio samples for Chapter 5 (approach 2) - Spatialization relying on directivity: