Darius Pétermann

Deep Learning-Source Separation

April 2024

ICASSP 2024: Hyperbolic Distance-Based Speech Separation

We explore the task of hierarchical distance-based speech separation defined on a hyperbolic manifold.

Deep Learning-Source Separation

July 2023

Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks in IEEE/ACM Transactions on Audio, Speech, and Language Processing

We extend the cocktail fork source separation formulation by tackling transcription for each of the three sources involved: audio tagging for music and sound effect, and ASR for speech

Deep Learning-Source Separation

June 2023

[Best paper Award!]
ICASSP 2023: Hyperbolic Audio Source Separation

Our ICASSP paper on audio source separation using embeddings on a hyperbolic manifold won the best student paper award!

Deep Learning-Audio Coding

June 2023

ICASSP 2023: Multi-Band Neural Audio Coding

A multi-band approach to neural audio coding, which exploits the U-Net upsampling capabilities and offers band-specific bitrate assignments.

Deep Learning-Source Separation

May 2022

ICASSP 2022: SpaIn-Net: Spatially Informed Music Source Separation

We present an interactive music source separation system allowing a user to specify the spatial location of the instruments of interest. We explore various conditioning mechanisms ranging from simple concatenation to positional encoding and AdaIN.

Deep Learning-Source Separation

May 2022

ICASSP 2022: The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks

We introduce a new source separation dataset, "Divide and Remaster" (DnR), for training and testing algorithms aiming at separating monaural audio signals into speech, music, and sound effects/background stems.

Deep Learning-Audio Codec

October 2021

WASPAA 2021: Accepted Paper on Neural Audio Coding

Our paper entitled "HARP-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable Neural Audio Coding" has been accepted to the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

Deep Learning-Music

August 2020

ISMIR 2020: SATB Voice Separation

Our paper entitled "Deep Learning Based Source Separation Applied to Choir Ensembles" has been accepted to the 21st International Society for Music Information Retrieval Conference.

Csound-Generative

September 2015

Generative Music in C

A simple yet powerful program written in C for algorithmic music generation.

ICASSP 2024: Hyperbolic Distance-Based Speech Separation

Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks in IEEE/ACM Transactions on Audio, Speech, and Language Processing

[Best paper Award!]ICASSP 2023: Hyperbolic Audio Source Separation

ICASSP 2023: Multi-Band Neural Audio Coding

ICASSP 2022: SpaIn-Net: Spatially Informed Music Source Separation

ICASSP 2022: The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks

WASPAA 2021: Accepted Paper on Neural Audio Coding

ISMIR 2020: SATB Voice Separation

Generative Music in C

[Best paper Award!]
ICASSP 2023: Hyperbolic Audio Source Separation