Publications

DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining

March 2026 Shangeth Rajaa Interspeech 2026 (Accepted)

#Voice AI #Turn-Taking #Spoken Dialogue #Speech LLM

Dual-channel generative pretraining for learning natural turn-taking in spoken dialogue without labeled data. A 0.5B model that outperforms models 6x its size on turn prediction.

View

Improving End-to-End SLU Performance with Prosodic Attention and Distillation

August 2023 Shangeth Rajaa Interspeech 2023, pp. 1114–1118

#Voice AI #Spoken Language Understanding #Prosody #Speech

Two techniques for incorporating prosody into end-to-end SLU: prosody-attention and prosody-distillation. Up to 8% intent classification accuracy improvement on SLURP.

View

Improving Spoken Language Identification with Map-Mix

June 2023 Shangeth Rajaa, Kriti Anandan, Swaraj Dalmia, Tarun Gupta, Eng Siong Chng ICASSP 2023 — IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1–5

#Voice AI #Speech #Language Identification #Data Augmentation

Map-Mix: a data augmentation approach using model training dynamics to guide latent mixup sampling, giving ~2% weighted F1 improvement on low-resource dialect classification.

View

Skit-S2I: An Indian Accented Speech to Intent Dataset

December 2022 Shangeth Rajaa, Swaraj Dalmia, Kumarmanas Nethil arXiv preprint arXiv:2212.13015

#Voice AI #Spoken Language Understanding #Dataset #Speech

The first public Indian-accented SLU dataset in the banking domain. SSL speech representations beat ASR-based approaches for intent classification.

View

Learning Speaker Representation with Semi-supervised Learning Approach for Speaker Profiling

October 2021 Shangeth Rajaa, Pham Van Tung, Chng Eng Siong arXiv preprint arXiv:2110.13653

#Voice AI #Speaker Profiling #Speech Representation #Semi-supervised Learning

A semi-supervised framework for speaker profiling that leverages external unlabelled corpora via supervised, unsupervised, and consistency training, achieving RMSE of 6.8 years on age estimation.

View

Towards Automated Deep Learning: Analysis of the AutoDL Challenge Series 2019

June 2020 Zhengying Liu, Zhen Xu, Shangeth Rajaa, Meysam Madadi, Julio C. S. Jacques Junior, Sergio Escalera, Adrien Pavao, Sebastien Treguer, Wei-Wei Tu, Isabelle Guyon Proceedings of Machine Learning Research (PMLR), NeurIPS 2019 Competition Track, Vol. 123, pp. 242–252

#AutoML #Deep Learning #NeurIPS

Design and results of the AutoDL challenge series 2019 (AutoCV, AutoCV2, AutoNLP, AutoSpeech, AutoDL), showing winning solutions generalize to unseen datasets.

View

Overview and Unifying Conceptualization of Automated Machine Learning

September 2019 Zhengying Liu, Zhen Xu, Meysam Madadi, Julio Jacques Junior, Sergio Escalera, Shangeth Rajaa, Isabelle Guyon ECML PKDD 2019 Workshop on Automating Data Science (ADS)

#AutoML #Machine Learning #Meta-learning

A novel generic mathematical formulation of AutoML unifying HPO and meta-learning, showing meta-learning addresses AutoML more fundamentally than hyperparameter optimization.

Convolutional Feature Extraction and Neural Arithmetic Logic Units for Stock Prediction

April 2019 Shangeth Rajaa, Jajati Keshari Sahoo International Conference on Advances in Computing and Data Sciences (ICACDS 2019), Springer, pp. 349–359

#Deep Learning #Finance #CNN

A data-driven deep learning approach combining CNN feature extraction with Neural Arithmetic Logic Units (NALU) for stock price prediction using historical price data.

View