Blog | Shangeth Rajaa

May 2024

#Voice AI #Speech LLM #Conversational AI

A multimodal speech LLM that processes audio directly to enhance conversational AI while reducing overhead compared to traditional ASR-LLM-TTS pipelines.

Feature Disentanglement - I

February 2022

#Voice AI #Speech Representation #Deep Learning

How deep learning models can isolate independent factors of variation in data through VAEs and Beta-TCVAE, enabling controlled synthesis and better downstream representations.

Code Mixing in NLP and Speech

August 2021

#Speech #NLP #Deep Learning

Notes from a seminar covering six papers on code-mixing across NLP, speech synthesis, and speech recognition — including multilingual synthesis and code-mixed ASR.

KL Divergence: Entropy, Cross Entropy, and Mutual Information in PyTorch

September 2020

#Information Theory #Python #PyTorch

A walkthrough of information entropy, KL divergence, mutual information, and cross entropy — with PyTorch implementations.

Off-Policy Monte Carlo Prediction with Importance Sampling

August 2020

#Reinforcement Learning #Python

How importance sampling lets us estimate value functions under a target policy using episodes collected by a different behavior policy.