Avatar of Shangeth Rajaa

Shangeth Rajaa

Anyreach AI

Senior ML Scientist working on Voice AI, Turn-Taking, Full-Duplex Spoken Dialogue Systems, and Multi-Modal Speech LLMs.

Resume
  • About
  • CV
  • Publications
  • Blog

#reinforcement learning

Content tagged with "reinforcement learning"

Off-Policy Monte Carlo Prediction with Importance Sampling
2020-08-01
#Reinforcement Learning #Python

How importance sampling lets us estimate value functions under a target policy using episodes collected by a different behavior policy.

© 2026 Shangeth Rajaa.