I’m a Research Engineer at Meta, working on speech synthesis.
My work focuses on how machines generate and understand human voice.
Previously at Play.AI (acquired by Meta), I worked across the full lifecycle of generative voice modelling:
- 🗣️ Instant Voice Cloning across multiple languages.
- 🗣️ PlayDiffusion — the first audio-diffusion model for speech synthesis (feature blog).
- 🗣️ Our work powers the voice of Deepika Padukone on the Meta AI app.
- Before that, I co-founded SiteRecon where we analyze aerial imagery to map outdoor spaces. SiteRecon is part of TinySeed 2023 Spring Batch. Prior to it, I was an Applied Scientist at Amazon India, working on weakly and semi-supervised learning.
- I received my bachelor’s and master’s degree in Mathematics and Computing Sciences from IIT Kharagpur, and had small stints at MILA, UNLV, Morphle Labs, and ParallelDots.
- Research publications at Google Scholar.
- Curating my read books on Goodreads.