I work on autonomous systems. I have several years of experience developing, deploying and improving production ML systems in various domains at scale with a penchant for speeding and compressing models.
I graduated with Master's from IIT-Bombay where I worked at Vision, Graphics and Imaging Lab (ViGIL) on semi and weakly supervised deep learning methods.
I promote Open Source Software and keep making my little contributions as and when time permits. My publications can be found on Google Scholar.
Selected Publications
Towards Conversational Medical AI with Eyes, Ears and a Voice
LiRaNet: End-to-End Trajectory Prediction using Spatio-Temporal Radar Fusion
MS-Net: Mixed-Supervision Fully-Convolutional Networks for Full-Resolution Segmentation
Cycle-Consistency for Robust Visual Question Answering
Pythia: A Platform for Vision & Language Research
Towards VQA Models That Can Read