About

Shital Shah
If universe is an optimizer, what is its loss function?
Hello there! I’m a Research Engineer at Microsoft Research with interests in deep learning and reinforcement learning.
Some of my open source works:
- I lead a team for the code infrastructure to train the Phi series of models: Phi-1, Phi-2, Phi-3, Phi-4.
- I co-created Archai, Neural Architecture Search (NAS) framework that we used to create one of the super tiny Transformer models powering the text completition feature in many Microsoft products.
- I conceived and created AirSim, a physically and visually realistic cross-platform simulator for AI research
- I conceived and created TensorWatch, a new approach for debugging training and visualization of vision models.
You can find a lot of my hobby projects on GitHub.
You can find research papers I contributed to at Google Scholar.
You can follow me on twitter for posts mainly on deep learning code and research.