Skip to main content

About

Shital Shah

Shital Shah

If universe is an optimizer, what is its loss function?

Hello there! I’m a Research Engineer at Microsoft Research with interests in deep learning and reinforcement learning.

Some of my open source works:

  • I lead a team for the code infrastructure to train the Phi series of models: Phi-1, Phi-2, Phi-3, Phi-4.
  • I co-created Archai, Neural Architecture Search (NAS) framework that we used to create one of the super tiny Transformer models powering the text completition feature in many Microsoft products.
  • I conceived and created AirSim, a physically and visually realistic cross-platform simulator for AI research
  • I conceived and created TensorWatch, a new approach for debugging training and visualization of vision models.

You can find a lot of my hobby projects on GitHub.

You can find research papers I contributed to at Google Scholar.

You can follow me on twitter for posts mainly on deep learning code and research.