Tapan Chugh

I am a PhD student at the University of Washington, where I am advised by Arvind Krishnamurthy and Ratul Mahajan. I am broadly interested in distributed systems and machine learning systems. My thesis research revolves around designing performance models and algorithms to optimize resource efficiency in large-scale machine learning deployments. My ongoing projects seek to improve the quality and efficiency of long context generations in large language models.
My academic journey has been complemented by valuable practical experiences: I am actively contributing to the Model Context Protocol specification and open-source projects in that ecosystem. Until June 2025, I was a student researcher at Systems Research @ Google. Previously, I collaborated with Srikanth Kandula and Ishai Menache as a research intern and visitor at Microsoft Research. Before starting my PhD, I spent two years as a Research Fellow at Microsoft Research India working with Muthian Sivathanu.
selected publications
Publications
-
NSDIEfficient Direct-Connect Topologies for Collective CommunicationsIn 22nd USENIX Symposium on Networked Systems Design and Implementation, NSDI 2025, Philadelphia, PA, USA, April 28-30, 2025, 2025
-
SoCCAnticipatory Resource Allocation for ML TrainingIn Proceedings of the 14th ACM Symposium on Cloud Computing, 2023
-
SIGMETRICSDremel: Adaptive Configuration Tuning of RocksDB KV-StoreProceedings of the ACM on Measurement and Analysis of Computing Systems, 2022
-
SIGCOMMGimbal: enabling multi-tenant storage disaggregation on SmartNIC JBOFsIn Proceedings of the 2021 ACM SIGCOMM 2021 Conference, 2021
-
ASPLOSAstra: Exploiting predictability to optimize deep learningIn Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019