Publications
Show only highlights
Clear all filters. X of X publications are hidden by the filters.
2025
Towards End-to-End Latency Guarantee in MEC Live Video Analytics with App-RAN Mutual Awareness
Juheon Yi,
Goodsol Lee,
Minkyung Jeong,
Seokgyeong Shin,
Daehyeok Kim,
Youngki Lee
HALoS: Hierarchical Asynchronous Local SGD over Slow Networks for Geo-Distributed Large Language Model Training
Geon-Woo Kim,
Junbo Li,
Shashidhar Gandham,
Omar Baldonado,
Adithya Gangidi,
Pavan Balaji,
Zhangyang Wang,
Aditya Akella
On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention for Long-Context LLM Serving
Yeonju Ro,
Zhenyu Zhang,
Souvik Kundu,
Zhangyang Wang,
Aditya Akella
MTP: Transport for In-Network Computing
Tao Ji,
Rohan Vardekar,
Balajee Vamanan,
Brent E. Stephens,
Aditya Akella
Portable and High-Performance SmartNIC Programs with Alkali
Jiaxin Lin,
Zhiyuan Guo,
Mihir Shah,
Tao Ji,
Yiying Zhang,
Daehyeok Kim,
Aditya Akella
CONGO: Compressive Online Gradient Optimization
Jeremy Carleton,
Prathik Vijaykumar,
Divyanshu Saxena,
Dheeraj Narasimha,
Srinivas Shakkottai,
Aditya Akella
Copper and Wire: Bridging Expressiveness and Performance for Service Mesh Policies
Divyanshu Saxena,
William Zhang,
Shankara Pailoor,
Isil Dillig,
Aditya Akella
How I learned to stop worrying and love learned OS policies
Divyanshu Saxena,
Jiayi Chen,
Sujay Yadalam,
Yeonju Ro,
Rohit Dwivedula,
Eric H. Campbell,
Aditya Akella,
Christopher J. Rossbach,
Michael Swift
2024
Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design
Ruisi Cai*,
Yeonju Ro*,
Geon-woo Kim,
Peihao Wang,
Babak Ehteshami Bejnordi,
Aditya Akella,
Zhangyang Wang
MOSEL: Inference Serving Using Dynamic Modality Selection
Bodun Hu,
Le Xu,
Jeongyoon Moon,
Neeraja Yadwadkar,
Aditya Akella
FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping
Ajay Jaiswal,
Bodun Hu,
Lu Yin,
Yeonju Ro,
Shiwei Liu,
Tianlong Chen,
Aditya Akella
Optimizing Transformer Inference with Selective Distillation: Layerwise Conversion to Linear Attention
Yeonju Ro,
Zhenyu Zhang,
Vijay Chidambaram,
Aditya Akella
On the Criticality of Integrity Protection in 5G Fronthaul Networks
Jiarong Xing,
Sophia Yoo,
Xenofon Foukas,
Daehyeok Kim,
Michael K. Reiter
ChainedFilter: Combining Membership Filters by Chain Rule
Haoyu Li,
Liuhui Wang,
Qizhi Chen,
Jianan Ji,
Yuhan Wu,
Yikai Zhao,
Tong Yang,
Aditya Akella
Cassini: Network-Aware Job Scheduling in Machine Learning Clusters
Sudarsanan Rajasekaran,
Manya Ghobadi,
Aditya Akella
2023
On a Foundation Model for Operating Systems
Divyanshu Saxena,
Nihal Sharma,
Donghyun Kim,
Rohit Dwivedula,
Jiayi Chen,
Chenxi Yang,
Sriram Ravula,
Zichao Hu,
Aditya Akella,
Sebastian Angel,
Joydeep Biswas,
Swarat Chaudhuri,
Isil Dillig,
Alex Dimakis,
Daehyeok Kim,
Chris Rossbach,
Gang Wang
Yama: Providing Performance Isolation for Black-Box Offloads
Tao Ji,
Divyanshu Saxena,
Brent E. Stephens,
Aditya Akella
Jiarong Xing,
Junzhi Gong,
Xenofon Foukas,
Anuj Kalia,
Daehyeok Kim,
Manikanta Kotaru
LogNIC: A High-Level Performance Model for SmartNICs
Zerui Guo,
Jiaxin Lin,
Yuebin Bai,
Daehyeok Kim,
Michael Swift,
Aditya Akella,
Ming Liu
Configuring the OS for Tomorrow's Robots
Madhav Tummala,
Daehyeok Kim,
Joydeep Biswas,
Aditya Akella
Resilient Baseband Processing in Virtualized RANs with Slingshot
Nikita Lazarev,
Tao Ji,
Anuj Kalia,
Daehyeok Kim,
Ilias Marinos,
Francis Y. Yan,
Christina Delimitrou,
Zhiru Zhang,
Aditya Akella
Darwin: Flexible Learning-based CDN Caching
Jiayi Chen,
Nihal Sharma,
Tarannum Khan,
Shu Liu,
Brian Chang,
Aditya Akella,
Sanjay Shakkottai,
Ramesh Sitaraman
Blink-hash: An Adaptive Hybrid Index for In-Memory Time-Series Databases
Hokeun Cha,
Xiangpeng Hao,
Tianzheng Wang,
Huanchen Zhang,
Aditya Akella,
Xiangyao Yu
Navigating Performance-Efficiency Tradeoffs in Serverless Computing: Deduplication to the Rescue!
Divyanshu Saxena,
Tao Ji,
Arjun Singhvi,
Junaid Khalid,
Aditya Akella
Lowering the Pre-training Tax for Gradient-based Subset Training: A Lightweight Distributed Pre-Training Toolkit
Yeonju Ro,
Zhangyang Wang,
Vijay Chidambaram,
Aditya Akella
Invited Paper: Towards Efficient Microservice Communication
Divyanshu Saxena,
William Zhang,
Madhav Tummala,
Saksham Goel,
Aditya Akella
Invited paper
Better Together: Jointly Optimizing ML Collective Scheduling and Execution Planning using SYNDICATE
Kshiteej Mahajan,
Ching-Hsiang Chu,
Srinivas Sridharan,
Aditya Akella
Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning
Pengfei Zheng,
Rui Pan,
Tarannum Khan,
Shivaram Venkataraman,
Aditya Akella
RingLeader: Efficiently Offloading Intra-Server Orchestration to NICs
Jiaxin Lin,
Adney Cardoza,
Tarannum Khan,
Yeonju Ro,
Brent E. Stephens,
Hassan Wassel,
Aditya Akella
Sketchovsky: Enabling Ensembles of Sketches on Programmable Switches
Hun Namkung,
Zaoxing Liu,
Daehyeok Kim,
Vyas Sekar,
Peter Steenkiste
ExoPlane: An Operating System for On-Rack Switch Resource Augmentation
Daehyeok Kim,
Vyas Sekar,
Srinivasan Seshan
Towards a Machine Learning-Assisted Kernel with LAKE
Henrique Fingler,
Isha Tarte,
Hangchen Yu,
Ariel Szekely,
Bodun Hu,
Aditya Akella,
Christopher Rossbach
Towards Accelerating Data Intensive Application's Shuffle Process Using SmartNICs
Jiaxin Lin,
Tao Ji,
Xiangpeng Hao,
Hokeun Cha,
Yanfang Le,
Xiangyao Yu,
Aditya Akella
SIGMETRICS in Orlando, FL, USA
2023
2022
Congestion Control in Machine Learning Clusters
Sudarsanan Rajasekaran,
Manya Ghobadi,
Gautam Kumar,
Aditya Akella
Multi-agent Databases via Independent Learning
Chi Zhang,
Olga Papaemmanouil,
Josiah P. Hanna,
Aditya Akella
Impact of RoCE Congestion Control Policies on Distributed Training of DNNs
Tarannum Khan,
Saeed Rashidi,
Srinivas Sridharan,
Pallavi Shurpali,
Aditya Akella,
Tushar Krishna
Think before you shuffle: data-driven shuffles for geo-distributed analytics
Maruth Goyal,
Aditya Akella
Jiffy: Elastic Far-Memory for Stateful Serverless Analytics
Anurag Khandelwal,
Yupeng Tang,
Rachit Agarwal,
Aditya Akella,
Ion Stoica
Memory Deduplication for Serverless Computing with Medes
Divyanshu Saxena,
Tao Ji,
Arjun Singhvi,
Junaid Khalid,
Aditya Akella
Elastic Model Aggregation with Parameter Service
Juncheng Gu,
Mosharaf Chowdhury,
Kang G. Shin,
Aditya Akella
2021
Doing more by doing less: how structured partial backpropagation improves deep learning clusters
Adarsh Kumar,
Kausik Subramanian,
Shivaram Venkataraman,
Aditya Akella
PL2: Towards Predictable Low Latency in Rack-Scale Networks
Yanfang Le,
Radhika Niranjan Mysore,
Lalith Suresh,
Gerd Zellweger,
Sujata Banerjee,
Aditya Akella,
Michael M. Swift
TCP is Harmful to In-Network Computing: Designing a Message-Oriented Transport Protocol (MTP)
Brent E. Stephens,
Darius Grassi,
Hamidreza Almasi,
Balajee Vamanan,
Aditya Akella
A Vision for Runtime Programmable Networks
Jiarong Xing,
Yiming Qiu,
Kuo-Feng Hsu,
Hongyi Liu,
Matty Kadosh,
Alan Lo,
Aditya Akella,
Thomas Anderson,
Arvind Krishnamurthy,
T. S. Eugene Ng,
Ang Chen
Atoll: A Scalable Low-Latency Serverless Platform
Arjun Singhvi,
Arjun Balasubramanian,
Kevin Houck,
Mohammed Danish Shaikh,
Shivaram Venkataraman,
Aditya Akella
D2R: Policy-Compliant Fast Reroute
Kausik Subramanian,
Anubhavnidhi Abhashkumar,
Loris D'Antoni,
Aditya Akella
CliqueMap: Productionizing an RMA-Based Distributed Caching System
Arjun Singhvi,
Aditya Akella,
Maggie Anderson,
Rob Cauble,
Harshad Deshmukh,
Dan Gibson,
Milo M. K. Martin,
Amanda Strominger,
Thomas F. Wenisch,
Amin Vahdat
Running BGP in Data Centers at Scale
Anubhavnidhi Abhashkumar*,
Kausik Subramanian*,
Alexey Andreyev,
Hyojeong Kim,
Nanda Kishore,
Jingyi Yang,
Petr Lapukhov,
Aditya Akella,
James Hongyi Zeng
ATP: In-network Aggregation for Multi-tenant Learning
ChonLam Lao*,
Yanfang Le*,
Kshiteej Mahajan,
Yixi Chen,
Wenfei Wu,
Aditya Akella,
Michael Swift
Best Paper Award
Whiz: Data-Driven Analytics Execution
Arjun Singhvi*,
Robert Grandl*,
Raajay Viswanathan,
Aditya Akella
Accelerating Deep Learning Inference via Learned Caches
Arjun Balasubramanian,
Adarsh Kumar,
Yuhan Liu,
Han Cao,
Shivaram Venkataraman,
Aditya Akella
2020
AED: Incrementally Synthesizing Policy-Compliant and Manageable Configurations
Anubhavnidhi Abhashkumar,
Aaron Gember-Jacobson,
Aditya Akella
PANIC: A High-Performance Programmable NIC for Multi-tenant Networks
Jiaxin Lin,
Kiran Patel,
Brent Stephens,
Anirudh Sivaraman,
Aditya Akella
SNF: Serverless Network Functions
Arjun Singhvi,
Junaid Khalid,
Aditya Akella,
Sujata Banerjee
Network-accelerated Distributed Machine Learning for Multi-Tenant Settings
Raajay Viswanathan,
Arjun Balasubramanian,
Aditya Akella
1RMA: Re-envisioning Remote Memory Access for Multi-tenant Datacenters
Arjun Singhvi,
Aditya Akella,
Dan Gibson,
Thomas F. Wenisch,
Monica Wong-Chan,
Sean Clark,
Milo M. K. Martin,
Moray McLaren,
Prashant Chandra,
Rob Cauble,
Hassan M. G. Wassel,
Behnam Montazeri,
Simon L. Sabato,
Joel Scherpelz,
Amin Vahdat
Detecting Network Load Violations for Distributed Control Planes
Kausik Subramanian,
Anubhavnidhi Abhashkumar,
Loris D'Antoni,
Aditya Akella
Tiramisu: Fast and General Network Verification
Anubhavnidhi Abhashkumar,
Aaron Gember-Jacobson,
Aditya Akella
Themis: Fair and Efficient GPU Cluster Scheduling for Machine Learning Workloads
Kshiteej Mahajan,
Arjun Balasubramanian,
Arjun Singhvi,
Shivaram Venkataraman,
Aditya Akella,
Amar Phanishayee,
Shuchi Chawla
Liveness Verification of Stateful Networks.
Farnaz Yousefi,
Anubhavnidhi Abhashkumar,
Kausik Subramanian,
Kartik Hans,
Soudeh Ghorbani,
Aditya Akella
Automated Verification of Customizable Middlebox Properties with Gravel
Kaiyuan Zhang,
Danyang Zhuo,
Aditya Akella,
Arvind Krishnamurthy,
Xi Wang
2019
Smurf: Self-Service String Matching Using Random Forests
Paul Suganthan,
Adel Ardalan,
AnHai Doan,
Aditya Akella
On the Impact of Cluster Configuration on RoCE Application Design
Yanfang Le,
Brent Stephens,
Aditya Akella,
Michael Swift
Best Paper Award
Accelerating Deep Learning Inference via Freezing
Adarsh Kumar,
Arjun Balasubramanian,
Shivaram Venkataraman,
Aditya Akella
The Design and Operation of a Platform for Advanced Cloud Experimentation
Cloudlab Team
Loom: Flexible and Efficient NIC Packet Scheduling
Brent Stephens,
Aditya Akella,
Michael Swift
Correctness and Performance for Stateful Chained Network Functions
Junaid Khalid,
Aditya Akella
2018
Synthesis of Fault-Tolerant Distributed Router Configurations
Kausik Subramanian,
Loris D' Antoni,
Aditya Akella
Your Programmable NIC Should be a Programmable Switch
Brent Stephens,
Aditya Akella,
Michael Swift
Dynamic Query Re-Planning using QOOP
Kshiteej Mahajan,
Mosharaf Chowdhury,
Aditya Akella,
Shuchi Chawla
Iron: Isolating Network-based CPU in Container Environments
Junaid Khalid,
Eric Rozner,
Wesley Felter,
Cong Xu,
Karthick Rajamani,
Alexandre Ferreira,
Aditya Akella
Monarch: Gaining Command on Geo-Distributed Graph Analytics
Anand Padmanabha Iyer,
Aurojit Panda,
Mosharaf Chowdhury,
Aditya Akella,
Scott Shenker,
Ion Stoica
Bridging the GAP: Towards Approximate Graph Analytics
Anand Padmanabha Iyer,
Aurojit Panda,
Shivaram Venkataraman,
Mosharaf Chowdhury,
Aditya Akella,
Scott Shenker,
Ion Stoica
Best Paper Award
RoGUE: RDMA over Generic Unconverged Ethernet
Yanfang Le,
Brent Stephens,
Arjun Singhvi,
Aditya Akella,
Michael Swift
2017
Low Latency Software Rate Limiters for Cloud Networks
Keqiang He,
Weite Qin,
Qiwei Zhang,
Wenfei Wu,
Junjie Yang,
Tian Pan,
Chengchen Hu,
Jiao Zhang,
Brent Stephens,
Aditya Akella,
Ying Zhang
P5: Policy-driven optimization of P4 pipeline
Anubhavnidhi Abhashkumar,
Jeongkeun Lee,
Jean Tourrilhes,
Sujata Banerjee,
Wenfei Wu,
Joon-Myung Kang,
Aditya Akella
Genesis: Synthesizing Forwarding Tables in Multi-tenant Networks
Kausik Subramanian,
Loris D' Antoni,
Aditya Akella
UNO: Unifying Host and Smart NIC Offload for Flexible Packet Processing
Yanfang Le,
Hyunseok Chang,
Sarit Mukherjee,
Limin Wang,
Aditya Akella,
Michael Swift,
T.V. Lakshman.
Automatically Repairing Network Control Planes Using an Abstract Representation
Aaron Gember-Jacobson,
Aditya Akella,
Ratul Mahajan,
Harry Liu
Granular Computing and Network Intensive Applications: Friends or Foes?
Arjun Singhvi,
Sujata Banerjee,
Yotam Harchol,
Aditya Akella,
Mark Peek,
Pontus Rydin