Publications

2024

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference.

Wei-Lin Chiang, Lianmin Zheng, Ying Sheng, Anastasios Nikolas Angelopoulos, Tianle Li, Dacheng Li, Hao Zhang, Banghua Zhu, Michael I. Jordan, Joseph E. Gonzalez, Ion Stoica:
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference. CoRR abs/2403.04132 (2024)

Optimizing LLM Queries in Relational Workloads.

Shu Liu, Asim Biswal, Audrey Cheng, Xiangxi Mo, Shiyi Cao, Joseph E. Gonzalez, Ion Stoica, Matei Zaharia:
Optimizing LLM Queries in Relational Workloads. CoRR abs/2403.05821 (2024)

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code.

Naman Jain, King Han, Alex Gu, Wen-Ding Li, Fanjia Yan, Tianjun Zhang, Sida Wang, Armando Solar-Lezama, Koushik Sen, Ion Stoica:
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code. CoRR abs/2403.07974 (2024)

ZKML: An Optimizing System for ML Inference in Zero-Knowledge Proofs.

Bing-Jyue Chen, Suppakit Waiwitlikhit, Ion Stoica, Daniel Kang:
ZKML: An Optimizing System for ML Inference in Zero-Knowledge Proofs. EuroSys 2024: 560-574

RAFT: Adapting Language Model to Domain Specific RAG.

Tianjun Zhang, Shishir G. Patil, Naman Jain, Sheng Shen, Matei Zaharia, Ion Stoica, Joseph E. Gonzalez:
RAFT: Adapting Language Model to Domain Specific RAG. CoRR abs/2403.10131 (2024)

"We Have No Idea How Models will Behave in Production until Production": How Engineers Operationalize Machine Learning.

Shreya Shankar, Rolando Garcia, Joseph M. Hellerstein, Aditya G. Parameswaran:
"We Have No Idea How Models will Behave in Production until Production": How Engineers Operationalize Machine Learning. CoRR abs/2403.16795 (2024)

Optimizing the cloud? Don't train models. Build oracles!

Tiemo Bang, Conor Power, Siavash Ameli, Natacha Crooks, Joseph M. Hellerstein:
Optimizing the cloud? Don't train models. Build oracles! CIDR 2024

Nebula: A Privacy-First Platform for Data Backhaul.

Jean-Luc Watson, Tess Despres, Alvin Tan, Shishir G. Patil, Prabal Dutta, Raluca Ada Popa:
Nebula: A Privacy-First Platform for Data Backhaul. IACR Cryptol. ePrint Arch. 2024: 409 (2024)

Wrapping Rings in Lattices: An Algebraic Symbiosis of Incremental View Maintenance and Eventual Consistency.

Conor Power, Saikrishna Achalla, Ryan Cottone, Nathaniel Macasaet, Joseph M. Hellerstein:
Wrapping Rings in Lattices: An Algebraic Symbiosis of Incremental View Maintenance and Eventual Consistency. PaPoC@EuroSys 2024: 15-22

Composing MPC With LQR and Neural Network for Amortized Efficiency and Stable Control.

Fangyu Wu, Guanhua Wang, Siyuan Zhuang, Kehan Wang, Alexander Keimer, Ion Stoica, Alexandre M. Bayen:
Composing MPC With LQR and Neural Network for Amortized Efficiency and Stable Control. IEEE Trans Autom. Sci. Eng. 21(2): 2088-2101 (2024)

2022

Reliable Transactions in Serverless-Edge Architecture

Published: ICDE'23 (IEEE International Conference on Data Engineering)

Authors Suyash GuptaSajjad RahnamaErik LinsenmayerFaisal NawabMohammad Sadoghi

Reliable Transactions in Serverless-Edge Architecture

Modern edge applications demand novel solutions where edge applications do not have to rely on a single cloud provider (which cannot be in the vicinity of every edge device) or dedicated edge servers (which cannot scale as clouds) for processing compute-intensive tasks. A recent computing philosophy, Sky computing, proposes giving each user ability to select between available cloud providers.
In this paper, we present our serverless-edge co-design, which extends the Sky computing vision. In our serverless-edge co-design, we expect edge devices to collaborate and spawn required number of serverless functions. This raises several key challenges: (1) how will this collaboration take place, (2) what if some edge devices are compromised, and (3) what if a selected cloud provider is malicious. Hence, we design ServerlessBFT, the first protocol to guarantee Byzantine fault-tolerant (BFT) transactional flow between edge devices and serverless functions. We present an exhaustive list of attacks and their solutions on our serverless-edge co-design. Further, we extensively benchmark our architecture on a variety of parameters.

Jiffy: elastic far-memory for stateful serverless analytics

A Khandelwal, Y Tang, R Agarwal, A Akella, I Stoica Proceedings of the Seventeenth European Conference on Computer Systems (Eurosys’22)

The Sky Above The Clouds

Technology ecosystems often undergo significant transformations as they mature. For example, telephony, the Internet, and PCs all started with a single provider, but in the United States each is now served by a competitive market that uses comprehensive and universal technology standards to provide compatibility. This white paper presents our view on how the cloud ecosystem, barely over fifteen years old, could evolve as it matures.

CostCO: An automatic cost modeling framework for secure multi-party computation

Vivian Fang, Lloyd Brown, William Lin, Wenting Zheng, Aurojit Panda, Raluca Ada Popa

New Directions in Cloud Programming

11th Conference on Innovative – 11th Conference on Innovative Data Systems Research, CIDR 2021 Data Systems Research, CIDR 2021 – Cheung, A.; Crooks, N.; Hellerstein, J. M.; and Milano, M.

Serverless Boom or Bust? An Analysis of Economic Incentives

12th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud) -Charles Lin, Joseph E. Gonzalez, and Joseph M. Hellerstein.

2021

Snoopy: Surpassing the Scalability Bottleneck of Oblivious Storage

E Dauterman, V Fang, I Demertzis, N Crooks, RA Popa Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles

Basil: Breaking up BFT with ACID (transactions)

Florian Suri-Payer, Matthew Burke, Zheng Wang, Yunhao Zhang, Lorenzo Alvisi, Natacha Crooks Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles