Conference

TensorHub: Scalable and Elastic Weight Transfer for LLM RL Training
Modern LLM reinforcement learning (RL) workloads require a highly efficient weight transfer system to scale training across …
Cloudscape: A Study of Storage Services in Modern Cloud Architectures
We present Cloudscape, a dataset of nearly 400 cloud architectures deployed on AWS. We perform an in-depth analysis of the usage of …