THU FASTsys
List
Category
Archive
Tag
Parrot Efficient Serving of LLM-based Applications with Semantic Variable
Aug 13, 2024
OSDI
20
---
Reporter
周与祺
Tags
LLM
Related
Characterization of Large Language Model Development in the Datacenter
ServerlessLLM Locality-Enhanced Serverless Inference for Large Language Models
Language Model is Compression
Efficient Memory Management for Large Language Model Serving with PagedAttention