THU FASTsys

  •   List
  •   Category
  •   Archive
  •   Tag

Parrot Efficient Serving of LLM-based Applications with Semantic Variable

Aug 13, 2024

 OSDI 20


---
 Reporter
  • 周与祺
 Tags
  •  LLM
 Related
  • Characterization of Large Language Model Development in the Datacenter
  • ServerlessLLM Locality-Enhanced Serverless Inference for Large Language Models
  • Language Model is Compression
  • Efficient Memory Management for Large Language Model Serving with PagedAttention
THU FASTsys