The Embedding Report
Yilong Zhao
1 article tagged with this entity.
BlendServe: Optimizing Offline Inference for Auto-regressive Large Models with Resource-aware Batching
via export.arxiv.org · Global · 17h ago