Borui LI 李博睿
Yitao Wang
Latest
MobiLoRA: Accelerating LoRA-based LLM Inference on Mobile Devices via Context-aware KV Cache Optimization
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Decentralized Application-Level Adaptive Scheduling for Multi-Instance DNNs on Open Mobile Devices