MobiLoRA: Accelerating LoRA-based LLM Inference on Mobile Devices via Context-aware KV Cache Optimization

Borui Li
Borui Li
Ph.D., Assistant Professor

My research interests include Emboided AI system, LLM-empowered IoT, MLSys etc.