%0 Journal Article %T KVPR: Efficient LLM Inference with I/O-Aware KV Cache Partial Recomputation %A Jiang, Chaoyi %A Gao, Lei %A Zarch, Hossein Entezari %A Annavaram, Murali %J Computing Research Repository %V 2025 %N 2411 %D 2025-06-04 %~ DeepDyve