%0 Journal Article %T Understanding and Optimizing Multi-Stage AI Inference Pipelines %A Bambhaniya, Abhimanyu Rajeshkumar %A Wu, Hanjiang %A Subramanian, Suvinay %A Srinivasan, Sudarshan %A Kundu, Souvik %A Yazdanbakhsh, Amir %A Elavazhagan, Midhilesh %A Kumar, Madhu %A Krishna, Tushar %J Computing Research Repository %V 2025 %N 2504 %D 2025-04-20 %~ DeepDyve