IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency
S. Ghafouri, K. Razavi, M. Salmani, A. Sanaee, T.L. Botran, L. Wang, J. Doyle, P. Jamshidi, in: Companion of the 16th ACM/SPEC International Conference on Performance Engineering, ACM, 2025.
Conference Paper | Published | English
Author
Ghafouri, Saeid;
Razavi, Kamran;
Salmani, Mehran;
Sanaee, Alireza;
Botran, Tania Lorido;
Wang, Lin;
Doyle, Joseph;
Jamshidi, Pooyan
Publishing Year
2025
Proceedings Title
Companion of the 16th ACM/SPEC International Conference on Performance Engineering
Conference
16th ACM/SPEC International Conference on Performance Engineering
Cite this
Ghafouri S, Razavi K, Salmani M, et al. IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency. In: Companion of the 16th ACM/SPEC International Conference on Performance Engineering. ACM; 2025. doi:10.1145/3680256.3721266
Ghafouri, S., Razavi, K., Salmani, M., Sanaee, A., Botran, T. L., Wang, L., Doyle, J., & Jamshidi, P. (2025). IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency. Companion of the 16th ACM/SPEC International Conference on Performance Engineering. 16th ACM/SPEC International Conference on Performance Engineering. https://doi.org/10.1145/3680256.3721266
@inproceedings{Ghafouri_Razavi_Salmani_Sanaee_Botran_Wang_Doyle_Jamshidi_2025, title={IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency}, DOI={10.1145/3680256.3721266}, booktitle={Companion of the 16th ACM/SPEC International Conference on Performance Engineering}, publisher={ACM}, author={Ghafouri, Saeid and Razavi, Kamran and Salmani, Mehran and Sanaee, Alireza and Botran, Tania Lorido and Wang, Lin and Doyle, Joseph and Jamshidi, Pooyan}, year={2025} }
Ghafouri, Saeid, Kamran Razavi, Mehran Salmani, Alireza Sanaee, Tania Lorido Botran, Lin Wang, Joseph Doyle, and Pooyan Jamshidi. “IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency.” In Companion of the 16th ACM/SPEC International Conference on Performance Engineering. ACM, 2025. https://doi.org/10.1145/3680256.3721266.
S. Ghafouri et al., “IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency,” presented at the 16th ACM/SPEC International Conference on Performance Engineering, 2025, doi: 10.1145/3680256.3721266.
Ghafouri, Saeid, et al. “IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency.” Companion of the 16th ACM/SPEC International Conference on Performance Engineering, ACM, 2025, doi:10.1145/3680256.3721266.