Abstract
TensorRT and 7.8× over Tensorflow XLA.
| Original language | English |
|---|---|
| Title of host publication | ASPLOS '24: Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems |
| Publisher | ACM |
| Pages | 286 - 301 |
| Number of pages | 15 |
| Volume | 1 |
| ISBN (Print) | 9798400703720 |
| DOIs | |
| Publication status | Published - 17 Apr 2024 |
Bibliographical note
We thank our shepherd, Vinod Grover, and the anonymous reviewers for their constructive feedback.For the purpose of open access, the authors have applied a Creative Commons Attribution (CCBY) license to any Author Accepted Manuscript versionarising from this submission.
Funding
This work was supported in part by the National Key R&D Program of China under grant agreement 2021ZD0110101, the National Natural Science Foundation of China (NSFC) under grant agreements T2222026, 22003073, 62232015, and 62090024, the Innovation Funding of ICT CAS under grant agreement E361010, a Beijing Nova Program, and the UK Engineering and Physical Sciences Research Council (EPSRC) under grant agreement EP/X018202/1.
Fingerprint
Dive into the research topics of 'Optimizing Deep Learning Inference via Global Analysis and Tensor Expressions'. Together they form a unique fingerprint.Datasets
-
Data from: Optimizing Deep Learning Inference via Global Analysis and Tensor Expressions
Xia, C. (Creator), Zhao, J. (Creator), Sun, Q. (Creator), Wang, Z. (Creator), Wen, Y. (Creator), Yu, T. (Creator), Feng, X. (Creator) & Cui, H. (Creator), Zenodo, 3 Oct 2023
DOI: 10.5281/zenodo.8404599, https://zenodo.org/records/8404599
Dataset