Published in 2023 at "IEEE Transactions on Parallel and Distributed Systems"
DOI: 10.1109/tpds.2023.3279233
Abstract: The heterogeneity of Deep Learning models, libraries, and hardware poses an important challenge for improving model inference performance. Auto-tuners address this challenge via automatic tensor program optimization towards a target device. However, auto-tuners incur a substantial…
Keywords: auto; tensor programs; infrastructure; measurement ...