Articles with "per vector" as a keyword



Photo by 20164rhodi from unsplash

A 95.6-TOPS/W Deep Learning Inference Accelerator With Per-Vector Scaled 4-bit Quantization in 5 nm

Sign Up to like & get
recommendations!
Published in 2023 at "IEEE Journal of Solid-State Circuits"

DOI: 10.1109/jssc.2023.3234893

Abstract: The energy efficiency of deep neural network (DNN) inference can be improved with custom accelerators. DNN inference accelerators often employ specialized hardware techniques to improve energy efficiency, but many of these techniques result in catastrophic… read more here.

Keywords: per vector; quantization; accuracy loss; accelerator ... See more keywords