Alexander Hoffman
Deeplite
Verified email at deeplite.ai
Title
Cited by
Year
Accelerating deep learning model inference on Arm CPUs with ultra-low bit quantization and runtime
S Ashfaq, MH AskariHemmat, S Sah, E Saboori, O Mastropietro, ...
arXiv preprint arXiv:2207.08820, 2022
Cited by 5
2022
QReg: On regularization effects of quantization
MH AskariHemmat, RA Hemmat, A Hoffman, I Lazarevich, E Saboori, ...
arXiv preprint arXiv:2206.12372, 2022
Cited by 2
2022
DeepGEMM: Accelerated Ultra Low-Precision Inference on CPU Architectures using Lookup Tables
DC Ganji, S Ashfaq, E Saboori, S Sah, S Mitra, M Askarihemmat, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by 1
2023
QGen: On the Ability to Generalize in Quantization Aware Training
MH AskariHemmat, A Jeddi, RA Hemmat, I Lazarevich, A Hoffman, S Sah, ...
arXiv preprint arXiv:2404.11769, 2024
2024
DeepliteRT: Computer Vision at the Edge
S Ashfaq, A Hoffman, S Mitra, S Sah, MH AskariHemmat, E Saboori
arXiv preprint arXiv:2309.10878, 2023
2023
Latency-aware structured pruning of pretrained transformer-based models
A Hoffman
McGill University (Canada), 2022
2022