Accelerating deep learning model inference on arm cpus with ultra-low bit quantization and runtime S Ashfaq, MH AskariHemmat, S Sah, E Saboori, O Mastropietro, ... arXiv preprint arXiv:2207.08820, 2022 | 5 | 2022 |
QReg: On regularization effects of quantization MH AskariHemmat, RA Hemmat, A Hoffman, I Lazarevich, E Saboori, ... arXiv preprint arXiv:2206.12372, 2022 | 2 | 2022 |
DeepGEMM: Accelerated Ultra Low-Precision Inference on CPU Architectures using Lookup Tables DC Ganji, S Ashfaq, E Saboori, S Sah, S Mitra, M Askarihemmat, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 1 | 2023 |
QGen: On the Ability to Generalize in Quantization Aware Training MH AskariHemmat, A Jeddi, RA Hemmat, I Lazarevich, A Hoffman, S Sah, ... arXiv preprint arXiv:2404.11769, 2024 | | 2024 |
DeepliteRT: Computer Vision at the Edge S Ashfaq, A Hoffman, S Mitra, S Sah, MH AskariHemmat, E Saboori arXiv preprint arXiv:2309.10878, 2023 | | 2023 |
Latency-aware structured pruning of pretrained transformer-based models A Hoffman McGill University (Canada), 2022 | | 2022 |