Large language models (LLMs) exhibit superior performance but demand substantial computational and memory resources. Quantization emerges as a viable strategy to alleviate memory requirements and expedite inference. Nonetheless, extant quantization methods encounter challenges in preserving accuracy and ensuring hardware efficiency. This research introduces SmoothQuant, an innovative, training-free, and accuracy-preserving post-training (PTQ) approach, specifically designed to facilitate 8-bit weight and 8-bit activation (W8A8) quantization for LLMs. Leveraging the insight that weights are amenable to quantization while activations pose challenges, SmoothQuant addresses this imbalance by mitigating activation outliers. With the continuous evolution of artificial intelligence (AI) technologies, integrating innovative AI models into the field of nuclear medicine holds immense promise. This research paper explores the synergistic potential of combining Large Language Models (LLMs) and SmoothQuant, a state-of-the-art post-training quantization technique, to achieve advanced imaging precision in nuclear medicine. This is achieved through an offline migration of the quantization complexity from activations to weights, employing a mathematically equivalent transformation. This research presents a turn-key solution that not only optimizes hardware utilization but also contributes to the democratization of LLMs by mitigating associated costs. The study investigates the impact of this integration on data analysis, interpretation, and overall diagnostic accuracy.
Nuclear Medicine, Artificial Intelligence, Large Language Models, Imaging Precision, Quantization.
IRE Journals:
Sanskar Saxena , Vinod Kumar , Jatin Chopra , Pranjal Gupta , Daamini Batra; Bhaumik Tyagi
"Enhancing Nuclear Medicine Through Innovative AI Models: Integrating Large Language Models and SmoothQuant for Advanced Imaging Precision" Iconic Research And Engineering Journals Volume 7 Issue 6 2023 Page 1-7
IEEE:
Sanskar Saxena , Vinod Kumar , Jatin Chopra , Pranjal Gupta , Daamini Batra; Bhaumik Tyagi
"Enhancing Nuclear Medicine Through Innovative AI Models: Integrating Large Language Models and SmoothQuant for Advanced Imaging Precision" Iconic Research And Engineering Journals, 7(6)