International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064




Downloads: 4 | Views: 53 | Weekly Hits: ⮙1 | Monthly Hits: ⮙2

Research Paper | Computer Engineering | United States of America | Volume 13 Issue 3, March 2024 | Rating: 5 / 10


Advancing AI: Enhancing Large Language Model Performance through GPU Optimization Techniques

Sriram Sagi


Abstract: This study delves into optimizing GPU utilization for supporting Large Language Models LLMs within Generative AI frameworks. Focusing on dynamic resource allocation, kernel optimization, and memory management, our investigation reveals significant improvements in LLM efficiency and performance. By integrating NVIDIAs advanced AI technologies, we propose a scalable, cost - effective approach for deploying AI applications at the enterprise level. The findings underscore the pivotal role of GPU optimization in enhancing AI accessibility and fostering innovation across diverse sectors.


Keywords: Large Language Models (LLMs), GPU Optimization, Generative AI, Artificial Intelligence Deployment


Edition: Volume 13 Issue 3, March 2024,


Pages: 630 - 633


How to Download this Article?

Type Your Valid Email Address below to Receive the Article PDF Link


Verification Code will appear in 2 Seconds ... Wait

Top