International Journal of Science and Research (IJSR)


ISSN: 2319-7064



Research Paper | Computer Engineering | United States of America | Volume 13 Issue 3, March 2024


     

Advancing AI: Enhancing Large Language Model Performance through GPU Optimization Techniques

Sriram Sagi


Abstract: This study examines how to optimize GPU utilization for Large Language Models (LLMs) within Generative AI frameworks. Focusing on dynamic resource allocation, kernel optimization, and memory management, our investigation reveals significant improvements in LLM efficiency and performance. By integrating NVIDIA's advanced AI technologies, we propose a scalable, cost-effective approach for deploying AI applications at the enterprise level. The findings underscore the pivotal role of GPU optimization in enhancing AI accessibility and fostering innovation across diverse sectors.


Keywords: Large Language Models (LLMs), GPU Optimization, Generative AI, Artificial Intelligence Deployment


Edition: Volume 13 Issue 3, March 2024


Pages: 630-633


DOI: https://www.doi.org/10.21275/SR24309100709







Sriram Sagi, "Advancing AI: Enhancing Large Language Model Performance through GPU Optimization Techniques", International Journal of Science and Research (IJSR), Volume 13 Issue 3, March 2024, pp. 630-633, https://www.ijsr.net/getabstract.php?paperid=SR24309100709, DOI: https://www.doi.org/10.21275/SR24309100709