Downloads: 6 | Views: 153 | Weekly Hits: ⮙2 | Monthly Hits: ⮙2
Research Paper | Computer Engineering | United States of America | Volume 13 Issue 3, March 2024 | Popularity: 5.3 / 10
Advancing AI: Enhancing Large Language Model Performance through GPU Optimization Techniques
Sriram Sagi
Abstract: This study delves into optimizing GPU utilization for supporting Large Language Models LLMs within Generative AI frameworks. Focusing on dynamic resource allocation, kernel optimization, and memory management, our investigation reveals significant improvements in LLM efficiency and performance. By integrating NVIDIAs advanced AI technologies, we propose a scalable, cost - effective approach for deploying AI applications at the enterprise level. The findings underscore the pivotal role of GPU optimization in enhancing AI accessibility and fostering innovation across diverse sectors.
Keywords: Large Language Models (LLMs), GPU Optimization, Generative AI, Artificial Intelligence Deployment
Edition: Volume 13 Issue 3, March 2024
Pages: 630 - 633
DOI: https://www.doi.org/10.21275/SR24309100709
Make Sure to Disable the Pop-Up Blocker of Web Browser