Search Results for author: Lokesh Gupta

Found 1 papers, 1 papers with code

LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs

1 code implementation16 Apr 2024 TaeHo Kim, Yanming Wang, Vatshank Chaturvedi, Lokesh Gupta, Seyeon Kim, Yongin Kwon, Sangtae Ha

Fine-tuning pre-trained large language models (LLMs) with limited hardware presents challenges due to GPU memory constraints.

Decoder

Cannot find the paper you are looking for? You can Submit a new open access paper.