Learning Rate Schedules

Linear Warmup With Cosine Annealing

Linear Warmup With Cosine Annealing is a learning rate schedule where we increase the learning rate linearly for $n$ updates and then anneal according to a cosine schedule afterwards.

Papers


Paper Code Results Date Stars

Tasks


Task Papers Share
Language Modelling 75 10.40%
Large Language Model 45 6.24%
Question Answering 39 5.41%
Retrieval 27 3.74%
Text Generation 22 3.05%
In-Context Learning 22 3.05%
Sentence 21 2.91%
Code Generation 21 2.91%
Prompt Engineering 21 2.91%

Categories