Commit 93da29e

Fix CosineDecay documentation to clarify alpha is a multiplier
The documentation incorrectly stated that the learning rate decays to `alpha`, when it actually decays to `initial_learning_rate * alpha`. Updated the docstring to make clear that `alpha` is a fraction/multiplier, not an absolute target value.
1 parent b4d9c89 · commit 93da29e

File tree

1 file changed (+2 −2 lines)


keras/src/optimizers/schedules/learning_rate_schedule.py

Lines changed: 2 additions & 2 deletions

@@ -584,9 +584,9 @@ class CosineDecay(LearningRateSchedule):
     schedule applies a linear increase per optimizer step to our learning rate
     from `initial_learning_rate` to `warmup_target` for a duration of
     `warmup_steps`. Afterwards, it applies a cosine decay function taking our
-    learning rate from `warmup_target` to `alpha` for a duration of
+    learning rate from `warmup_target` to `warmup_target * alpha` for a duration of
     `decay_steps`. If `warmup_target` is None we skip warmup and our decay
-    will take our learning rate from `initial_learning_rate` to `alpha`.
+    will take our learning rate from `initial_learning_rate` to `initial_learning_rate * alpha`.
     It requires a `step` value to compute the learning rate. You can
     just pass a backend variable that you increment at each training step.
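For context, here is a minimal sketch of the clarified behavior. The constants (`0.1`, `1000`, `0.01`) are illustrative assumptions, not values from the commit: with `alpha=0.01`, the schedule bottoms out at `initial_learning_rate * alpha`, not at `alpha` itself.

import keras

# Illustrative values (assumptions for this sketch, not taken from the commit).
initial_lr = 0.1

schedule = keras.optimizers.schedules.CosineDecay(
    initial_learning_rate=initial_lr,
    decay_steps=1000,
    alpha=0.01,  # a multiplier: the final LR is initial_lr * alpha
)

# At step 0 the LR is the full initial rate; after decay_steps it has
# decayed to initial_lr * alpha = 0.001, not to alpha = 0.01.
print(float(schedule(0)))     # ~0.1
print(float(schedule(1000)))  # ~0.001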
