cosine_annealing
optimus_dl.modules.lr_scheduler.cosine_annealing
¶
CosineAnnealingLR
¶
Bases: BaseLRScheduler
Cosine annealing learning rate scheduler.
Decays the learning rate using a cosine curve following:
This implementation allows the learning rate to smoothly transition from its initial value down toeta_min over T_max steps.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
cfg
|
CosineAnnealingLRConfig
|
Scheduler configuration. |
required |
optimizer
|
Optimizer
|
Managed optimizer. |
required |
iterations
|
int
|
Total training iterations (used if T_max is not specified). |
required |
Source code in optimus_dl/modules/lr_scheduler/cosine_annealing.py
get_lr()
¶
Calculate learning rates using the cosine annealing formula.
Source code in optimus_dl/modules/lr_scheduler/cosine_annealing.py
load_state_dict(state_dict)
¶
Restore the scheduler's state.
Source code in optimus_dl/modules/lr_scheduler/cosine_annealing.py
state_dict()
¶
Return the scheduler's state, including cosine-specific parameters.
Source code in optimus_dl/modules/lr_scheduler/cosine_annealing.py
CosineAnnealingLRConfig
dataclass
¶
Bases: BaseLRSchedulerConfig
Configuration for cosine annealing learning rate scheduler.
Attributes:
| Name | Type | Description |
|---|
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
T_max
|
int
|
|
1000
|
eta_min
|
float
|
|
0.0
|
last_epoch
|
int
|
|
-1
|