muon
optimus_dl.modules.optim.muon
Muon optimizer
MuonConfig
dataclass
Bases: RegistryConfigStrict
Configuration for Muon optimizer.
Muon is a momentum-based optimizer that preconditions each update with a Newton-Schulz iteration, which approximately orthogonalizes the momentum matrix. It is designed for efficient training of large models.
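To make the preconditioning step concrete, here is a minimal pure-Python sketch of the quintic Newton-Schulz iteration as it is commonly described for Muon. It is not taken from `optimus_dl`; the helper names (`matmul`, `transpose`, `newton_schulz`) are hypothetical, and it assumes that `ns_coefficients` supplies the polynomial coefficients `(a, b, c)`, `ns_steps` the number of iterations, and `eps` the stabilizer used when normalizing the input.

```python
import math


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(row) for row in zip(*m)]


def newton_schulz(g, coefficients=(3.4445, -4.775, 2.0315), steps=5, eps=1e-7):
    """Approximately orthogonalize matrix g (illustrative sketch only)."""
    a, b, c = coefficients

    # Normalize by the Frobenius norm so every singular value is at most 1.
    norm = math.sqrt(sum(v * v for row in g for v in row))
    x = [[v / (norm + eps) for v in row] for row in g]

    # Work on the wide orientation so that x @ x.T is the smaller Gram matrix.
    transposed = len(x) > len(x[0])
    if transposed:
        x = transpose(x)

    for _ in range(steps):
        gram = matmul(x, transpose(x))          # A = X X^T
        gram_sq = matmul(gram, gram)            # A^2
        poly = [[b * p + c * q for p, q in zip(r1, r2)]
                for r1, r2 in zip(gram, gram_sq)]  # B = b*A + c*A^2
        bx = matmul(poly, x)
        x = [[a * p + q for p, q in zip(r1, r2)]
             for r1, r2 in zip(x, bx)]          # X = a*X + B X

    return transpose(x) if transposed else x


# Example: a diagonal "gradient" with very different singular values.
update = newton_schulz([[3.0, 0.0], [0.0, 1.0]])
```

Each step applies the odd polynomial `a*s + b*s**3 + c*s**5` to every singular value `s` of the normalized matrix, which pushes the singular values toward a band around 1 rather than converging exactly to 1. A real implementation would use tensor operations instead of nested lists.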
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| `lr` | `float` | | `0.001` |
| `weight_decay` | `float` | | `0.1` |
| `momentum` | `float` | | `0.95` |
| `nesterov` | `bool` | | `True` |
| `ns_coefficients` | `tuple[float, float, float]` | | `(3.4445, -4.775, 2.0315)` |
| `eps` | `float` | | `1e-07` |
| `ns_steps` | `int` | | `5` |
| `adjust_lr_fn` | `str \| None` | | `None` |
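The defaults above can be illustrated with a stand-in dataclass. `MuonConfigSketch` is a hypothetical name used only for this example: it copies the field names, types, and defaults from the table, but it does not inherit from `RegistryConfigStrict` and is not the class shipped in `optimus_dl.modules.optim.muon`.

```python
from __future__ import annotations

from dataclasses import dataclass, replace


@dataclass(frozen=True)
class MuonConfigSketch:
    """Hypothetical stand-in that mirrors the documented MuonConfig fields."""

    lr: float = 1e-3
    weight_decay: float = 0.1
    momentum: float = 0.95
    nesterov: bool = True
    ns_coefficients: tuple[float, float, float] = (3.4445, -4.775, 2.0315)
    eps: float = 1e-7
    ns_steps: int = 5
    adjust_lr_fn: str | None = None


# All documented defaults apply when no arguments are given.
default_cfg = MuonConfigSketch()

# Override selected fields; the values here are arbitrary examples.
tuned_cfg = replace(default_cfg, lr=0.02, ns_steps=6)
```

Fields that are not overridden, such as `momentum` and `ns_coefficients`, keep the defaults listed in the table.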