load_strategy

`optimus_dl.modules.checkpoint.load_strategy` ¶

Checkpoint loading strategy configuration.

This module defines the LoadStrategy class that controls which components of a checkpoint are loaded. This is useful for fine-tuning scenarios where you might want to load model weights but not optimizer state, or for resuming training with different configurations.

`LoadStrategy` `dataclass` ¶

Configuration for selective checkpoint loading.

This class controls which components are loaded from a checkpoint. It's particularly useful for:

Fine-tuning: Load model weights but not optimizer state
Resuming with different configs: Load model but reset optimizer
Evaluation: Load only model weights
Debugging: Load specific components to isolate issues

All fields default to True, meaning by default everything is loaded.

Example

# Fine-tuning: load model only
strategy = LoadStrategy(
    load_model=True,
    load_optimizer=False,
    load_scheduler=False,
    load_iteration=False,
)

# Resume training: load everything
strategy = LoadStrategy()  # All defaults to True

# Evaluation: model only
strategy = LoadStrategy(
    load_model=True,
    load_optimizer=False,
    load_scheduler=False,
    load_data_sources=False,
    load_dataloaders=False,
    load_metrics=False,
    load_iteration=False,
)

Parameters:

Name	Type	Description	Default
`load_model`	`bool`	Whether to load model weights.	`True`
`load_optimizer`	`bool`	Whether to load optimizer state.	`True`
`load_scheduler`	`bool`	Whether to load learning rate scheduler state.	`True`
`load_data_sources`	`bool`	Whether to load data source state (e.g. dataset position).	`True`
`load_dataloaders`	`bool`	Whether to load full dataloader state.	`True`
`load_metrics`	`bool`	Whether to load accumulated metrics.	`True`
`load_iteration`	`bool`	Whether to resume the iteration count.	`True`
`extra_ignore_keys`	`list[str] \| None`	List of specific keys to ignore in the checkpoint state dict.	`None`

Source code in optimus_dl/modules/checkpoint/load_strategy.py

@dataclass
class LoadStrategy:
    """Configuration for selective checkpoint loading.

    This class controls which components are loaded from a checkpoint. It's
    particularly useful for:

    - Fine-tuning: Load model weights but not optimizer state
    - Resuming with different configs: Load model but reset optimizer
    - Evaluation: Load only model weights
    - Debugging: Load specific components to isolate issues

    All fields default to True, meaning by default everything is loaded.

    Example:
        ```python
        # Fine-tuning: load model only
        strategy = LoadStrategy(
            load_model=True,
            load_optimizer=False,
            load_scheduler=False,
            load_iteration=False,
        )

        # Resume training: load everything
        strategy = LoadStrategy()  # All defaults to True

        # Evaluation: model only
        strategy = LoadStrategy(
            load_model=True,
            load_optimizer=False,
            load_scheduler=False,
            load_data_sources=False,
            load_dataloaders=False,
            load_metrics=False,
            load_iteration=False,
        )

        ```"""

    load_model: bool = field(
        default=True, metadata={"description": "Whether to load model weights."}
    )
    load_optimizer: bool = field(
        default=True, metadata={"description": "Whether to load optimizer state."}
    )
    load_scheduler: bool = field(
        default=True,
        metadata={"description": "Whether to load learning rate scheduler state."},
    )
    load_data_sources: bool = field(
        default=True,
        metadata={
            "description": "Whether to load data source state (e.g. dataset position)."
        },
    )
    load_dataloaders: bool = field(
        default=True, metadata={"description": "Whether to load full dataloader state."}
    )
    load_metrics: bool = field(
        default=True, metadata={"description": "Whether to load accumulated metrics."}
    )
    load_iteration: bool = field(
        default=True, metadata={"description": "Whether to resume the iteration count."}
    )
    extra_ignore_keys: list[str] | None = field(
        default=None,
        metadata={
            "description": "List of specific keys to ignore in the checkpoint state dict."
        },
    )

load_strategy

optimus_dl.modules.checkpoint.load_strategy ¶

LoadStrategy dataclass ¶

`optimus_dl.modules.checkpoint.load_strategy` ¶

`LoadStrategy` `dataclass` ¶