Index
optimus_dl
Optimus-DL: A modular, high-performance framework for training Large Language Models.
Optimus-DL is a research framework built on PyTorch that provides:
- Modular "Recipe" architecture for clean separation of concerns
- Hydra-based configuration management
- Universal metrics system with distributed aggregation
- Modern PyTorch features (AMP, FSDP2, Tensor Parallelism, torch.compile)
- Efficient kernels via Liger-Kernel integration
- Registry system for easy component swapping
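To make the registry idea in the list above concrete, here is a minimal sketch of the general pattern in plain Python. It is not Optimus-DL's actual API: the names `Registry`, `register`, `build`, `OPTIMIZERS`, `SGDSettings`, and `AdamWSettings` are hypothetical and chosen only for illustration. The point it shows is that components register under a string name, so a configuration can swap one for another without touching the calling code.

```python
from typing import Any, Callable, Dict, Type


class Registry:
    """Maps string names to classes so a config can pick a component by name.

    Hypothetical helper for illustration; not part of Optimus-DL.
    """

    def __init__(self, kind: str) -> None:
        self.kind = kind
        self._entries: Dict[str, Type] = {}

    def register(self, name: str) -> Callable[[Type], Type]:
        """Return a class decorator that stores the class under `name`."""

        def decorator(cls: Type) -> Type:
            if name in self._entries:
                raise KeyError(f"{self.kind} '{name}' is already registered")
            self._entries[name] = cls
            return cls

        return decorator

    def build(self, name: str, **kwargs: Any) -> Any:
        """Instantiate the class registered under `name` with `kwargs`."""
        if name not in self._entries:
            known = ", ".join(sorted(self._entries))
            raise KeyError(f"unknown {self.kind} '{name}'; known: {known}")
        return self._entries[name](**kwargs)


# A hypothetical registry of optimizer settings objects.
OPTIMIZERS = Registry("optimizer")


@OPTIMIZERS.register("sgd")
class SGDSettings:
    def __init__(self, lr: float = 0.1) -> None:
        self.lr = lr


@OPTIMIZERS.register("adamw")
class AdamWSettings:
    def __init__(self, lr: float = 1e-3, weight_decay: float = 0.01) -> None:
        self.lr = lr
        self.weight_decay = weight_decay


# A config dict stands in for what a configuration system would supply.
config = {"name": "adamw", "lr": 3e-4}
params = dict(config)
component = OPTIMIZERS.build(params.pop("name"), **params)
```

Changing `"adamw"` to `"sgd"` in the config selects a different class with no change to the code that calls `build`, which is the kind of component swapping a registry is meant to enable.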
Example
Basic training: