config
optimus_dl.recipe.metrics.config
¶
Configuration for metrics evaluation recipe.
MetricsConfig
dataclass
¶
Bases: RegistryConfigStrict
Complete metrics evaluation configuration.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
args
|
dict
|
dict() -> new empty dictionary dict(mapping) -> new dictionary initialized from a mapping object's (key, value) pairs dict(iterable) -> new dictionary initialized as if via: d = {} for k, v in iterable: d[k] = v dict(**kwargs) -> new dictionary initialized with the name=value pairs in the keyword argument list. For example: dict(one=1, two=2) |
<class 'dict'>
|
common
|
MetricsRecipeConfig
|
Configuration for metrics evaluation recipe common settings. |
<dynamic>
|
model
|
ModelConfig | None
|
|
None
|
data
|
DataConfig
|
|
'???'
|
criterion
|
CriterionConfig
|
|
'???'
|
metrics
|
dict[str, list[dict]]
|
Metric configurations mapped by dataset name |
<class 'dict'>
|
model_transforms
|
list[ModelTransformConfig]
|
List of model transforms to apply |
<dynamic>
|
loggers
|
list[MetricsLoggerConfig] | None
|
|
None
|
model_builder
|
Any
|
|
ModelBuilderConfig(_name='base')
|
criterion_builder
|
Any
|
|
CriterionBuilderConfig(_name='base')
|
data_builder
|
Any
|
|
DataBuilderConfig(_name='base')
|
checkpoint_manager
|
Any
|
|
CheckpointManagerConfig(_name='base')
|
logger_manager
|
Any
|
|
LoggerManagerConfig(_name='base')
|
evaluator
|
Any
|
|
EvaluatorConfig(_name='base')
|
Source code in optimus_dl/recipe/metrics/config.py
MetricsRecipeConfig
dataclass
¶
Configuration for metrics evaluation recipe common settings.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
name
|
str
|
Experiment name for loggers |
'metrics-eval'
|
seed
|
int
|
|
42
|
data_seed
|
int
|
|
42
|
output_path
|
str
|
Base directory for outputs (logs, etc.) |
'outputs/metrics'
|
checkpoint_path
|
str | None
|
Path to checkpoint to load from |
None
|
use_gpu
|
bool
|
|
True
|
distributed
|
DistributedConfig
|
Configuration for distributed training topologies. Attributes: tp_size: Degree of Tensor Parallelism (number of GPUs to shard each layer across). sharding_world_size: Size of FSDP sharding groups. If None, defaults to the number of GPUs per node (intra-node sharding). |
<dynamic>
|
max_iterations
|
int | None
|
Max number of batches to process per dataset |
None
|
tokenizer
|
Any
|
Tokenizer configuration |
None
|