3.2. MAPS¶

A MAPS (Model Analysis and Processing Structure) is the cornerstone of experiment management in ClinicaDL: a single directory that holds everything about an experiment. It is represented in Python by the Maps class.

3.2.1. What a MAPS contains¶

Whenever you train, validate or test a model, the Trainer writes to its MAPS:

Outputs of training and evaluation — the trained network weights and checkpoints, the computed metrics, the logs, computational statistics, and the results of testing on new data.
All the hyperparameters used — the model, the optimization settings, the computational settings, the callbacks, the metrics and the data splits.

Because everything lives in one place, a MAPS serves the two goals of this chapter at once:

Experiment management

One self-contained, shareable folder fully describes one experiment — results, logs and configuration together. No more scattered output files.

Reproducibility

Since the MAPS records the configuration of every object used, an experiment can be rebuilt and run again from the folder alone.

3.2.2. Creating and reopening a MAPS¶

You rarely create a MAPS by hand: a Trainer creates one for you from the path you give it (see Chapter 2).

from clinicadl.train import Trainer
from clinicadl.metrics.config import LossMetricConfig, MSEMetricConfig
from clinicadl.callbacks import ModelCheckpointCallback

trainer = Trainer(
    maps="maps_directory",
    model=model,
    metrics={"loss": LossMetricConfig(), "mse": MSEMetricConfig()},
    callbacks=[ModelCheckpointCallback(metric="loss", epochs=range(1, 100, 10))],
)
trainer.train(split)
trainer.test(
    model_checkpoint="split-0_best-loss", group_name="test", dataloader=test_loader
)
trainer.test(
    model_checkpoint="split-0_epoch-10", group_name="test"
)

will produce a MAPS like this:

maps_directory
├── environment.txt
├── metrics.json
├── model.json
...
├── test
│   └── group-test
│       ├── data.tsv
│       ├── dataloader.json
│       ├── dataset.json
│       └── results
│           └── split-0
│               ├── best-loss
│               │   ├── computational.json
│               │   └── metrics
│               │       ├── aggregated.tsv
│               │       └── details.tsv
│               └── epoch-10
└── training
    ├── data
    │   ├── data.tsv
    │   ├── train
    │   │   ├── split-0
    │   │   │   ├── data.tsv
    │   │   │   ├── dataloader.json
    │   │   │   └── dataset.json
    │   │   └── split-1
    │   └── validation
    │       ├── split-0
    │       └── split-1
    ├── optimization.json
    └── split-0
        ├── computational.json
        ├── logs
        │   ├── computational.tsv
        │   ├── training_loss.tsv
        │   ...
        ├── models
        │   ├── best_models
        │   │   └── best-loss
        │   │       ├── model.pt
        │   │       └── validation_metrics
        │   ├── checkpoints
        │   │   ├── epoch-1
        │   │   ├── epoch-10
        │   │   ...
        │   └── final
        ├── summary.log
        ├── validation_metrics
        └── warning.log

Later you can reopen the experiment. Trainer.from_maps rebuilds the whole trainer from the configurations stored in the MAPS:

trainer = Trainer.from_maps("maps_directory")

To inspect the folder directly, open it as a Maps, which exposes the path of every file and subdirectory it contains:

from clinicadl.io.maps import Maps

maps = Maps("maps_directory")

>>> best_metrics_path = maps.training.splits[0].models.best_models.metrics["loss"].validation_metrics.aggregated_tsv
>>> best_metrics_path
PosixPath('maps_directory/training/split-0/models/best_models/best-loss/validation_metrics/aggregated.tsv')
>>> maps.open_file(best_metrics_path)
    epoch       mse      loss
0      42  1.347617  0.853633

Warning

Reproducibility relies on configuration classes. If you used a raw object that has no configuration class (a custom transform, an external network, etc.), ClinicaDL may not be able to rebuild it. Prefer configuration classes whenever one exists.

3.2.3. Reproducing a Python environment¶

When a MAPS is created, ClinicaDL records the current Python environment and saves it inside the env subdirectory. Up to three files are written, each starting with a commented header (Python version, platform, etc.):

environment.txt — always written, the output of pip freeze;
environment.yml — only inside a conda environment, the output of conda env export (exact versions and build strings);
environment_portable.yml — only inside a conda environment, the output of conda env export --no-builds --ignore-channels. It is a less faithful copy of the environment than environment.yml, but it makes the environment more likely to be recreated on another machine without errors

To recreate the environment on another machine, pick the file that matches your setup:

# with pip
pip install -r maps_directory/env/environment.txt

# with conda
conda env create -f maps_directory/env/environment.yml

# with conda, if the previous command fails (e.g. different OS)
conda env create -f maps_directory/env/environment_portable.yml

Limitations

None of these files guarantees a strictly identical environment: the GPU driver and the operating system live below the package level and are never captured.

environment.txt only lists the packages seen by pip: it captures neither the Python interpreter nor any non-Python, system-level library (CUDA, BLAS, etc.). In addition, when the MAPS was created inside a conda environment, packages installed by conda may appear in a form that pip cannot reinstall, so prefer the .yml files in that case.

The .yml files are more complete: conda env export also pins the Python interpreter and the system-level libraries that conda manages. However, environment.yml is tied to the operating system and architecture it was exported on, whereas environment_portable.yml relaxes this but may still need manual edits to be reproduced on a different OS.

Tip for maximum reproducibility

A more reliable way to reproduce an environment is to use a lock file (e.g., poetry.lock or uv.lock). A lock file pins the whole dependency graph, with hashes, so the environment can be rebuilt more precisely (e.g. poetry install or uv sync). If you manage your project with Poetry or uv, we recommend keeping a copy of the lock file in the env directory of your MAPS.

This closes Chapter 3. The final chapter shows how to go beyond the ready-made objects and tailor ClinicaDL to your own needs.