Underlying libraries¶

The current metrics module relies on:

torchmetrics for classification and detection metric implementations
faster-coco-eval as the default backend for detection mean average precision

To tweak specific options via the RAITAP config, you might need to refer to the underlying library's documentation.

Classification metrics¶

BinaryClassificationMetrics, MulticlassClassificationMetrics, and MultilabelClassificationMetrics are the task-specific adapters for classification models. They each wrap the following TorchMetrics classes (instantiated for the matching task):

The adapters map to the task types exposed by TorchMetrics that RAITAP currently uses:

BinaryClassificationMetrics → binary
MulticlassClassificationMetrics → multiclass
MultilabelClassificationMetrics → multilabel

RAITAP adds a thin layer of validation and conventions around these metrics. In particular:

MulticlassClassificationMetrics requires num_classes
MultilabelClassificationMetrics requires num_labels and defaults to a threshold of 0.5 if none is provided
average="none" stores per-class or per-label values in artifacts.json instead of flattening them into scalar metrics

For typical classification runs, the scalar results written to metrics/metrics.json are accuracy, precision, recall, and f1.

Detection metrics¶

DetectionMetrics is ideal for object detection models. It wraps TorchMetrics' MeanAveragePrecision.

Detection inputs follow the structure expected by TorchMetrics: a list of prediction dictionaries and a list of target dictionaries, usually one entry per image. Predictions include boxes, scores, and labels, while targets include boxes and labels.

Scalar detection results such as map are written to metrics.json. Larger structured outputs, such as class-wise or extended summaries, are written to artifacts.json.

Third-party adapters¶

Third-party adapters published to PyPI can register under the raitap.adapters entry-point group and are auto-discovered at config-registration time. Once installed they appear alongside in-tree metric computers. See Writing a plugin.