Configuration¶
This page describes how to configure the data used to assess the model.
Options¶
Name |
Allowed |
Default |
Description |
|---|---|---|---|
|
|
|
Name shown in outputs and tracking metadata. |
|
|
|
Optional human-readable dataset description. |
|
|
|
Path to a local data directory, or a named sample set such as |
|
|
|
Optional path to a labels file. Supported formats are CSV, TSV, and Parquet. |
|
|
|
Optional sample-ID column used to align labels with filenames, for example |
|
|
|
Optional class-label column. If omitted, one-hot numeric columns are reduced with |
|
|
|
Optional label parsing strategy. |
YAML example¶
data:
name: "my-dataset"
description: "Internal validation set"
source: "./data/images"
labels:
source: "./data/labels.csv"
id_column: "image"
column: "label"
encoding: "index"
CLI override example¶
uv run raitap data.source="./data/images" data.labels.source="./data/labels.csv" data.labels.column=label
raitap data.source="./data/images" data.labels.source="./data/labels.csv" data.labels.column=label