Preprocessing¶

This page explains how RAITAP preprocessing works. By default, no preprocessing is applied, so pretrained image models that expect ImageNet normalization, or tabular models that expect z-scored features, will produce wrong outputs unless you configure it.

The following two config keys are available:

| Knob | Where | Typical contents |
| --- | --- | --- |
| `data.preprocessing` | Loader, before the batch reaches the model | Resize + CenterCrop (images), feature scaling (tabular) |
| `data.model_input_transformation` | Model boundary, on every forward pass. Only for Torch models. | Normalize, learnable input layers |

`data.preprocessing` runs in the loader so that mixed-size image folders can be stacked into a batch at all, and so that the work stays outside autograd.
`data.model_input_transformation` runs inside autograd so that Captum/SHAP attribution and PGD/FGSM attacks operate on the same input space you work in.
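
To make the autograd point concrete, here is a minimal sketch in plain PyTorch. It does not use RAITAP itself: the `Normalize` module and the tiny `classifier` are hypothetical stand-ins, and composing them with `nn.Sequential` is only an assumption about what "model boundary" means in practice. It shows that when normalization sits inside the forward pass, the gradient is taken with respect to the raw pixels.

```python
import torch
from torch import nn


class Normalize(nn.Module):
    """Channel-wise (x - mean) / std, kept as a module so it is part of the autograd graph."""

    def __init__(self, mean, std):
        super().__init__()
        self.register_buffer("mean", torch.tensor(mean).view(-1, 1, 1))
        self.register_buffer("std", torch.tensor(std).view(-1, 1, 1))

    def forward(self, x):
        return (x - self.mean) / self.std


torch.manual_seed(0)
# Hypothetical stand-in classifier; any nn.Module taking (N, 3, H, W) would do.
classifier = nn.Sequential(nn.Flatten(), nn.Linear(3 * 8 * 8, 4))
# Model-side transform composed in front of the model, inside autograd.
wrapped = nn.Sequential(
    Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225]),
    classifier,
)

x = torch.rand(2, 3, 8, 8, requires_grad=True)  # raw pixels in [0, 1]
labels = torch.tensor([1, 3])
loss = nn.functional.cross_entropy(wrapped(x), labels)
loss.backward()

# The gradient is with respect to the raw pixels, so an FGSM step and its
# epsilon budget are expressed in the same [0, 1] space as the input images.
epsilon = 2 / 255
x_adv = (x + epsilon * x.grad.sign()).clamp(0, 1).detach()
```

Had the normalization been applied in the loader instead, `x` would already be normalized and the epsilon budget would no longer be measured in pixel units.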

Warning

Not supported for object detection. Detection models take native per-image inputs and resize/normalise internally (e.g. torchvision GeneralizedRCNNTransform), so neither knob applies: a data-side resize/crop/pad would corrupt the ground-truth box coordinates (labels are not transformed with the pixels), and a model-side transform would double-process the input. Setting data.preprocessing or data.model_input_transformation for a detection model raises an error — leave both unset.

The following values are allowed for both keys:

null (default): no preprocessing
"model-bundled": use the preprocessing bundled inside the model file (e.g. torchvision models)
path to a .py file: load a user factory decorated with the matching RAITAP decorator (see custom examples below). Requires consent — see --allow-preprocessing-exec.

Custom-file rules¶

A decorated factory must:

carry @raitap_preprocessing_factory (data side) or @raitap_model_input_transformation_factory (model side),
take no required arguments,
return an nn.Module.

Each file may contain at most one factory per side. Pointing a knob at a file with no matching decorator raises an error before the model is built. A file with two matching factories raises an error that lists their names.
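
A file that follows these rules might look like the sketch below. The decorator import path is not shown on this page, so the two decorators are defined here as no-op stand-ins purely to keep the sketch runnable; in a real file you would import the actual RAITAP decorators. The `ResizeAndCenterCrop` and `Normalize` modules are illustrative implementations in plain PyTorch, not RAITAP code.

```python
import torch
import torch.nn.functional as F
from torch import nn


# Stand-in decorators so this sketch runs without RAITAP installed. In a real
# file, import the actual decorators from RAITAP instead of defining these.
def raitap_preprocessing_factory(fn):
    return fn


def raitap_model_input_transformation_factory(fn):
    return fn


class ResizeAndCenterCrop(nn.Module):
    """Resize the shorter side to `resize`, then center-crop to `crop` x `crop`."""

    def __init__(self, resize=256, crop=224):
        super().__init__()
        self.resize, self.crop = resize, crop

    def forward(self, image):  # image: float tensor of shape (C, H, W)
        _, h, w = image.shape
        scale = self.resize / min(h, w)
        new_h = max(self.resize, round(h * scale))
        new_w = max(self.resize, round(w * scale))
        image = F.interpolate(
            image.unsqueeze(0), size=(new_h, new_w), mode="bilinear", align_corners=False
        ).squeeze(0)
        top, left = (new_h - self.crop) // 2, (new_w - self.crop) // 2
        return image[:, top : top + self.crop, left : left + self.crop]


class Normalize(nn.Module):
    """Channel-wise (x - mean) / std; broadcasts over (C, H, W) and (N, C, H, W)."""

    def __init__(self, mean, std):
        super().__init__()
        self.register_buffer("mean", torch.tensor(mean).view(-1, 1, 1))
        self.register_buffer("std", torch.tensor(std).view(-1, 1, 1))

    def forward(self, x):
        return (x - self.mean) / self.std


@raitap_preprocessing_factory  # data side: exactly one per file
def resize_and_crop() -> nn.Module:
    return ResizeAndCenterCrop()


@raitap_model_input_transformation_factory  # model side: exactly one per file
def normalize() -> nn.Module:
    return Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225])
```

Both factories take no required arguments and return an `nn.Module`, so any tunable values (sizes, statistics) live inside the factory body rather than in its signature.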

Two Protocol types ship for static analysis:

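```python
from raitap.data import DataPreprocessingFactory, ModelInputTransformationFactory

# resize_and_crop and normalize are your own decorated factories.
# Assigning them to Protocol-typed names lets a type checker flag a wrong signature.
_data_check: DataPreprocessingFactory = resize_and_crop
_model_check: ModelInputTransformationFactory = normalize
```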

RAITAP records each file's path and SHA-256 in tracking metadata so changes between runs surface in your history.
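
As an illustration of why a content hash makes changes visible, the sketch below fingerprints a file with the standard library. The helper `file_fingerprint` is hypothetical and assumes the hash covers the file's raw bytes; it is not RAITAP's implementation.

```python
import hashlib
import tempfile
from pathlib import Path


def file_fingerprint(path):
    """Return (resolved path, SHA-256 hex digest of the file's bytes)."""
    p = Path(path).resolve()
    return str(p), hashlib.sha256(p.read_bytes()).hexdigest()


with tempfile.TemporaryDirectory() as tmp:
    factory_file = Path(tmp) / "preprocessing.py"

    factory_file.write_text("MEAN = 0.5\n")
    _, before = file_fingerprint(factory_file)

    # Any edit to the file, even a single constant, yields a different digest.
    factory_file.write_text("MEAN = 0.4\n")
    _, after = file_fingerprint(factory_file)

changed = before != after
```

Comparing the stored digest between two runs is enough to tell whether the preprocessing file was edited, even when its path stayed the same.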

When does data-side run per-image?¶

Image sources (.jpg, .png, …) load per-image — the data-side module is lifted to a single-image (C, H, W) → Tensor callable and applied as each file is read. That's the only way a directory of varied-size images can be stacked into one batch.

Tabular sources (.csv, .tsv, .parquet) load as (N, F) in one shot — the data-side module runs once on the full batch.

If you set data.preprocessing: null on an image source, every file must already be the same height and width.
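
The difference between the two loading modes can be sketched in plain PyTorch. `ResizeTo` and `ZScore` are hypothetical stand-in modules, and the loops below only imitate what a loader might do; they are not RAITAP's loader code.

```python
import torch
import torch.nn.functional as F
from torch import nn


class ResizeTo(nn.Module):
    """Resize a single (C, H, W) image to a fixed (height, width)."""

    def __init__(self, size):
        super().__init__()
        self.size = size

    def forward(self, image):
        return F.interpolate(
            image.unsqueeze(0), size=self.size, mode="bilinear", align_corners=False
        ).squeeze(0)


class ZScore(nn.Module):
    """Scale each feature column of an (N, F) batch with fixed statistics."""

    def __init__(self, mean, std):
        super().__init__()
        self.register_buffer("mean", mean)
        self.register_buffer("std", std)

    def forward(self, x):
        return (x - self.mean) / self.std


torch.manual_seed(0)

# Image case: files of different sizes cannot be stacked as they are.
images = [torch.rand(3, 40, 60), torch.rand(3, 32, 32), torch.rand(3, 50, 45)]
try:
    torch.stack(images)
    stacked_raw = True
except RuntimeError:
    stacked_raw = False

# Applying the data-side module per image makes the shapes agree.
resize = ResizeTo((32, 32))
image_batch = torch.stack([resize(img) for img in images])  # (3, 3, 32, 32)

# Tabular case: the whole (N, F) table is available at once, so the module
# runs a single time. Statistics are taken from the table itself here purely
# for illustration.
table = torch.rand(5, 4)
scaled = ZScore(table.mean(dim=0), table.std(dim=0))(table)  # (5, 4)
```

With no data-side module, the `torch.stack(images)` call is effectively what remains, which is why all image files must then share one height and width.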
