Quick Start
Run your training with Matcha
Prefix your training command with matcha run:
Reading the output
What counts as duration
Duration covers everything from when Matcha starts to when your training process exits. This includes model compilation, data loading, warmup steps, training, validation, checkpointing, and serialization. It is wall-clock time, not just training time.
Zero overhead
Matcha does not pipe or intercept your training output. Your process writes directly to the terminal. The only work Matcha does is polling NVML in a background thread, which has no measurable impact on training performance.