ONNX

May 11, 20262 min read

torch.onnx.export(
    model,
    dummy_input,
    "dynamic_model.onnx",
    input_names=["input"],
    output_names=["output"],
    dynamic_axes={
        "input": {0: "batch_size", 2: "height", 3: "width"},
        "output": {0: "batch_size", 2: "height", 3: "width"}
    }
)

(batch_size, channels, height, width)
0 -> Batch size
1 -> Channels
2 -> Height
3 -> Width

dynamic_axes={
    "input": {0: "batch_size", 2: "height", 3: "width"},
    "output": {0: "batch_size", 2: "height", 3: "width"}
}
"input": {0: "batch_size"} → Batch size can change.
"input": {2: "height", 3: "width"} → Height and width can also vary.

(1, 3, 224, 224), (4, 3, 512, 512), or (8, 3, 300, 400)

dynamic_axes={
    "input": {0: "batch_size", 1: "seq_length"},
    "output": {0: "batch_size", 1: "seq_length"}
}
inputs are usually in (batch_size, sequence_length, embedding_dim) format.