torch.onnx¶
Open Neural Network eXchange (ONNX) is an open standard format for representing machine learning models. The torch.onnx module can export PyTorch models to ONNX. The model can then be consumed by any of the many runtimes that support ONNX.
Example: AlexNet from PyTorch to ONNX¶
Here is a simple script which exports a pretrained AlexNet to an ONNX file named alexnet.onnx
.
The call to torch.onnx.export
runs the model once to trace its execution and then exports the
traced model to the specified file:
import torch
import torchvision
dummy_input = torch.randn(10, 3, 224, 224, device="cuda")
model = torchvision.models.alexnet(pretrained=True).cuda()
# Providing input and output names sets the display names for values
# within the model's graph. Setting these does not change the semantics
# of the graph; it is only for readability.
#
# The inputs to the network consist of the flat list of inputs (i.e.
# the values you would pass to the forward() method) followed by the
# flat list of parameters. You can partially specify names, i.e. provide
# a list here shorter than the number of inputs to the model, and we will
# only set that subset of names, starting from the beginning.
input_names = [ "actual_input_1" ] + [ "learned_%d" % i for i in range(16) ]
output_names = [ "output1" ]
torch.onnx.export(model, dummy_input, "alexnet.onnx", verbose=True, input_names=input_names, output_names=output_names)
The resulting alexnet.onnx
file contains a binary protocol buffer
which contains both the network structure and parameters of the model you exported
(in this case, AlexNet). The argument verbose=True
causes the
exporter to print out a human-readable representation of the model:
# These are the inputs and parameters to the network, which have taken on
# the names we specified earlier.
graph(%actual_input_1 : Float(10, 3, 224, 224)
%learned_0 : Float(64, 3, 11, 11)
%learned_1 : Float(64)
%learned_2 : Float(192, 64, 5, 5)
%learned_3 : Float(192)
# ---- omitted for brevity ----
%learned_14 : Float(1000, 4096)
%learned_15 : Float(1000)) {
# Every statement consists of some output tensors (and their types),
# the operator to be run (with its attributes, e.g., kernels, strides,
# etc.), its input tensors (%actual_input_1, %learned_0, %learned_1)
%17 : Float(10, 64, 55, 55) = onnx::Conv[dilations=[1, 1], group=1, kernel_shape=[11, 11], pads=[2, 2, 2, 2], strides=[4, 4]](%actual_input_1, %learned_0, %learned_1), scope: AlexNet/Sequential[features]/Conv2d[0]
%18 : Float(10, 64, 55, 55) = onnx::Relu(%17), scope: AlexNet/Sequential[features]/ReLU[1]
%19 : Float(10, 64, 27, 27) = onnx::MaxPool[kernel_shape=[3, 3], pads=[0, 0, 0, 0], strides=[2, 2]](%18), scope: AlexNet/Sequential[features]/MaxPool2d[2]
# ---- omitted for brevity ----
%29 : Float(10, 256, 6, 6) = onnx::MaxPool[kernel_shape=[3, 3], pads=[0, 0, 0, 0], strides=[2, 2]](%28), scope: AlexNet/Sequential[features]/MaxPool2d[12]
# Dynamic means that the shape is not known. This may be because of a
# limitation of our implementation (which we would like to fix in a
# future release) or shapes which are truly dynamic.
%30 : Dynamic = onnx::Shape(%29), scope: AlexNet
%31 : Dynamic = onnx::Slice[axes=[0], ends=[1], starts=[0]](%30), scope: AlexNet
%32 : Long() = onnx::Squeeze[axes=[0]](%31), scope: AlexNet
%33 : Long() = onnx::Constant[value={9216}](), scope: AlexNet
# ---- omitted for brevity ----
%output1 : Float(10, 1000) = onnx::Gemm[alpha=1, beta=1, broadcast=1, transB=1](%45, %learned_14, %learned_15), scope: AlexNet/Sequential[classifier]/Linear[6]
return (%output1);
}
You can also verify the output using the ONNX library, which you can install using conda:
conda install -c conda-forge onnx
Then, you can run:
import onnx
# Load the ONNX model
model = onnx.load("alexnet.onnx")
# Check that the model is well formed
onnx.checker.check_model(model)
# Print a human readable representation of the graph
print(onnx.helper.printable_graph(model.graph))
You can also run the exported model with one of the many runtimes that support ONNX. For example after installing ONNX Runtime, you can load and run the model:
import onnxruntime as ort
ort_session = ort.InferenceSession("alexnet.onnx")
outputs = ort_session.run(
None,
{"actual_input_1": np.random.randn(10, 3, 224, 224).astype(np.float32)},
)
print(outputs[0])
Here is a more involved tutorial on exporting a model and running it with ONNX Runtime.
Tracing vs Scripting¶
Internally, torch.onnx.export()
requires a torch.jit.ScriptModule
rather than
a torch.nn.Module
. If the passed-in model is not already a ScriptModule
,
export()
will use tracing to convert it to one:
Tracing: If
torch.onnx.export()
is called with a Module that is not already aScriptModule
, it first does the equivalent oftorch.jit.trace()
, which executes the model once with the givenargs
and records all operations that happen during that execution. This means that if your model is dynamic, e.g., changes behavior depending on input data, the exported model will not capture this dynamic behavior. Similarly, a trace is likely to be valid only for a specific input size. We recommend examining the exported model and making sure the operators look reasonable. Tracing will unroll loops and if statements, exporting a static graph that is exactly the same as the traced run. If you want to export your model with dynamic control flow, you will need to use scripting.Scripting: Compiling a model via scripting preserves dynamic control flow and is valid for inputs of different sizes. To use scripting:
Use
torch.jit.script()
to produce aScriptModule
.Call
torch.onnx.export()
with theScriptModule
as the model, and set theexample_outputs
arg. This is required so that the types and shapes of the outputs can be captured without executing the model.
See Introduction to TorchScript and TorchScript for more details, including how to compose tracing and scripting to suit the particular requirements of different models.
Avoiding Pitfalls¶
Avoid NumPy and built-in Python types¶
PyTorch models can be written using NumPy or Python types and functions, but during tracing, any variables of NumPy or Python types (rather than torch.Tensor) are converted to constants, which will produce the wrong result if those values should change depending on the inputs.
For example, rather than using numpy functions on numpy.ndarrays:
# Bad! Will be replaced with constants during tracing.
x, y = np.random.rand(1, 2), np.random.rand(1, 2)
np.concatenate((x, y), axis=1)
Use torch operators on torch.Tensors:
# Good! Tensor operations will be captured during tracing.
x, y = torch.randn(1, 2), torch.randn(1, 2)
torch.cat((x, y), dim=1)
And rather than using torch.Tensor.item()
(which converts a Tensor to a Python
built-in number):
# Bad! y.item() will be replaced with a constant during tracing.
def forward(self, x, y):
return x.reshape(y.item(), -1)
Use torch’s support for implicit casting of single-element tensors:
# Good! y will be preserved as a variable during tracing.
def forward(self, x, y):
return x.reshape(y, -1)
Avoid Tensor.data¶
Using the Tensor.data field can produce an incorrect trace and therefore an incorrect ONNX graph.
Use torch.Tensor.detach()
instead. (Work is ongoing to
remove Tensor.data entirely).
Limitations¶
Types¶
Only torch.Tensors, numeric types that can be trivially converted to torch.Tensors (e.g. float, int), and tuples and lists of those types are supported as model inputs or outputs. Dict and str inputs and outputs are accepted in tracing mode, but:
Any computation that depends on the value of a dict or a str input will be replaced with the constant value seen during the one traced execution.
Any output that is a dict will be silently replaced with a flattened sequence of its values (keys will be removed). E.g.
{"foo": 1, "bar": 2}
becomes(1, 2)
.Any output that is a str will be silently removed.
Certain operations involving tuples and lists are not supported in scripting mode due to limited support in ONNX for nested sequences. In particular appending a tuple to a list is not supported. In tracing mode, the nested sequences will be flattened automatically during the tracing.
Differences in Operator Implementations¶
Due to differences in implementations of operators, running the exported model on different runtimes may produce different results from each other or from PyTorch. Normally these differences are numerically small, so this should only be a concern if your application is sensitive to these small differences.
Unsupported Tensor Indexing Patterns¶
Tensor indexing patterns that cannot be exported are listed below.
If you are experiencing issues exporting a model that does not include any of
the unsupported patterns below, please double check that you are exporting with
the latest opset_version
.
Reads / Gets¶
When indexing into a tensor for reading, the following patterns are not supported:
# Tensor indices that includes negative values.
data[torch.tensor([[1, 2], [2, -3]]), torch.tensor([-2, 3])]
Writes / Sets¶
When indexing into a Tensor for writing, the following patterns are not supported:
# Multiple tensor indices if any has rank >= 2
data[torch.tensor([[1, 2], [2, 3]]), torch.tensor([2, 3])] = new_data
# Multiple tensor indices that are not consecutive
data[torch.tensor([2, 3]), :, torch.tensor([1, 2])] = new_data
# Tensor indices that includes negative values.
data[torch.tensor([1, -2]), torch.tensor([-2, 3])] = new_data
Adding support for operators¶
When exporting a model that includes unsupported operators, you’ll see an error message like:
RuntimeError: ONNX export failed: Couldn't export operator foo
When that happens, you’ll need to either change the model to not use that operator, or add support for the operator.
Adding support for operators requires contributing a change to PyTorch’s source code. See CONTRIBUTING for general instructions on that, and below for specific instructions on the code changes required for supporting an operator.
During export, each node in the TorchScript graph is visited in topological order.
Upon visiting a node, the exporter tries to find a registered symbolic functions for
that node. Symbolic functions are implemented in Python. A symbolic function for
an op named foo
would look something like:
def foo(
g: torch._C.Graph,
input_0: torch._C.Value,
input_1: torch._C.Value) -> Union[None, torch._C.Value, List[torch._C.Value]]:
"""
Modifies g (e.g., using "g.op()"), adding the ONNX operations representing
this PyTorch function.
Args:
g (Graph): graph to write the ONNX representation into.
input_0 (Value): value representing the variables which contain
the first input for this operator.
input_1 (Value): value representing the variables which contain
the second input for this operator.
Returns:
A Value or List of Values specifying the ONNX nodes that compute something
equivalent to the original PyTorch operator with the given inputs.
Returns None if it cannot be converted to ONNX.
"""
...
The torch._C
types are Python wrappers around the types defined in C++ in
ir.h.
The process for adding a symbolic function depends on the type of operator.
ATen operators¶
ATen is PyTorch’s built-in tensor library.
If the operator is an ATen operator (shows up in the TorchScript graph with the prefix
aten::
):
Define the symbolic function in
torch/onnx/symbolic_opset<version>.py
, for example torch/onnx/symbolic_opset9.py. Make sure the function has the same name as the ATen function, which may declared intorch/_C/_VariableFunctions.pyi
ortorch/nn/functional.pyi
(these files are generated at build time, so will not appear in your checkout until you build PyTorch).The first arg is always the ONNX graph that is being built for export. Other arg names must EXACTLY match the names in
_VariableFunctions.pyi
, because dispatch is done with keyword arguments.In the symbolic function, if the operator is in the ONNX standard operator set, we only need to create a node to represent the ONNX operator in the graph. If not, we can create a graph of several standard operators that have equivalent semantics to the ATen operator.
If an input argument is a Tensor, but ONNX asks for a scalar, we have to explicitly do the conversion.
symbolic_helper._scalar()
can convert a scalar tensor into a python scalar, andsymbolic_helper._if_scalar_type_as()
can turn a Python scalar into a PyTorch tensor.
Here is an example of handling missing symbolic function for the ELU
operator.
If we run the following code:
print(
torch.jit.trace(torch.nn.ELU(), # module
torch.ones(1) # example input
).graph)
We see something like:
graph(%self : __torch__.torch.nn.modules.activation.___torch_mangle_0.ELU,
%input : Float(1, strides=[1], requires_grad=0, device=cpu)):
%4 : float = prim::Constant[value=1.]()
%5 : int = prim::Constant[value=1]()
%6 : int = prim::Constant[value=1]()
%7 : Float(1, strides=[1], requires_grad=0, device=cpu) = aten::elu(%input, %4, %5, %6)
return (%7)
Since we see aten::elu
in the graph, we know this is an ATen operator.
We check the ONNX operator list,
and confirm that Elu
is standardized in ONNX.
We find a signature for elu
in torch/nn/functional.pyi
:
def elu(input: Tensor, alpha: float = ..., inplace: bool = ...) -> Tensor: ...
We add the following lines to symbolic_opset9.py
:
def elu(g, input, alpha, inplace=False):
return g.op("Elu", input, alpha_f=_scalar(alpha))
Now PyTorch is able to export models containing the aten::elu
operator!
See the symbolic_opset*.py
files for more examples.
torch.autograd.Functions¶
If the operator is defined in a sub-class of torch.autograd.Function
,
there are two ways to export it.
Static Symbolic Method¶
You can add a static method named symbolic
to your function class. It should return
ONNX operators that represent the function’s behavior in ONNX. For example:
class MyRelu(torch.autograd.Function):
@staticmethod
def forward(ctx, input: torch.Tensor) -> torch.Tensor:
ctx.save_for_backward(input)
return input.clamp(min=0)
@staticmethod
def symbolic(g: torch._C.graph, input: torch._C.Value) -> torch._C.Value:
return g.op("Clip", input, g.op("Constant", value_t=torch.tensor(0, dtype=torch.float)))
PythonOp Symbolic¶
Alternatively, you can register a custom symbolic function.
This gives the symoblic function access to more info through the
TorchScript Node
object for the original operation, which gets passed in as the second
argument (after the Graph
object).
All autograd Function``s are emitted in the TorchScript graph as ``prim::PythonOp
nodes.
In order to differentiate between different Function
subclasses, the
symbolic function should use the name
kwarg which gets set to the name of the class.
register_custom_op_symbolic()
does does not allow registration for ops in
the prim
namespace, so for this use case, there’s a back door: register the
symbolic for "::prim_PythonOp"
.
The example below shows how you can access requires_grad
via the Node
object:
class MyClip(torch.autograd.Function):
@staticmethod
def forward(ctx, input, min):
ctx.save_for_backward(input)
return input.clamp(min=min)
class MyRelu(torch.autograd.Function):
@staticmethod
def forward(ctx, input):
ctx.save_for_backward(input)
return input.clamp(min=0)
def symbolic_pythonop(g: torch._C.Graph, n: torch._C.Node, *args, **kwargs):
print("original node: ", n)
for i, out in enumerate(n.outputs()):
print("original output {}: {}, requires grad: {}".format(i, out, out.requiresGrad()))
import torch.onnx.symbolic_helper as sym_helper
for i, arg in enumerate(args):
requires_grad = arg.requiresGrad() if sym_helper._is_value(arg) else False
print("arg {}: {}, requires grad: {}".format(i, arg, requires_grad))
name = kwargs["name"]
if name == "MyClip":
return g.op("Clip", args[0], min_f=args[1])
elif name == "MyRelu":
return g.op("Relu", args[0])
else:
# Logs a warning and returns None
return _unimplemented("prim::PythonOp", "unknown node kind: " + name)
from torch.onnx import register_custom_op_symbolic
register_custom_op_symbolic("::prim_PythonOp", symbolic_pythonop, 1)
Custom operators¶
If a model uses a custom operator implemented in C++ as described in Extending TorchScript with Custom C++ Operators, you can export it by following this example:
from torch.onnx import register_custom_op_symbolic
from torch.onnx.symbolic_helper import parse_args
# Define custom symbolic function
@parse_args("v", "v", "f", "i")
def symbolic_foo_forward(g, input1, input2, attr1, attr2):
return g.op("custom_domain::Foo", input1, input2, attr1_f=attr1, attr2_i=attr2)
# Register custom symbolic function
register_custom_op_symbolic("custom_ops::foo_forward", symbolic_foo_forward, 9)
class FooModel(torch.nn.Module):
def __init__(self, attr1, attr2):
super(FooModule, self).__init__()
self.attr1 = attr1
self.attr2 = attr2
def forward(self, input1, input2):
# Calling custom op
return torch.ops.custom_ops.foo_forward(input1, input2, self.attr1, self.attr2)
model = FooModel(attr1, attr2)
torch.onnx.export(
model,
(example_input1, example_input1),
"model.onnx",
# only needed if you want to specify an opset version > 1.
custom_opsets={"custom_domain": 2})
You can export it as one or a combination of standard ONNX ops, or as a custom operator.
The example above exports it as a custom operator in the “custom_domain” opset.
When exporting a custom operator, you can specify the custom domain version using the
custom_opsets
dictionary at export. If not specified, the custom opset version defaults to 1.
The runtime that conumes the model needs to support the custom op. See
Caffe2 custom ops,
ONNX Runtime custom ops,
or your runtime of choice’s documentation.
Frequently Asked Questions¶
Q: I have exported my LSTM model, but its input size seems to be fixed?
The tracer records the shapes of the example inputs. If the model should accept inputs of dynamic shapes, set
dynamic_axes
when callingtorch.onnx.export()
.
Q: How to export models containing loops?
See Tracing vs Scripting.
Q: How to export models with primitive type inputs (e.g. int, float)?
Support for primitive numeric type inputs was added in PyTorch 1.9. However, the exporter does not support models with str inputs.
Q: Does ONNX support implicit scalar datatype casting?
No, but the exporter will try to handle that part. Scalars are exported as constant tensors. The exporter will try to figure out the right datatype for scalars. However when it is unable to do so, you will need to manually specify the datatype. This often happens with scripted models, where the datatypes are not recorded. For example:
class ImplicitCastType(torch.jit.ScriptModule): @torch.jit.script_method def forward(self, x): # Exporter knows x is float32, will export "2" as float32 as well. y = x + 2 # Currently the exporter doesn't know the datatype of y, so # "3" is exported as int64, which is wrong! return y + 3 # To fix, replace the line above with: # return y + torch.tensor([3], dtype=torch.float32) x = torch.tensor([1.0], dtype=torch.float32) torch.onnx.export(ImplicitCastType(), x, "implicit_cast.onnx", example_outputs=ImplicitCastType()(x))We are trying to improve the datatype propagation in the exporter such that implicit casting is supported in more cases.
Q: Are lists of Tensors exportable to ONNX?
Yes, for
opset_version
>= 11, since ONNX introduced the Sequence type in opset 11.
Functions¶
-
torch.onnx.
export
(model, args, f, export_params=True, verbose=False, training=<TrainingMode.EVAL: 0>, input_names=None, output_names=None, operator_export_type=None, opset_version=None, _retain_param_name=True, do_constant_folding=True, example_outputs=None, strip_doc_string=True, dynamic_axes=None, keep_initializers_as_inputs=None, custom_opsets=None, enable_onnx_checker=True, use_external_data_format=False)[source]¶ Exports a model into ONNX format. If
model
is not atorch.jit.ScriptModule
nor atorch.jit.ScriptFunction
, this runsmodel
once in order to convert it to a TorchScript graph to be exported (the equivalent oftorch.jit.trace()
). Thus this has the same limited support for dynamic control flow astorch.jit.trace()
.- Parameters
model (torch.nn.Module, torch.jit.ScriptModule or torch.jit.ScriptFunction) – the model to be exported.
args (tuple or torch.Tensor) –
args can be structured either as:
ONLY A TUPLE OF ARGUMENTS:
args = (x, y, z)
The tuple should contain model inputs such that
model(*args)
is a valid invocation of the model. Any non-Tensor arguments will be hard-coded into the exported model; any Tensor arguments will become inputs of the exported model, in the order they occur in the tuple.A TENSOR:
args = torch.Tensor([1])
This is equivalent to a 1-ary tuple of that Tensor.
A TUPLE OF ARGUMENTS ENDING WITH A DICTIONARY OF NAMED ARGUMENTS:
args = (x, {'y': input_y, 'z': input_z})
All but the last element of the tuple will be passed as non-keyword arguments, and named arguments will be set from the last element. If a named argument is not present in the dictionary, it is assigned the default value, or None if a default value is not provided.
Note
If a dictionary is the last element of the args tuple, it will be interpreted as containing named arguments. In order to pass a dict as the last non-keyword arg, provide an empty dict as the last element of the args tuple. For example, instead of:
torch.onnx.export( model, (x, # WRONG: will be interpreted as named arguments {y: z}), "test.onnx.pb")
Write:
torch.onnx.export( model, (x, {y: z}, {}), "test.onnx.pb")
f – a file-like object (such that
f.fileno()
returns a file descriptor) or a string containing a file name. A binary protocol buffer will be written to this file.export_params (bool, default True) – if True, all parameters will be exported. Set this to False if you want to export an untrained model. In this case, the exported model will first take all of its parameters as arguments, with the ordering as specified by
model.state_dict().values()
verbose (bool, default False) – if True, prints a description of the model being exported to stdout.
training (enum, default TrainingMode.EVAL) –
TrainingMode.EVAL
: export the model in inference mode.TrainingMode.PRESERVE
: export the model in inference mode if model.training is False and in training mode if model.training is True.TrainingMode.TRAINING
: export the model in training mode. Disables optimizations which might interfere with training.
input_names (list of str, default empty list) – names to assign to the input nodes of the graph, in order.
output_names (list of str, default empty list) – names to assign to the output nodes of the graph, in order.
operator_export_type (enum, default None) –
None usually means
OperatorExportTypes.ONNX
. However if PyTorch was built with-DPYTORCH_ONNX_CAFFE2_BUNDLE
, None meansOperatorExportTypes.ONNX_ATEN_FALLBACK
.OperatorExportTypes.ONNX
: Export all ops as regular ONNX ops (in the default opset domain).OperatorExportTypes.ONNX_FALLTHROUGH
: Try to convert all ops to standard ONNX ops in the default opset domain. If unable to do so (e.g. because support has not been added to convert a particular torch op to ONNX), fall back to exporting the op into a custom opset domain without conversion. Applies to custom ops as well as ATen ops. For the exported model to be usable, the runtime must support these non-standard ops.OperatorExportTypes.ONNX_ATEN
: All ATen ops (in the TorchScript namespace “aten”) are exported as ATen ops (in opset domain “org.pytorch.aten”). ATen is PyTorch’s built-in tensor library, so this instructs the runtime to use PyTorch’s implementation of these ops.Warning
Models exported this way are probably runnable only by Caffe2.
This may be useful if the numeric differences in implementations of operators are causing large differences in behavior between PyTorch and Caffe2 (which is more common on untrained models).
OperatorExportTypes.ONNX_ATEN_FALLBACK
: Try to export each ATen op (in the TorchScript namespace “aten”) as a regular ONNX op. If we are unable to do so (e.g. because support has not been added to convert a particular torch op to ONNX), fall back to exporting an ATen op. See documentation on OperatorExportTypes.ONNX_ATEN for context. For example:graph(%0 : Float): %3 : int = prim::Constant[value=0]() # conversion unsupported %4 : Float = aten::triu(%0, %3) # conversion supported %5 : Float = aten::mul(%4, %0) return (%5)
Assuming
aten::triu
is not supported in ONNX, this will be exported as:graph(%0 : Float): %1 : Long() = onnx::Constant[value={0}]() # not converted %2 : Float = aten::ATen[operator="triu"](%0, %1) # converted %3 : Float = onnx::Mul(%2, %0) return (%3)
If an op is in the TorchScript namespace “quantized”, it will be exported in the ONNX opset domain “caffe2”. These ops are produced by the modules described in Quantization.
Warning
Models exported this way are probably runnable only by Caffe2.
opset_version (int, default 9) – Must be
== _onnx_main_opset or in _onnx_stable_opsets
, defined in torch/onnx/symbolic_helper.py.do_constant_folding (bool, default False) – Apply the constant-folding optimization. Constant-folding will replace some of the ops that have all constant inputs with pre-computed constant nodes.
example_outputs (T or a tuple of T, where T is Tensor or convertible to Tensor, default None) – Must be provided when exporting a ScriptModule or ScriptFunction, ignored otherwise. Used to determine the type and shape of the outputs without tracing the execution of the model. A single object is treated as equivalent to a tuple of one element.
strip_doc_string (bool, default True) – do not include the field
doc_string`
from the exported model. Otherwise the field will mention the source code locations formodel
.dynamic_axes (dict<string, dict<python:int, string>> or dict<string, list(int)>, default empty dict) –
By default the exported model will have the shapes of all input and output tensors set to exactly match those given in
args
(andexample_outputs
when that arg is required). To specify axes of tensors as dynamic (i.e. known only at run-time), setdynamic_axes
to a dict with schema:KEY (str): an input or output name. Each name must also be provided in
input_names
oroutput_names
.VALUE (dict or list): If a dict, keys are axis indices and values are axis names. If a list, each element is an axis index.
For example:
class SumModule(torch.nn.Module): def forward(self, x): return torch.sum(x, dim=1) torch.onnx.export(SumModule(), (torch.ones(2, 2),), "onnx.pb", input_names=["x"], output_names=["sum"])
Produces:
input { name: "x" ... shape { dim { dim_value: 2 # axis 0 } dim { dim_value: 2 # axis 1 ... output { name: "sum" ... shape { dim { dim_value: 2 # axis 0 ...
While:
torch.onnx.export(SumModule(), (torch.ones(2, 2),), "onnx.pb", input_names=["x"], output_names=["sum"], dynamic_axes={ # dict value: manually named axes "x": {0: "my_custom_axis_name"}, # list value: automatic names "sum": [0], })
Produces:
input { name: "x" ... shape { dim { dim_param: "my_custom_axis_name" # axis 0 } dim { dim_value: 2 # axis 1 ... output { name: "sum" ... shape { dim { dim_param: "sum_dynamic_axes_1" # axis 0 ...
keep_initializers_as_inputs (bool, default None) –
If True, all the initializers (typically corresponding to parameters) in the exported graph will also be added as inputs to the graph. If False, then initializers are not added as inputs to the graph, and only the non-parameter inputs are added as inputs. This may allow for better optimizations (e.g. constant folding) by backends/runtimes.
If
opset_version
< 9, initializers MUST be part of graph inputs and this argument will be ignored and the behavior will be equivalent to setting this argument to True.If None, then the behavior is chosen automatically as follows:
If
operator_export_type=OperatorExportTypes.ONNX
, the behavior is equivalent to setting this argument to False.Else, the behavior is equivalent to setting this argument to True.
custom_opsets (dict<str, int>, default empty dict) –
A dictionary to indicate
A dict with schema:
KEY (str): opset domain name
VALUE (int): opset version
If a custom opset is referenced by
model
but not mentioned in this dictionary, the opset version is set to 1.enable_onnx_checker (bool, default True) – If True the onnx model checker will be run to ensure the exported model is a valid ONNX model.
use_external_data_format (bool, default False) – If True, then some of the model parameters are stored in external data files and not in the ONNX model file itself. Models larger than 2GB cannot be exported in one file because of size limits imposed by Protocol Buffers. For details see onnx.proto. If True, argument
f
must be a string specifying the location of the model. The external data files will be stored in the same directory asf
. This argument is ignored unlessoperator_export_type=OperatorExportTypes.ONNX
.
-
torch.onnx.
export_to_pretty_string
(*args, **kwargs)[source]¶ Same as
export()
, but returns a text representation of the exported model.
-
torch.onnx.
register_custom_op_symbolic
(symbolic_name, symbolic_fn, opset_version)[source]¶ Registers
symbolic_fn
to handlesymbolic_name
. See “Custom Operators” in the module documentation for an example usage.- Parameters
symbolic_name (str) – The name of the custom operator in “<domain>::<op>” format.
symbolic_fn (Callable) – A function that takes in the ONNX graph and the input arguments to the current operator, and returns new operator nodes to add to the graph.
opset_version (int) – The ONNX opset version in which to register.