flatbuffer_direct Migration Guide

Goal

Operate with the default flatbuffer_direct backend while preserving production stability and diagnosability, and keep tf_converter available only as an explicit compatibility path when needed.

Backend differences (quick view)

Item	`tf_converter`	`flatbuffer_direct`
Default	No (explicit fallback)	Yes
Final generation path	TensorFlow Lite Converter	Direct FlatBuffer builder
Optimization behavior	TF-path accumulated rewrites/heuristics	Direct preprocess + strict dispatch constraints
Failure model	Many patterns absorbed by TF conversion	Explicit failure with `reason_code`
Custom op path	Implicitly minimized by TF path	Explicit opt-in + allowlist
Fallback	N/A	N/A (no fallback)

Recommended rollout

Keep the default CI lane on flatbuffer_direct and enable --report_op_coverage there.
Add an explicit compatibility lane with --tflite_backend tf_converter if you still need to monitor legacy behavior.
Resolve direct-path failures by reason_code and adjust model/export options.
Only after stable float32/float16 conversion, enable quantization and split evaluation.

Stage-by-stage commands

Stage 0: Baseline direct export + diagnostics

python -m onnx2tf.onnx2tf \
  -i model.onnx \
  -o out \
  --report_op_coverage

Stage 1: Quantization + ONNX-based accuracy check

python -m onnx2tf.onnx2tf \
  -i model.onnx \
  -o out \
  -odrqt -oiqt \
  --eval_with_onnx \
  --eval_target_tflite full_integer_quant \
  --eval_compare_mode dequant \
  --report_op_coverage

Stage 2: Split generation + split accuracy check

python -m onnx2tf.onnx2tf \
  -i model.onnx \
  -o out \
  --auto_split_tflite_by_size \
  --tflite_split_target_bytes 1060000000 \
  --tflite_split_max_bytes 1073741824 \
  --eval_split_models \
  --report_op_coverage

Stage 3: Production strict-fail operation

python -m onnx2tf.onnx2tf \
  -i model.onnx \
  -o out

When direct export fails, conversion stops with an explicit error. Use tf_converter explicitly if the legacy TensorFlow Lite Converter path is still required operationally.

Preprocess scope in direct path

flatbuffer_direct applies staged preprocess rules before lowering:

pattern_fusion_wave2
- ReLU/Clip chain normalization
- GELU chain fusion
- SpaceToDepth chain fusion
pseudo_ops_wave1
- HardSwish / LeakyRelu / PRelu / Gelu / limited Pow rewrites
constant_fold_a5
- Limited constant folding for shape/axes and arithmetic helper chains
normalize_attrs_a5
- perm/axes normalization and softmax-axis bridge insertion

Use preprocess_report.applied_rules in *_op_coverage_report.json to inspect actual rewrites.

Custom OP policy

Use custom-op lowering only when builtin mapping is not feasible.

python -m onnx2tf.onnx2tf \
  -i model.onnx \
  -o out \
  --tflite_backend flatbuffer_direct \
  --flatbuffer_direct_allow_custom_ops \
  --flatbuffer_direct_custom_op_allowlist Einsum,TopK \
  --report_op_coverage

Behavior:

Without custom-op enablement, custom candidates fail with reason_code=custom_op_candidate_disabled.
If allowlist is specified and op is missing, conversion fails with reason_code=custom_op_not_in_allowlist.

Known limitations and mitigation

Symptom (`reason_code`)	Cause	Mitigation
`unsupported_onnx_op`	No direct builtin/custom path	Use `tf_converter` or model rewrite
`requires_constant_input`	Dynamic axes/perm/shape where constants are required	Pre-fold graph (`onnxsim`) or rewrite to constants
`unsupported_attribute_value`	Direct constraints unmet (axis/rank/mode)	Adjust exporter flags or rewrite subgraph
`custom_op_candidate_disabled`	Custom candidate encountered while custom mode disabled	Enable custom ops only if runtime supports them
`custom_op_not_in_allowlist`	Candidate op not in allowlist	Add to allowlist explicitly

Report files

Accuracy report: *_accuracy_report.json
Split plan: *_split_plan.json
Split manifest: *_split_manifest.json
Split accuracy: *_split_accuracy_report.json
OP coverage: *_op_coverage_report.json

Operational checklist

Keep the default flatbuffer_direct lane green at all times.
Keep an explicit tf_converter lane only if you still rely on that compatibility path.
Gate flatbuffer_direct rollout by model family (small -> medium -> large).
Require --report_op_coverage in CI for the direct lane.
Review unsupported_reason_counts and custom_op_policy for every failure.
Avoid custom-op expansion unless runtime/serving side is ready.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

flatbuffer_direct Migration Guide

Goal

Backend differences (quick view)

Recommended rollout

Stage-by-stage commands

Stage 0: Baseline direct export + diagnostics

Stage 1: Quantization + ONNX-based accuracy check

Stage 2: Split generation + split accuracy check

Stage 3: Production strict-fail operation

Preprocess scope in direct path

Custom OP policy

Known limitations and mitigation

Report files

Operational checklist

FilesExpand file tree

FLATBUFFER_DIRECT_MIGRATION_GUIDE.md

Latest commit

History

FLATBUFFER_DIRECT_MIGRATION_GUIDE.md

File metadata and controls

flatbuffer_direct Migration Guide

Goal

Backend differences (quick view)

Recommended rollout

Stage-by-stage commands

Stage 0: Baseline direct export + diagnostics

Stage 1: Quantization + ONNX-based accuracy check

Stage 2: Split generation + split accuracy check

Stage 3: Production strict-fail operation

Preprocess scope in direct path

Custom OP policy

Known limitations and mitigation

Report files

Operational checklist