Interpreting the TOON vs JSON Benchmark Results #195
Replies: 2 comments
Hey @jmfloreszazo, I appreciate the thorough analysis! Is there anything we can include in the TOON docs based on your analysis?
Thanks a lot for the reply, @johannschopplich! Yes, the main thing I think would be valuable to add to the docs is guidance on setting realistic expectations for token savings. Based on the benchmarks I ran (full invoice datasets, compact/minified JSON, and reasoning models like o1), TOON consistently delivers around a ~25% input-token reduction versus already-minified JSON. Documenting that "~25% vs minified JSON" figure would help align expectations, especially for teams integrating TOON into MCP pipelines or multi-agent architectures, where that 25% compounds across thousands of calls. Happy to contribute more details if helpful, and thanks again for your work!
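To make the comparison concrete, here is a minimal sketch of where the savings come from. The invoice rows, field names, and values below are made-up illustrative data (not from the benchmark repo), and the TOON text is hand-written following the tabular layout described in the format docs rather than produced by a TOON library; character count is used only as a rough proxy for tokens.

```python
import json

# Hypothetical invoice rows (illustrative data, not from the benchmark repo).
rows = [
    {"id": 1, "item": "Widget", "qty": 2, "price": 9.99},
    {"id": 2, "item": "Gadget", "qty": 1, "price": 19.99},
    {"id": 3, "item": "Sprocket", "qty": 5, "price": 4.5},
]

# Minified JSON baseline: no whitespace between tokens.
minified = json.dumps(rows, separators=(",", ":"))

# Hand-written TOON-style tabular encoding: one header line declares the
# fields once, then each record becomes a single comma-separated row, so
# keys and braces are not repeated per record.
toon = "rows[3]{id,item,qty,price}:\n" + "\n".join(
    f"  {r['id']},{r['item']},{r['qty']},{r['price']}" for r in rows
)

# Character count is only a coarse stand-in for tokenizer output, but it
# shows the structural reason minified JSON stays larger.
print(len(minified), len(toon))
```

The exact percentage depends on the tokenizer and on how repetitive the field names are, which is consistent with the ~25% figure holding for uniform tabular data like invoices rather than for deeply nested payloads.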
TOON Format: Benchmark & Architecture — what I found when comparing TOON vs JSON (and why the 25% saving actually matters)
Full article: https://medium.com/@jmfloreszazo
Benchmark repo (.NET / C#): https://github.com/jmfloreszazo/dotnet_llm_toon_format_demo
After running real benchmarks with full invoice datasets and reasoning models like o1, I found that TOON doesn’t deliver the 30–60% savings often claimed — but it does deliver a consistent ~25% reduction in input tokens compared to compact JSON, without losing accuracy or increasing latency.
It’s not magic, but it’s real architecture: in MCP pipelines and multi-agent systems, that 25% compounds across thousands of tool-calls and becomes meaningful FinOps impact.
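The compounding claim above is simple arithmetic, but it is worth spelling out. All the numbers below (call volume, tokens per call, price per 1K input tokens) are hypothetical placeholders chosen for illustration; only the ~25% savings rate comes from the benchmarks:

```python
# Back-of-the-envelope FinOps sketch: how a per-call ~25% input-token
# reduction scales across a multi-agent pipeline making many tool-calls.
# All constants except SAVINGS_RATE are hypothetical illustration values.
CALLS_PER_DAY = 10_000          # hypothetical pipeline volume
TOKENS_PER_CALL_JSON = 2_000    # hypothetical input tokens with minified JSON
SAVINGS_RATE = 0.25             # ~25% reduction observed in the benchmarks
PRICE_PER_1K_INPUT = 0.0025     # hypothetical $ per 1K input tokens

tokens_saved = CALLS_PER_DAY * TOKENS_PER_CALL_JSON * SAVINGS_RATE
dollars_saved = tokens_saved / 1_000 * PRICE_PER_1K_INPUT
print(f"{tokens_saved:,.0f} input tokens/day, ${dollars_saved:,.2f}/day")
# → 5,000,000 input tokens/day, $12.50/day
```

Per call the saving is trivial; at pipeline scale it is a line item, which is why the figure matters more for MCP and multi-agent architectures than for one-off prompts.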
What do you think?
Thanks for your work!