JSON Parser Benchmarking
With the caveat that each parser provides different functionality and access to the resulting parse tree, I benchmarked sajson, rapidjson, vjson, YAJL, and Jansson. My methodology was simple: given large-ish real-world JSON files, parse them as many times as possible in one second. Â To include the cost of reading the parse tree in the benchmark, I then iterated over the entire document and summed the number of strings, numbers, object values, array elements, etc.
The documents are:
- apache_builds.json: Data from the Apache Jenkins installation. Mostly it's a array of three-element objects, with string keys and string values.
- github_events.json: JSON data from GitHub's events feature. Nested objects several levels deep, mostly strings, but also contains booleans, nulls, and numbers.
- instruments.json: A very long array of many-key objects.
- mesh.json: 3D mesh data. Almost entirely consists of floating point numbers.
- update-center.json: Also from Jenkins though I'm not sure where I found it.
apache_builds.json, github_events.json, and instruments.json are pretty-printed with a great deal of interelement whitespace.
Now for the results. The Y axis is parses per second. Thus, higher is faster.
Core 2 Duo E6850, Windows 7, Visual Studio 2010, x86
data:image/s3,"s3://crabby-images/7eaea/7eaea14e9c8cd4ef6113e970b9e4ce88d9019e35" alt="core2duo_32_apache_builds"
data:image/s3,"s3://crabby-images/b06a7/b06a7a4ff392c48bd886e4ce5079bc5a5e76e91c" alt="core2duo_32_github_events"
data:image/s3,"s3://crabby-images/dcaca/dcaca634ab4dea1e8117e6a9dbd9a4702d2218f6" alt="core2duo_32_instruments"
data:image/s3,"s3://crabby-images/551a5/551a5a3dce8f169cd183426e8eed0c8522a20672" alt="core2duo_32_mesh"
data:image/s3,"s3://crabby-images/bdc34/bdc343b7b8ac67956c38941b19a2c314ccc9022b" alt="core2duo_32_update-center"
Core 2 Duo E6850, Windows 7, Visual Studio 2010, AMD64
data:image/s3,"s3://crabby-images/3b057/3b0578ac2da69a49bcdd9ac32f7013107f712dfd" alt="core2duo_64_apache_builds"
data:image/s3,"s3://crabby-images/a1704/a170429cfff4ef2f62fa4fae81246bcb85b6018f" alt="core2duo_64_github_events"
data:image/s3,"s3://crabby-images/22e86/22e86ce995f7cb1f469326dc4dfc1113f7a14a9f" alt="core2duo_64_instruments"
data:image/s3,"s3://crabby-images/1bdcc/1bdcc29cd7013c00ade02845e0a74803cecb5bf4" alt="core2duo_64_mesh"
data:image/s3,"s3://crabby-images/ff6c9/ff6c9ff57e0506202eb462d35c93f5880b2584a5" alt="core2duo_64_update-center"
Atom D2700, Ubuntu 12.04, clang 3.0, AMD64
data:image/s3,"s3://crabby-images/03f3a/03f3adb95b9b5fe2f54a160f1f2e0be8667c2004" alt="atom_apache_builds"
data:image/s3,"s3://crabby-images/28cd9/28cd96808696ea39eaefe9ec783a11f40cf9476d" alt="atom_github_events"
data:image/s3,"s3://crabby-images/f2e54/f2e54181b38c844eab03a25327373106450574ce" alt="atom_instruments"
data:image/s3,"s3://crabby-images/82560/825607d43a16d27b90d8ca32378607f6b112ae26" alt="atom_mesh"
data:image/s3,"s3://crabby-images/30c7d/30c7d90ab22dc73fb164eb433200418e0fceefb0" alt="atom_update-center"
Raspberry Pi
data:image/s3,"s3://crabby-images/cdeac/cdeac2612ebbcdd629fce6d803d1a945428b1319" alt="raspi_apache_builds"
data:image/s3,"s3://crabby-images/7ace2/7ace21a1cb29a9b36fd87d1f52c223583cf0e626" alt="raspi_github_events"
data:image/s3,"s3://crabby-images/5c648/5c648753ce57713ecd02d07f7704e34f582a57cf" alt="raspi_instruments"
data:image/s3,"s3://crabby-images/8a006/8a00643ef38e3e4b5536541490f958f2c6678963" alt="raspi_mesh"
data:image/s3,"s3://crabby-images/905d2/905d20ac44a6475edbecfcfa26cecc5f6e7e11f0" alt="raspi_update-center"
Conclusions
sajson compares favorably to rapidjson and vjson, all of which stomp the C-based YAJL and Jansson parsers. 64-bit builds are noticeably faster: presumably because the additional registers are helpful. Raspberry Pis are slow. :)
Interesting! Would you mind sharing your sample apps? I wrote a json parser (https://github.com/mfascia/minja) and I would be interested in comparing its speed with these "reference" parsers. Thank you!
I was looking at a resource-constrained environment ( embedded MCU ) parser, found rapidjson and sajson and vjson, but i still dont like allocations at all.
so i came across jsmn ( http://zserge.bitbucket.org/jsmn.html ) and ultaminimal js0n ( https://github.com/quartzjer/js0n ). I'm not particularly interested in speed but robustness and stability, but for kicks if you still have your benchmark setup code around i would rerun with these included - and im mostly interested in running on modern ARM-CortexM's clocked at less than 100mhz : )
Hi kert,
I did consider js0n but it wasn't what I was looking for in that the generated "parse tree" isn't complete. sajson is a similar concept (single-pass, fills one array) but the resulting parse tree is actually useful.
I was not aware of jsmn at the time, thanks! If I update the benchmarks I'll add jsmn: it looks pretty reasonable.
Thanks, Chad
If you did an update, please include jvar (referenced on json.org)--a new C++ parser which uses JS style programming model in C++ and is quite fast.