Page not organized well and under development, but here are the highlights…
Support for binary data in the Universal Binary JSON specification was in discussion for 2 years before it was finalized. Many, many different approaches were considered and discarded all in the name of maintaining compatibility with JSON while keeping an eye on performance.
The result is a surprisingly simple and binary-efficient construct that is also easily translated to JSON and back to UBJSON again with the help of a good library, namely: a strongly-typed array of uint8 values.
Compatibility with JSON
Representing binary data efficiently in Universal Binary JSON while still maintaining compatibility with JSON is deceptively simple: leverage a strongly-typed array of uint8 values — essentially a list of integers.
There is no explicit binary type, but instead the ability to represent binary inside of Universal Binary JSON in a very optimized and JSON-compatible construct.
The #1 goal of Universal Binary JSON is compatibility with JSON. Compatibility is defined as:
if A.ubjson -> translated to -> B.json && B.json -> translated to -> C.ubjson then A.ubjson == C.ubjson
All of the Universal Binary JSON value and container types are 1:1 compatible with JSON. The only semantically (but not structurally) incompatible construct in UBJSON is strongly-typed containers in that once the container is converted to JSON the typing of the container is lost. Converting the container back to UBJSON and re-enabling the strong-typing does require assistance from the encoding library.
Since JSON has no direct support for binary data or this style of strongly-typed container, the translation to JSON converts the strongly-typed array to an array of simple JSON types – in the case of binary data, it would be an array of number values (In the example above this is the translation step from A.ubjson to B.json).
Going from JSON back to UBJSON (B.json -> C.ubjson) has the potential for losing the strongly-typed container information and has to be handled with care to re-enable the optimized representation of that information back in the UBJSON format.
Library Implementation Recommendation
The library implementors are encouraged to provide this functionality in the form of two optional settings that can be turned on during generation:
- [x] Automatically use strongly typed containers when possible
- [x] Force use of strongly typed containers based on first element type
The idea being that the library can either make an automated attempt at reconstructing the strongly typed containers OR if you have a lot of knowledge of your data, you can force the library to reconstitute what looks to be a strongly typed container based on the fist element type.
Still under development…
Something to be aware of when converting UBJSON containing a large amount of binary data is that each strongly-typed container of uint8 values will convert to a JSON array of number values, because this translation also introduces a ‘,’ character between every value in the array, this effectively doubles the size of the binary data.