
Favorite data-efficient format for tabular data #2483

Answered by daviewales
cool-RR asked this question in Q&A

You might try jsonla, a variant of JSONL that stores each row as a JSON list rather than a JSON object.
This means you specify the column headers once, on the first row (as a JSON list), and then only the values on subsequent rows.
So it's more compact than JSONL (closer to CSV), but keeps JSONL's stronger structure (typed JSON values, one record per line).
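
To make the layout concrete, here is a minimal Python sketch of writing and reading that shape using only the standard `json` module. The helper names `write_jsonla` and `read_jsonla` are hypothetical and exist only for this illustration; this is not the loader from the PR, just the "header list first, then one list per row" idea described above.

```python
import io
import json


def write_jsonla(rows, fp):
    """Write a list of dicts as a header list followed by one JSON list per row.

    Hypothetical helper for illustration; column order is taken from the first row.
    """
    columns = list(rows[0].keys()) if rows else []
    fp.write(json.dumps(columns) + "\n")
    for row in rows:
        # Missing keys become null so every line has the same length.
        fp.write(json.dumps([row.get(col) for col in columns]) + "\n")


def read_jsonla(fp):
    """Read the layout produced by write_jsonla back into a list of dicts."""
    lines = (line for line in fp if line.strip())
    first = next(lines, None)
    if first is None:
        return []
    header = json.loads(first)
    return [dict(zip(header, json.loads(line))) for line in lines]


rows = [
    {"name": "Ada", "age": 36, "note": "likes commas, and \"quotes\""},
    {"name": "Bob", "age": None, "note": "multi\nline"},
]

buf = io.StringIO()
write_jsonla(rows, buf)
jsonla_text = buf.getvalue()

roundtrip = read_jsonla(io.StringIO(jsonla_text))
```

Because each value goes through `json.dumps`, embedded newlines are escaped and every row stays on a single line, while numbers and nulls keep their types on the way back in.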

See this comment:
#1726 (comment)

And the PR where I added support for the jsonla file format (short for JSON Lines Arrays):
#1730

See the JSONL documentation:
https://jsonlines.org/examples/

There is also usv, which is basically CSV but with Unicode separator characters for columns and rows, so values can freely include commas, newlines, tabs, quotes, …
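
As a rough illustration of the usv idea, the sketch below joins fields and records with dedicated separator characters instead of commas and newlines. It assumes U+241F (␟) as the column separator and U+241E (␞) as the row separator; those code points and the helper names `dumps_usv` and `loads_usv` are assumptions made for this example, so check the actual loader's defaults before relying on them.

```python
# Assumed separators for this sketch (not confirmed by the discussion above).
UNIT_SEP = "\u241f"    # between columns
RECORD_SEP = "\u241e"  # after each row


def dumps_usv(rows):
    """Serialize rows (lists of values) to one string; no quoting or escaping needed."""
    return "".join(
        UNIT_SEP.join(str(value) for value in row) + RECORD_SEP for row in rows
    )


def loads_usv(text):
    """Parse the string back into rows of strings."""
    records = text.split(RECORD_SEP)
    if records and records[-1] == "":
        records.pop()  # drop the empty piece after the final row separator
    return [record.split(UNIT_SEP) for record in records]


table = [
    ["name", "quote"],
    ["Ada", 'said "hi", then\nleft\t'],
]

usv_text = dumps_usv(table)
parsed = loads_usv(usv_text)
```

Unlike the JSON-based formats, everything comes back as a string, and a value that itself contains one of the separator characters would still break the parse.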

Answer selected by cool-RR