title dataset
abstract The Dataset Project provides tools for working with collections of JSON documents. It uses a simple key and object pair to organize JSON documents into a collection. It supports SQL querying of the objects stored in a collection. It was designed for temporary storage of JSON objects in data processing pipelines. It can be used as a persistent storage mechanism for collections of JSON objects you wish to distribute when used in conjunction with pairtree or SQLite3 storage. The Dataset Project provides command line programs and a web service for working with JSON objects as a collection or individual objects. As such it is well suited for data science and web-based applications.
authors
  family_name: Doiel, given_name: R. S.
  family_name: Morrell, given_name: Thomas E
maintainer
  family_name: Doiel, given_name: R. S.
  family_name: Morrell, given_name: Thomas E
repository_code https://github.com/caltechlibrary/dataset
version 2.5.1
license_url https://caltechlibrary.github.io/dataset/LICENSE
programming_language
Go
SQL
keywords
metadata
data storage
JSON
date_released 2026-06-03

About this software

dataset 2.5.1

  • Split SQLite driver by build target: normal builds use glebarez/go-sqlite (pure-Go, no WASM runtime overhead, eliminates spurious stderr warnings); WASM/wasip1 builds use ncruces/go-sqlite3 with the SQLite binary embedded
  • Fixed nil pointer dereference in dsquery when querying a pairtree collection with --pt-index

Authors

  • Doiel, R. S.
  • Morrell, Thomas E

Maintainers

  • Doiel, R. S.
  • Morrell, Thomas E

The Dataset Project provides tools for working with collections of JSON documents. It uses a simple key and object pair to organize JSON documents into a collection. It supports SQL querying of the objects stored in a collection.
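The key and object pairing described above can be pictured with a small in-memory sketch. This is not the dataset package API: the `Collection` type and its `Create`, `Read`, and `Keys` methods are hypothetical names chosen for illustration, and the sketch keeps everything in a Go map rather than in pairtree or SQLite3 storage.

```go
package main

import (
	"encoding/json"
	"fmt"
	"sort"
)

// Collection is a hypothetical in-memory stand-in for a collection:
// each unique key maps to exactly one JSON object.
type Collection struct {
	docs map[string]json.RawMessage
}

// NewCollection returns an empty collection.
func NewCollection() *Collection {
	return &Collection{docs: make(map[string]json.RawMessage)}
}

// Create stores src under key. It rejects duplicate keys and any
// input that is not a JSON object (arrays, scalars, null).
func (c *Collection) Create(key string, src []byte) error {
	if _, exists := c.docs[key]; exists {
		return fmt.Errorf("key %q already exists", key)
	}
	var obj map[string]interface{}
	if err := json.Unmarshal(src, &obj); err != nil {
		return fmt.Errorf("key %q: not a JSON object: %w", key, err)
	}
	if obj == nil {
		return fmt.Errorf("key %q: null is not a JSON object", key)
	}
	// Copy the bytes so later changes to src do not affect the stored document.
	c.docs[key] = append(json.RawMessage(nil), src...)
	return nil
}

// Read decodes the JSON object stored under key.
func (c *Collection) Read(key string) (map[string]interface{}, error) {
	raw, ok := c.docs[key]
	if !ok {
		return nil, fmt.Errorf("key %q not found", key)
	}
	var obj map[string]interface{}
	if err := json.Unmarshal(raw, &obj); err != nil {
		return nil, err
	}
	return obj, nil
}

// Keys lists the keys in the collection in sorted order.
func (c *Collection) Keys() []string {
	keys := make([]string, 0, len(c.docs))
	for k := range c.docs {
		keys = append(keys, k)
	}
	sort.Strings(keys)
	return keys
}

func main() {
	c := NewCollection()
	// The keys and field names below are made-up sample data.
	if err := c.Create("doe-jane", []byte(`{"family_name":"Doe","given_name":"Jane"}`)); err != nil {
		fmt.Println("create failed:", err)
		return
	}
	obj, err := c.Read("doe-jane")
	if err != nil {
		fmt.Println("read failed:", err)
		return
	}
	fmt.Println(c.Keys(), obj["family_name"])
}
```

Validating that each stored value is a JSON object, and refusing duplicate keys, mirrors the one-key-to-one-object organization; the SQL querying mentioned above depends on the storage engine and is outside this sketch.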

It was designed for temporary storage of JSON objects in data processing pipelines. It can be used as a persistent storage mechanism for collections of JSON objects you wish to distribute when used in conjunction with pairtree or SQLite3 storage.

The Dataset Project provides command line programs and a web service for working with JSON objects as a collection or individual objects. As such it is well suited for data science and web-based applications.

Programming languages

  • Go
  • SQL

Software Requirements

  • Golang >= 1.26
  • CMTools >= 0.0.43

Software Suggestions

  • Pandoc >= 3.9
  • GNU Make >= 3.8