Frictionless Standards

At the core of Frictionless is a set of patterns for describing data, including Table Schema (for tables), Data Resource (for files), and Data Package (for datasets).

This site houses the formal specifications of these patterns.

For more information about the project as a whole, please visit frictionlessdata.io.

Overview

What's a Data Package?

A Data Package is a simple container format used to describe and package a collection of data (a dataset).

A Data Package can contain any kind of data. At the same time, Data Packages can be specialized and enriched for specific types of data: there are, for example, Tabular Data Packages for tabular data and Geo Data Packages for geo data.

Data Package Specs Suite

When you look more closely, you'll see that Data Package is actually a suite of specifications. This suite is made of small specs, many of them usable on their own, that you can also combine.

This approach reflects our philosophy of "small pieces, loosely joined" as well as "make the simple things simple and complex things possible": it is easy to use just the piece you need, and equally easy to scale up to more complex needs.

For example, for tabular data we can create a Tabular Data Package spec by combining three other specs together: the Data Package spec for the dataset, the Table Schema spec to describe the table structure, and finally CSV or JSON for the data itself.

We also broke down the Data Package spec into Data Package itself and Data Resource. The Data Resource spec describes an individual data file, and a Data Package is a collection of one or more Data Resources with additional dataset-level metadata.
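To make the composition concrete, here is a minimal sketch in Python that assembles a Tabular Data Package descriptor from its parts. The field names, the resource name, and the file name `cities.csv` are hypothetical and exist only for illustration; the property names (`name`, `resources`, `path`, `profile`, `schema`, `fields`) follow the published specs. Treat it as an illustration of how the pieces nest, not as a complete or validated descriptor.

```python
import json

# Table Schema: describes the structure of one table.
# The two fields below are made-up examples.
table_schema = {
    "fields": [
        {"name": "id", "type": "integer"},
        {"name": "city", "type": "string"},
    ]
}

# Data Resource: describes one data file. Attaching a Table Schema
# is what turns it into a Tabular Data Resource.
resource = {
    "name": "cities",
    "path": "cities.csv",  # hypothetical CSV file holding the data
    "profile": "tabular-data-resource",
    "schema": table_schema,
}

# Data Package: one or more resources plus dataset-level metadata.
package = {
    "name": "example-package",
    "title": "Example package",
    "profile": "tabular-data-package",
    "resources": [resource],
}

# Serialise the descriptor; by convention it is saved as datapackage.json.
descriptor = json.dumps(package, indent=2)
print(descriptor)
```

Each layer stays usable on its own: `table_schema` could describe a table outside any package, and `resource` could be published without the surrounding `package`.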

Example: Data Resource spec + Table Schema spec becomes a Tabular Data Resource spec

graph TD

  dr[Data Resource] --add table schema--> tdr[Tabular Data Resource]

Example: How a Tabular Data Package is composed out of other specs

graph TD

  dr[Data Resource] --> tdr
  tdr[Tabular Data Resource] --> tdp[Tabular Data Package]
  dp[Data Package] --> tdp
  jts[Table Schema] --> tdr
  csvddf[CSV Data Descriptor] -.optional.-> tdr

  style tdp fill:#f9f,stroke:#333,stroke-width:4px;

Design Philosophy

Simplicity

Seek zen-like simplicity in which there is nothing to add and nothing to take away.

Extensibility

Design for extensibility and customisation. This makes hard things possible and permits future evolution -- nothing we build will be perfect.

Human-editable and machine-usable

Specs should preserve human readability and editability whilst making machine-use easy.

Reuse

Reuse and build on existing standards and formats.

Cross technology

Support a broad range of languages, technologies and infrastructures -- avoid being tied to any one specific system.

Contribute

Contributions, comments and corrections are warmly welcomed. Most work proceeds in an RFC-style manner with discussion in the issue tracker.

Material is kept in a Git repository on GitHub: fork it and submit a pull request to add material. Use the issue tracker for specific issues or suggestions.

For Editors

This repository is the canonical repository for the core Frictionless Data specifications. The repository features:

JSON Schema representations of all specifications. These are used both in the site itself, to generate the specification pages, and in the schema registry that is used by a range of libraries that implement the specifications.

Quick start

Clone the repository
npm install # install the dependencies to build the specifications
npm run build # build the specifications
npm run test # test the specifications
npm start # start the local server

Contribute to the specifications

All the source data for the specifications is in the /schemas directory. In there, you will find a .json file for each specification and a set of YAML files under /schemas/dictionary/*. There is a build.js script to build the specifications.

.json files are JSON Schemas for each spec, normalised using the $ref feature of JSON Schema. This normalisation ensures consistency in the way the specifications are written and validated, but is only used directly by the build.js script, which generates denormalised versions.
/build.js creates denormalised versions of each specification by dereferencing each $ref in the source schemas, and then saves these denormalised versions to the /build/schemas directory.
/schemas/dictionary/* has all the property definitions for each specification. This is the place to add new properties or property collections, to edit contextual information and descriptive examples, and so on. See how this information is rendered in the macros template.
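The idea of denormalising a schema by dereferencing `$ref` can be sketched in a few lines of Python. This is not the code in build.js: it is a simplified illustration that only handles local references of the form `#/...` inside a single document, and it assumes the references contain no cycles. The function names `resolve_pointer` and `dereference`, and the sample schema, are hypothetical.

```python
import json


def resolve_pointer(root, ref):
    """Follow a local reference such as '#/definitions/name' inside root."""
    if not ref.startswith("#/"):
        raise ValueError(f"this sketch only handles local references: {ref}")
    node = root
    for token in ref[2:].split("/"):
        # Undo JSON Pointer escaping (~1 is '/', ~0 is '~').
        token = token.replace("~1", "/").replace("~0", "~")
        node = node[int(token)] if isinstance(node, list) else node[token]
    return node


def dereference(node, root):
    """Return a copy of node with every local $ref replaced by its target."""
    if isinstance(node, dict):
        if "$ref" in node:
            # Resolve the target, then dereference it in turn.
            return dereference(resolve_pointer(root, node["$ref"]), root)
        return {key: dereference(value, root) for key, value in node.items()}
    if isinstance(node, list):
        return [dereference(item, root) for item in node]
    return node


# A made-up normalised schema: the property reuses a shared definition.
schema = {
    "definitions": {"name": {"type": "string"}},
    "type": "object",
    "properties": {"name": {"$ref": "#/definitions/name"}},
}

flat = dereference(schema, schema)
print(json.dumps(flat, indent=2))
```

The source schema is left untouched, so the normalised form stays the single place to edit while the flattened copy is what gets published.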

Adding a new specification

Yes, we welcome and encourage additions to the registry! Any spec that is added must meet the following criteria:

Be related to the Data Packages family of specifications.
Have a publicly-accessible web page describing the specification.
Have a JSON Schema file that describes the specification.

See the existing entries in the registry, and then take the following steps to add a new entry:

Make a new pull request called registry/{NAME_OF_SPECIFICATION}.
Include a JSON Schema file for the new specification in the pull request, and add the spec to registry.csv.
Write a brief description of the spec as part of the pull request.
