When you have data of a yet-undefined data structure, it is useful to manage them inside a CSV file which can be viewed and edited by a CSV editor, such as Excel.

This is extremely useful in developing a normalized structure for such data, as you can ensure that the existing data can be normalized according to a defined structure.

Ultimately, the data is to be meant to exported to a YAML file.

This script supports UTF-8 CSV files.

NOTE: This was originally developed to create over 50 normalized data models for ITU Operational Bulletin data. See https://github.com/ituob/ for more details.

== Installation

Add this line to your application's Gemfile:

[source,ruby]

gem 'structured_csv'

and then run:

[source,sh]

bundle install

Or install it without a Gemfile:

[source,sh]

gem install structured_csv

== Usage

[source,sh]

$ structured_csv_to_yaml [input-file.csv]

Where,

input-file.csv:: is the input CSV file, the output will be named as input-file.yaml.

== Details

A Structured CSV file has these properties:

Two structured sections. A section is defined by the first column on an otherwise empty row that is either the first row or a row preceded by an empty row. Two section types are allowed: METADATA and DATA.

The METADATA section has values organized like key-value pairs:

Column 1 is the name of key
Column 2 is the value

The key can be a normal string or namespaced:

foobar, this maps to the YAML key foobar:
foo.bar.boo, this maps to the YAML structure: +

[source,yaml]

foo: bar: boo:

A typical YAML output is like:

[source,yaml]

metadata: locale: bar: en: beef fr: boeuf jp: 牛肉 data: foo: bar: ...

A sample METADATA section looks like this table:

And generates this YAML:

[source,yaml]

metadata: locale: bar: en: beef fr: boeuf jp: 牛肉

The DATA section has values organized in a table form. The first row is the header row. The first column is assumed to be the key.

A sample DATA section looks like this table:

By default, this table generates this YAML format:

[source,yaml]

data: beef: foo: bar: en: beef fr: boeuf jp: 牛肉 description: Yummy! pork: foo: bar: en: pork fr: porc jp: 豚肉 description: Delicious! ...

In cases where there is no DATA key, you have to specify the type=array to generate an array:

[source,yaml]

data:

foo: bar: en: beef fr: boeuf jp: 牛肉 description: Yummy!
foo: bar: en: pork fr: porc jp: 豚肉 description: Delicious! ...

You are also allowed to specify the data types of columns. The types of string, boolean and integer are supported.

[source,yaml]

data: beef: foo: bar: en: beef fr: boeuf yummy: true availability: 3 pork: foo: bar: en: pork fr: porc yummy: false availability: 10 ...

== Examples

The samples/ folder contains a number of complex examples.

FAQs

What is structured_csv?

Is structured_csv well maintained?

Package last updated on 17 May 2021

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

structured_csv

[source,ruby]

gem 'structured_csv'

[source,sh]

bundle install

[source,sh]

gem install structured_csv

[source,sh]

$ structured_csv_to_yaml [input-file.csv]

[source,yaml]

foo: bar: boo:

[source,yaml]

metadata: locale: bar: en: beef fr: boeuf jp: 牛肉 data: foo: bar: ...

[source,yaml]

metadata: locale: bar: en: beef fr: boeuf jp: 牛肉

[source,yaml]

data: beef: foo: bar: en: beef fr: boeuf jp: 牛肉 description: Yummy! pork: foo: bar: en: pork fr: porc jp: 豚肉 description: Delicious! ...

[source,yaml]

[source,yaml]

data: beef: foo: bar: en: beef fr: boeuf yummy: true availability: 3 pork: foo: bar: en: pork fr: porc yummy: false availability: 10 ...

Related posts

Follow-up and Clarification on Recent Malicious Ruby Gems Campaign

ESLint Adds Support for Parallel Linting, Closing 10-Year-Old Feature Request