Creativity Sharing and
Entrepreneurial Support

CSV-X: A Linked Data Enabled Schema Language, Model, and Processing Engine for Non-Uniform CSV

License : CRAPL (http://matt.might.net/articles/crapl/)
CSV-X is a schema language, model, and processing engine for non-uniform CSV enabling annotation, validation, cross-referencing, Linked Data, RDF serialization, and transformation to other formats.
Update: Jan 22, 2017

Text detail
【エントリー部門】基盤技術部門 【応募者属性】学生 【応募者名】Wirawit Chaochaisit 【エントリー作品の権利指定】CRAPL 【エントリー作品のURL】https://github.com/nabito/csv-x 【利用しているオープンデータ】Tokyo Air Pollution Sensor Measurements, UK Telecom Revenue, LAPD Crime Report, etc. 【利用しているパートナーリソース】- 【エントリー作品の詳細説明】Please refer to below texts.
Update: Jan 15, 2017 (nabito)
Text detail
CSV-X is a schema language, model, and processing engine for non-uniform CSV enabling annotation, validation, cross-referencing, Linked Data, RDF serialization, and transformation to RDF and other formats. [Features] - Parsing (Of course!) CSV, TSV, and SSV (Space-Separated Value) - Annotation, IRI support, thus Linked Data enabled - Validation (XML Schema Datatype + RegEx) - Value Alteration - Cross-Referencing (Dynamic Variable Declaration & Ref, RegEx capturing group) - RDF Serialization (Turtle) - Template-based Transformation - Commandline Interface (CLI) - New! in v0.11.0 Embedded Scripting (JavaScript)
Update: Jan 15, 2017 (nabito)
Web page detail
[GitHub] A fully open source implementation of CSV-X processing engine with schema specification, API documentation, and pre-compiled binary. There is also a link to online demo application at http://www.dadfha.com:3232
GitHub - nabito/csv-x: A Linked Data Enabled Schema Language, Model, and Processing Engine for Non-Uniform CSV https://github.com/nabito/csv-x
csv-x - A Linked Data Enabled Schema Language, Model, and Processing Engine for Non-Uniform CSV
Update: Jan 15, 2017 (nabito)
Text detail
Abstract: At present there are numbers of tools and schemas to aid in publishing CSV as RDF. However, there are still many non-uniform CSV datasets, which cannot effectively be processed using available solutions. This paper proposes CSV-X, a novel schema language, model, and processing engine for non-uniform CSV enabling annotation, validation, cross-referencing, linked data, RDF serialization, and transformation to other formats. Common non-uniform patterns are identified and used as a basis to create language constructs that can describe these patterns. The value reference via dynamic variable declaration making it possible to describe arbitrary relations between schema entities. Each schema entity can be mapped with a parameterizable template, allowing flexible transformation to other formats. The language processor is implemented and evaluated against real- world non-uniform CSV datasets. The results show that CSV-X can effectively process CSVs with non-uniform patterns into more advance data model in RDF.
Update: Jan 15, 2017 (nabito)
Paper detail
To be appeared in IEEE Conference Proceedings published by IEEE Computer Society Press (indexed by EI).
CSV-X: A Linked Data Enabled Schema Language, Model, and Processing Engine for Non-Uniform CSV
Wirawit Chaochaisit, Ken Sakamura, Noboru Koshizuka, Masahiro Bessho
IEEE Conference Proceedings, (2017)
Update: Jan 15, 2017 (nabito)
Slide detail
Update: Jan 16, 2017 (nabito)

Grant Information

Suggested fund information for realizing the idea.

Comments