Linked Open Data for Structured Data Formats
Structured data refers to any data that resides in a fixed field within a record or file. This can include data contained in relational databases, spreadsheets, or marked up text. Structured data may also be plain-text delimited.
The significant properties of structured data records are documented in the Structured Data: Generic Preservation Plan, which can be used as test criteria for tools and processes used in format transformations.
NARA makes its Linked Open Data available in Resource Description Framework Terse RDF Triple Language or RDF Turtle (.ttl files). These files can be opened in any text editor. The Digital Preservation Framework as Linked Open Data includes the same elements as are available in the version of the Preservation Plans on GitHub.
These plans are not exhaustive nor universally applicable proposed actions and recommended or endorsed tools: these represent file formats and variant versions in NARA holdings, the current NARA risk assessment, processing capabilities, and tools in use at NARA.
Format Name | File Extension(s) | Digital Preservation Framework Category/Categories | NARA Format ID | NARA Linked Open Data TTL |
---|---|---|---|---|
Comma Separated Values | csv | Structured Data | NF00143 | https://www.archives.gov/files/lod/dpframework/id/NF00143.ttl |
Extended Binary Coded Decimal Interchange Code (EBCDIC) | ebcdic | Structured Data | NF00183 | https://www.archives.gov/files/lod/dpframework/id/NF00183.ttl |
Extensible Forms Description Language (XFDL) | xfdl | Web Records|Software and Code|Structured Data|Textual and Word Processing | NF00686 | https://www.archives.gov/files/lod/dpframework/id/NF00686.ttl |
eXtensible Markup Language 1.0 | xml | Web Records|Software and Code|Structured Data|Textual and Word Processing | NF00187 | https://www.archives.gov/files/lod/dpframework/id/NF00187.ttl |
eXtensible Markup Language 1.1 | xml | Web Records|Software and Code|Structured Data|Textual and Word Processing | NF00561 | https://www.archives.gov/files/lod/dpframework/id/NF00561.ttl |
eXtensible Markup Language unspecified version | xml | Web Records|Software and Code|Structured Data|Textual and Word Processing | NF00654 | https://www.archives.gov/files/lod/dpframework/id/NF00654.ttl |
eXtensible Metadata Platform | xmp | Structured Data | NF00189 | https://www.archives.gov/files/lod/dpframework/id/NF00189.ttl |
HLM Multivariate Data Matrix Format | mdm | Structured Data | NF00721 | https://www.archives.gov/files/lod/dpframework/id/NF00721.ttl |
JavaScript Object Notation (JSON) | json|txt | Structured Data | NF00218 | https://www.archives.gov/files/lod/dpframework/id/NF00218.ttl |
Mathematica Computable Document Format | cdf | Structured Data | NF00582 | https://www.archives.gov/files/lod/dpframework/id/NF00582.ttl |
Microsoft Project 2000-2003 | mpp | Presentation and Publishing|Structured Data | NF00682 | https://www.archives.gov/files/lod/dpframework/id/NF00682.ttl |
Microsoft Project 2007 | mpp | Presentation and Publishing|Structured Data | NF00683 | https://www.archives.gov/files/lod/dpframework/id/NF00683.ttl |
Microsoft Project 2010 | mpp | Presentation and Publishing|Structured Data | NF00684 | https://www.archives.gov/files/lod/dpframework/id/NF00684.ttl |
Microsoft Project 4.0 | mpp | Presentation and Publishing|Structured Data | NF00679 | https://www.archives.gov/files/lod/dpframework/id/NF00679.ttl |
Microsoft Project 95 | mpp | Presentation and Publishing|Structured Data | NF00680 | https://www.archives.gov/files/lod/dpframework/id/NF00680.ttl |
Microsoft Project 98 | mpp | Presentation and Publishing|Structured Data | NF00681 | https://www.archives.gov/files/lod/dpframework/id/NF00681.ttl |
Microsoft Project unspecified version | mpp | Presentation and Publishing|Structured Data | NF00842 | https://www.archives.gov/files/lod/dpframework/id/NF00842.ttl |
OpenProj Project | pod | Presentation and Publishing|Structured Data | NF00781 | https://www.archives.gov/files/lod/dpframework/id/NF00781.ttl |
Resource Description Framework (RDF) XML Triple | rdf | Structured Data | NF00605 | https://www.archives.gov/files/lod/dpframework/id/NF00605.ttl |
SEG-Y rev 0 | sgy|segy | Geospatial|Structured Data | NF00845 | https://www.archives.gov/files/lod/dpframework/id/NF00845.ttl |
Standard Generalized Markup Language (SGML) | sgm|sgml | Structured Data | NF00410 | https://www.archives.gov/files/lod/dpframework/id/NF00410.ttl |
STATA Data file version 118 | dta | Presentation and Publishing|Structured Data | NF00696 | https://www.archives.gov/files/lod/dpframework/id/NF00696.ttl |
Structured Data eXchange Format | sdxf | Structured Data | NF00415 | https://www.archives.gov/files/lod/dpframework/id/NF00415.ttl |
Synchronized Multimedia Integration Language | smi|smil | Structured Data | NF00782 | https://www.archives.gov/files/lod/dpframework/id/NF00782.ttl |
Tab Separated Values | tab|tsv | Structured Data | NF00418 | https://www.archives.gov/files/lod/dpframework/id/NF00418.ttl |