Introducing the UK National Archives Metadata Vocabulary
Robert Walpole
Devexe Limited
Photo by Robert Walpole
This presentation is in no way intended to express views or opinions of the UK National Archives and is solely the work of Robert Walpole, an employee of Devexe Limited who were contracted to assist in the development of the Digital Records Infrastructure at The National Archives (TNA) in Kew, London, England.
Over 11 million historical government and public records.
Documents, maps, files and various kinds of images covering over 1000 years of history.
Photo by The National Archives
By 2025 it is expected that the Archives will receive almost exclusively born-digital records.
Photo by Regregex
Photo by Derrick Coetzee
OAIS - Open Archival Information System
Reference model developed by Consultative Committee for Space Data Systems (CCSDS) which became ISO 14721 in 2003
But what exactly is a SIP?
(or AIP or DIP for that matter)
The digital objects (files and folders) for preservation..
..together with metadata about these objects.
Why XML?
<XIP xmlns="http://www.tessella.com/XIP/v4">
<Collections>
<Collection status="same">
<CollectionRef>d94bcdd5-ea94-473f-9f3b-008fa93caeb8</CollectionRef>
<CollectionTypeRef>1</CollectionTypeRef>
<CollectionCode>LEVES</CollectionCode>
<Title>Inquiry into the Culture, Practices and Ethics of the Press
(The Leveson Inquiry)</Title>
<SecurityTag>open</SecurityTag>
</Collection>
</Collections>
<DeliverableUnits>
<DeliverableUnit status="same">
<DeliverableUnitRef>3140421b-02c3-9a06-1a197c497ba8</DeliverableUnitRef>
<CollectionRef>d94bcdd5-ea94-9f3b-008fa93caeb8</CollectionRef>
<AccessionRef>58310a0f-5fd4-b565-2ac46fff4d59</AccessionRef>
<AccumulationRef>025519fa-4ca3-bdbd-4538f0f44f5e</AccumulationRef>
<CatalogueReference>LEV/2/CHN2/Z</CatalogueReference>
<CoverageFrom>2014-02-11T21:16:09.380Z</CoverageFrom>
<CoverageTo>2014-02-11T21:16:09.380Z</CoverageTo>
<Title>LEV 2</Title>
<SecurityTag>open</SecurityTag>
<Metadata/>
<row>
<elem name="title">Letter.doc</elem>
<elem name="identifier">file:/T:/LEV_3/Letter.doc</elem>
<elem name="date">2013-05-13T14:26:56</elem>
<elem name="folder">file</elem>
<elem name="checksum">file</elem>
<elem name="copyright">Crown copyright</elem>
</row>
<dcterms:identifier>WO/409/27/1</dcterms:identifier>
<Metadata/> element defined as follows:
"Arbitary contents, which may conform to an XML Schema. Used to store extra metadata, particularly descriptive metadata... Allows for controlled extension of the schema"
<dcterms:identifier xsi:type="tnacat:itemIdentifier">
<departmentCode>WO</departmentCode>
<seriesCode>409</seriesCode>
<pieceCode>27/1</pieceCode>
<itemCode>1</itemCode>
</dcterms>
Provides a persistent inventory and process control system for DRI based on Semantic Web technologies:
https://goo.gl/NexoRC
<rdf:Description rdf:about="http://example.org/book/1234">
<ex:title>A Good Book</ex:title>
</rdf:Description>
<rdf:Description rdf:about="http://example.org/book/1234"
ex:title="A Good Book"/>
"RDF/XML never became popular with XML people because of the potential difficulty and complexity in processing it.." Bob du Charme (Learning SPARQL)
Photo by Jason Coleman
There may be more in the future...
<tna:BornDigitalRecord rdf:about="http://example.org/66/LEV/2/D4SL/Z">
<tna:legalStatus>Public Record</tna:legalStatus>
</tna:BornDigitalRecord>
<tna:BornDigitalRecord rdf:about="http://example.org/66/LEV/2/D4SL/Z">
<tna:legalStatus rdf:resource="http://example.org/Public_record"/>
</tna:BornDigitalRecord>
<dcterms:identifier xsi:type="tnacat:itemIdentifier">
<departmentCode>WO</departmentCode>
<seriesCode>409</seriesCode>
<pieceCode>27/1</pieceCode>
<itemCode>1</itemCode>
</dcterms>
<tna:BornDigitalRecord rdf:about="http://example.org/66/LEV/2/D4SL/Z">
<tna:cataloguing>
<tna:Cataloguing>
<tna:departmentIdentifier>WO</tna:departmentIdentifier>
<tna:seriesIdentifier>409</tna:seriesIdentifier>
<tna:pieceIdentifier>27/1</tna:piecceIdentifier>
<tna:itemIdentifier>1</tna:itemIdentifier>
</tna:Cataloguing>
</tna:cataloguing>
</tna:BornDigitalRecord>
https://github.com/digital-preservation/dri-vocabulary
tna.owl
A work in progress...
...but mature enough to share.