Mapping myExperiment.org Packs to the DataBank Repository
I. Problem Summary
The impetus for the internship was the user-driven need for a mechanism with which to archive and reference Packs in scholarly communication.
Pack to citable object
http://www.doi.org/logo.html
II. Initial Proposals
Research Object Principles
http://www.researchobject.org/overview/
Pack Formalization
- Packs to be formalized into Research Objects
- Support for this is being developed in myExperiment (Alpha) implementation of the RO API
- May also be formed manually using the RO Manager console application
Deposit in DataBank
- Existing Support for RO Ingest
- RDF Manifests should be mergable
- Compressed directories retain structure when unpacked in DataBank
- Bottlenecks
- Databank is unaware of structure of objects
- Only unpacks top level of files
Unique Identification
- Value of issuing a DOI
- Existing DOIs wrapped in the RO should be documented in the manifest
- DOI will resolve to the DataBank landing page for the RO
III. Summary of Work Completed
Interviewed Stakeholders
- Oxford e-Research Centre myExperiment team
- Bodleian Digital Library Systems and Services
- Research Object model developer
Assessed RO Deposit Workflows
- Current DataBank web interface
- Merging manifests
- Exposing structural metadata
Developed Recommendations
- Research Object final report
IV. Recommendations
Finzalize myExperiment Alpha2 Implementation of RO API
- Successfully produce ROs from Packs
Improve Discoverability
- Develop customized index from RO manifests
- Extract metadata from nested manifests for indexing
http://www.realmdigital.co.za/post/bridging-the-online-book-discoverability-gap-part-/
Pre-Processing ROs
- Processing on myExperiment end for DataBank submission information package
- Unpack zip files
- Update manifest with new metadata
- Including existing DOIs or other unique identifiers
V. Proposed Lifecycle
Pack is in Archived state
Eelke van der Horst http://alpha2.myexperiment.org/packs/559
User Initiates Deposit
RO Generated
RO Pre-Proccessed
-
RO pre-processed by myExperiment interface to utilize DataBank API calls
http://www.xtremeesolutions.com/xappso-process/
Processed RO Deposited in DataBank
- DOI issued at time of ingestion
Manifest Indexed
VI. Future Work
Short Term: External Resources
- Archiving associated external objects
- Attribution & copyright challenges
- Will it still be verifyable, reproducible?
Long term: Provenance
- Capture and embed provenance metadata in archived RO
- W3C PROV Model may be a useful point of entry
RO Oxford Presentation
By Jamie Wittenberg
RO Oxford Presentation
Mapping myExperiment packs to DataBank
- 1,003