Software, projects, standards, services and products for data preservation and archives.
From Lsdf
Standards
- BagIt: a hierarchical file packaging format designed to support disk-based storage and network transfer of arbitrary digital content. Implemantation of the Format are avaialble in Perl, Java Python and other languages. See the specification here: [1]
Software
- ToMaR: a framework that provides a simple and flexible solution to run preservation tools on a Hadoop MapReduce cluster in a scalable fashion. [2]. ToMaR was developed within the EU SCAPE project [3]
- Dataverse [4]: A web application for Publishing, Citing, Analysing and Preserving Scientific data.
Projects
- SCAPE: The SCAPE project developes scalable services for preservation strategies on an open source platform for semi-automated workflows for large-scale complex digital objects [5].
Commercial Products and Offerings
Preservica [6]: software for archival and preservation. They can implement distributed storage using different storage backends. Standard protocols: OAI-PMH, CMIS, metadata and content via rest acessible. They say they can run (proven) at 500 MB/s (40 TB per day!!)
Rosetta [7]: A software product from the ExLibris company for archival and preservation. They have a public set of APIs in their developer network. They use a storage adaptor to allow different storage systems to integrate with Rosetta.
Arkivum [8]: Offers validated and secure archive storage with 100% !! reliability (backed by insurance).