Difference between revisions of "Software, projects, standards, services and products for data preservation and archives."

Revision as of 12:35, 25 September 2014

Standards

  • BagIt: a hierarchical file packaging format designed to support disk-based storage and network transfer of arbitrary digital content. Implementations of the format are available in Perl, Java, Python and other languages. See the specification here: [1]
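
As a rough illustration of what such a package looks like on disk, the following minimal sketch writes a bag using only the Python standard library: a `bagit.txt` declaration, a `data/` payload directory and a SHA-256 manifest. The helper names `make_bag` and `verify_bag` are hypothetical, and the version string written to `bagit.txt` is an assumption that should be adjusted to the specification version being targeted. It is not a replacement for the existing BagIt libraries.

```python
import hashlib
import tempfile
from pathlib import Path


def make_bag(bag_dir: Path, payload: dict) -> Path:
    """Write a minimal bag: bagit.txt, a data/ payload, and a SHA-256 manifest.

    `payload` maps relative file names to their content as bytes.
    """
    data_dir = bag_dir / "data"
    data_dir.mkdir(parents=True, exist_ok=True)

    manifest_lines = []
    for name, content in sorted(payload.items()):
        target = data_dir / name
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_bytes(content)
        digest = hashlib.sha256(content).hexdigest()
        # Manifest paths are relative to the bag root and use forward slashes.
        manifest_lines.append(f"{digest}  data/{name}")

    # Bag declaration; the version string here is an assumption.
    (bag_dir / "bagit.txt").write_text(
        "BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n",
        encoding="utf-8",
    )
    (bag_dir / "manifest-sha256.txt").write_text(
        "\n".join(manifest_lines) + "\n", encoding="utf-8"
    )
    return bag_dir


def verify_bag(bag_dir: Path) -> bool:
    """Recompute each payload checksum and compare it with the manifest."""
    manifest = (bag_dir / "manifest-sha256.txt").read_text(encoding="utf-8")
    for line in manifest.splitlines():
        digest, rel_path = line.split(maxsplit=1)
        actual = hashlib.sha256((bag_dir / rel_path).read_bytes()).hexdigest()
        if actual != digest:
            return False
    return True


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        bag = make_bag(Path(tmp) / "example_bag", {"readme.txt": b"hello archive\n"})
        print(sorted(p.name for p in bag.iterdir()))
        print("valid:", verify_bag(bag))
```

The manifest is what makes a bag useful for transfer and long-term storage: the receiver can recompute the checksums and detect a corrupted or missing payload file without any other metadata.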

Software

  • ToMaR [2]: a framework that provides a simple and flexible way to run preservation tools on a Hadoop MapReduce cluster in a scalable fashion. ToMaR was developed within the EU SCAPE project [3].
  • Dataverse [4]: a web application for publishing, citing, analysing and preserving scientific data.

Projects

  • SCAPE: The SCAPE project develops scalable services for preservation strategies on an open source platform for semi-automated workflows for large-scale complex digital objects [5].
  • DuraSPACE: a collection of tools used in LTDS [6].

Commercial Products and Offerings

Preservica [7]: Software for archival and preservation. It can implement distributed storage on different storage backends. Standard protocols supported: OAI-PMH and CMIS; metadata and content are accessible via REST. The vendor states a proven throughput of 500 MB/s (about 40 TB per day).
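
The daily figure can be checked from the quoted rate with a short calculation. This sketch assumes decimal units (1 TB = 1,000,000 MB) and a sustained rate over a full 24 hours; the function name `tb_per_day` is chosen here for illustration.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def tb_per_day(mb_per_second: float) -> float:
    """Convert a sustained rate in MB/s to TB/day (decimal units)."""
    return mb_per_second * SECONDS_PER_DAY / 1_000_000


if __name__ == "__main__":
    print(f"{tb_per_day(500):.1f} TB/day")  # 43.2 TB/day
```

At 500 MB/s this gives 43.2 TB per day, so the rounded "40 TB per day" is consistent with the quoted rate.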

Rosetta [8]: A software product from the ExLibris company for archival and preservation. It offers a public set of APIs through its developer network and uses a storage adaptor to let different storage systems integrate with Rosetta.

Arkivum [9]: Offers validated and secure archive storage with an advertised 100% reliability (backed by insurance).

Misc

  • DuraSpace projects such as Fedora Commons have recognised the particularities of asynchronous storage. Two links from their web site: a design for asynchronous access via REST [10], and the current Fedora Commons roadmap, which includes asynchronous access to storage [11].
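
To illustrate why asynchronous storage needs a different access pattern, here is a generic request-then-poll sketch. It is purely illustrative and does not reproduce the Fedora Commons design or any DuraSpace API: the class `SlowStore`, the function `fetch` and the `polls_needed` parameter are hypothetical, and the store is simulated in memory rather than reached over HTTP.

```python
from enum import Enum


class State(Enum):
    STAGING = "staging"
    READY = "ready"


class SlowStore:
    """Hypothetical stand-in for tape-like storage that stages objects slowly."""

    def __init__(self, objects, polls_needed=2):
        self._objects = objects
        self._polls_needed = polls_needed
        self._pending = {}

    def request(self, object_id):
        """Start staging and return at once (comparable to HTTP 202 Accepted)."""
        if object_id not in self._objects:
            raise KeyError(object_id)
        self._pending.setdefault(object_id, self._polls_needed)
        return State.STAGING

    def poll(self, object_id):
        """Return (state, content); content is None until staging completes."""
        remaining = self._pending[object_id]
        if remaining > 0:
            self._pending[object_id] = remaining - 1
            return State.STAGING, None
        return State.READY, self._objects[object_id]


def fetch(store, object_id, max_polls=10):
    """Request an object, then poll until it is ready or the poll budget runs out."""
    store.request(object_id)
    for _ in range(max_polls):
        state, content = store.poll(object_id)
        if state is State.READY:
            return content
    raise TimeoutError(f"{object_id} not staged after {max_polls} polls")


if __name__ == "__main__":
    store = SlowStore({"obj-1": b"archived bytes"})
    print(fetch(store, "obj-1"))
```

A real client would wait between polls and would usually receive a status URL from the initial request; both are omitted here to keep the sketch short.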