Software, projects, standards, services and products for data preservation and archives.: Difference between revisions
mNo edit summary |
mNo edit summary |
||
(6 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
==Standards== |
==Standards== |
||
* BagIt: a hierarchical file packaging format designed to support disk-based storage and network transfer of arbitrary digital content. Implemantation of the Format are avaialble in Perl, Java Python and other languages. See the specification here: [https://tools.ietf.org/html/draft-kunze-bagit-10] |
* BagIt: a hierarchical file packaging format designed to support disk-based storage and network transfer of arbitrary digital content. Implemantation of the Format are avaialble in Perl, Java Python and other languages. See the specification here: [https://tools.ietf.org/html/draft-kunze-bagit-10] |
||
* PREMIS: The PREMIS Data Dictionary for Preservation Metadata is the international standard for metadata to support the preservation of digital objects and ensure their long-term usability. |
|||
** Homepage LoC http://www.loc.gov/standards/premis/ |
|||
** PREMIS Version 3.0 http://www.loc.gov/standards/premis/v3/index.html |
|||
** PREMIS - Schema 3.0 http://www.loc.gov/standards/premis/premis.xsd |
|||
==Software== |
==Software== |
||
* List of free and open source software by OAD: http://oad.simmons.edu/oadwiki/Free_and_open-source_repository_software |
|||
⚫ | |||
Not included are the following packets: |
|||
⚫ | |||
* Dataverse [http://thedata.org/]: A web application for Publishing, Citing, Analysing and Preserving Scientific data. |
* Dataverse [http://thedata.org/]: A web application for Publishing, Citing, Analysing and Preserving Scientific data. |
||
Line 16: | Line 22: | ||
Rosetta [http://www.exlibrisgroup.com/category/RosettaOverview]: A software product from the ExLibris company for archival and preservation. They have a public set of APIs in their developer network. They use a storage adaptor to allow different storage systems to integrate with Rosetta. |
Rosetta [http://www.exlibrisgroup.com/category/RosettaOverview]: A software product from the ExLibris company for archival and preservation. They have a public set of APIs in their developer network. They use a storage adaptor to allow different storage systems to integrate with Rosetta. |
||
Arkivum [http://arkivum.com/]: Offers validated and secure archive storage with 100% !! reliability (backed by insurance). |
Arkivum [http://arkivum.com/]: Offers validated and secure archive storage with 100% !! reliability (backed by insurance). JISC uses Arkivum as archive provider for their national offering: https://www.jisc.ac.uk/data-archiving-framework |
||
==Misc== |
==Misc== |
||
* DuraSpace Projects like Fedora Commons have recognised the typicalities of |
* DuraSpace Projects like Fedora Commons have recognised the typicalities of asynchronous access storage. An approach to have asynchronous acces from REST is described [https://wiki.duraspace.org/display/FF/Design+-+Asynchronous+REST+API] and the current roadmap of Fedora Commons that has asynchronous access to storage on the roadmap [https://wiki.duraspace.org/display/FF/Roadmap]. |
||
==Extensive overview== |
|||
[http://digitalpowrr.niu.edu/tool-grid/ link to large list of archive application references]. |
Latest revision as of 12:12, 27 May 2016
Standards
- BagIt: a hierarchical file packaging format designed to support disk-based storage and network transfer of arbitrary digital content. Implemantation of the Format are avaialble in Perl, Java Python and other languages. See the specification here: [1]
- PREMIS: The PREMIS Data Dictionary for Preservation Metadata is the international standard for metadata to support the preservation of digital objects and ensure their long-term usability.
- Homepage LoC http://www.loc.gov/standards/premis/
- PREMIS Version 3.0 http://www.loc.gov/standards/premis/v3/index.html
- PREMIS - Schema 3.0 http://www.loc.gov/standards/premis/premis.xsd
Software
- List of free and open source software by OAD: http://oad.simmons.edu/oadwiki/Free_and_open-source_repository_software
Not included are the following packets:
- ToMaR: a framework that provides a simple and flexible solution to run preservation tools on a Hadoop MapReduce cluster in a scalable fashion. [2]. ToMaR was developed within the EU SCAPE project [3]
- Dataverse [4]: A web application for Publishing, Citing, Analysing and Preserving Scientific data.
Projects
- SCAPE: The SCAPE project developes scalable services for preservation strategies on an open source platform for semi-automated workflows for large-scale complex digital objects [5].
- DuraSPACE: a collection ot tools used in LTDS [6].
Commercial Products and Offerings
Preservica [7]: software for archival and preservation. They can implement distributed storage using different storage backends. Standard protocols: OAI-PMH, CMIS, metadata and content via rest acessible. They say they can run (proven) at 500 MB/s (40 TB per day!!)
Rosetta [8]: A software product from the ExLibris company for archival and preservation. They have a public set of APIs in their developer network. They use a storage adaptor to allow different storage systems to integrate with Rosetta.
Arkivum [9]: Offers validated and secure archive storage with 100% !! reliability (backed by insurance). JISC uses Arkivum as archive provider for their national offering: https://www.jisc.ac.uk/data-archiving-framework
Misc
- DuraSpace Projects like Fedora Commons have recognised the typicalities of asynchronous access storage. An approach to have asynchronous acces from REST is described [10] and the current roadmap of Fedora Commons that has asynchronous access to storage on the roadmap [11].