Skip to Content

Practice

Title Updatedsort icon
Personal Digital Archiving 2011 Conference

The Personal Digital Archiving 2011 Conference will be held February 24-25, 2011 at The Internet Archive in San Francisco, CA.

6 years 43 weeks ago
GNU Wget

GNU Wget is a free software package for retrieving files using HTTP, HTTPS and FTP, the most widely-used Internet protocols. It is a non-interactive commandline tool, so it may easily be called from scripts, cron jobs, terminals without X-Windows support, etc.

6 years 43 weeks ago
Internet Researcher and Offline Commander - commercial off-line browsing software for Windows

Internet Researcher and Offline Commander is commercial off-line browsing software for Windows.

6 years 43 weeks ago
Library of Congress Transfer Tools

These are tools developed by the Library of Congress and their partners in the National Digital Information Infrastructure and Preservation Program (NDIIPP) for the purpose of validation and transfer of data that conforms to the BagIt specification.

6 years 43 weeks ago
GAip

GAip (Gloucestershire Archives ingest packager) is a proof of concept demonstration system written in perl. It provides archivists and others with the means to, 1. ingest a digital object and create the associated Archival Information Package (AIP), 2. compile metadata for the digital object which is included in the AIP, and 3. create Dissemination Information Packages from AIPs in order to provide access to the ingested digital object. GAip operates by way of a graphical user interface.

6 years 43 weeks ago
DeepArc

DeepArc was developed by the National Library of France (BnF) with XQuark to transform relational database content into XML for archiving purposes. It is part of the International Internet Preservation Consortium (IIPC) tool suite for web archiving.

6 years 43 weeks ago
BAT: BnfArcTools

BAT is a Perl package for processing Internet Archive ARC, DAT and CDX file format. This package was developped and is still maintained by the National Library of France (BnF) and is distributed under the GPL licence.

6 years 43 weeks ago
Data Fountains

Data Fountains is a tool for discovering and describing Internet resources about a particular topic. After signing on the user is guided through a series of Web pages that generate information describing a particular topic.

6 years 43 weeks ago
The DeDuplicator (Heritrix add-on module)

The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.

6 years 43 weeks ago
Find It! Keep It!

Find It! Keep It! is a tool to save and organise web content.

6 years 43 weeks ago
Syndicate content


about seo