Transactional Archiving consists of selectively capturing and storing transactions that take place between a web client (browser) and a web server. Most existing web archives recurrently send out bots to crawl the content of web servers. This results in observations of a server's content at the time of crawling. Since the crawling frequency is generally not aligned with the change rate of a server's resources, this approach is typically not able to capture all versions of a server's resource. The resulting archive may provide an acceptable overview of a server's evolution over time, but it will not provide an accurate representation of the server's entire history. A SiteStory Web Archive, however, captures every version of a resource as it is being requested by a browser. The resulting archive is effectively representative of a server's entire history...
The File Information Tool Set (FITS) identifies, validates, and extracts technical metadata for various file formats. It wraps several third-party open source tools, normalizes and consolidates their output, and reports any errors. jhove, droid, etc. The current tools used are: * Jhove (LGPL version 2.1 or any later version) * Exiftool (GPL version 1 or any later version; or the artistic license) * National Library of New Zealand Metadata Extractor (Apache Public License version 2) * DROID (BSD version 3.0) * FFIdent (LGPL) o Note that the live site for ffident (http://schmidt.devlib.org/ffident/index.html) seems to have disappeared - we are now linking to Internet Archive's version of the ffident website. * File Utility (windows) (revised BSD)