This data as JSON, CSV
|Beyond the WARC: Making Web Archives More Useful and User-friendly
|May 9 2019
|Archives of the web contain not only web pages but any type of data.
The only standard in web archiving is the ISO WARC file format, which specifies raw data captured from the web. However, the WARC files often lack any context or metadata about how this data was captured. The talk will briefly cover the basics of the WARC format, and also provide possible ideas for making web archiving data more user-friendly, present existing tools and suggest ideas for interoperable ways to describe collections and make sense of growing web archive data beyond the WARC format.