Tag Archives: Digital Archives

Preserving the City’s website with Archive-It

We are pleased to announce that we have begun preserving and providing access to crawls (snapshots) of the City’s website using Archive-It, a web application developed and managed by the Internet Archive. Archive-It uses an open-source crawler called Heritrix to crawl specific web content based on instructions provided by the user (in our case, that’s us), and the venerable Wayback Machine to provide access. Over time, the preserved crawls will show how the City’s website has changed in terms of content, look and feel.

vancouver.ca today

How it works

Each crawl directs Heritrix to one or more “seed” URLs, which you can think of as the starting points of the crawl. From each seed, Heritrix browses through all links and saves any content it encounters that falls within the scoping rules for the crawl. Crawled content is saved in the WARC file format, an ISO standard for storing web content.
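To make the seed-and-scope idea more concrete, here is a minimal sketch in Python. It is not Heritrix's actual code and it does not touch the network or write WARC files: the `LINKS` table, the page paths in it, and the `in_scope` and `crawl` functions are all hypothetical names made up for illustration. The scoping rule assumed here is simply "stay on an allowed host"; real scoping rules can be considerably more detailed.

```python
from collections import deque
from urllib.parse import urlparse

# Hypothetical stand-in for the web: each page URL maps to the links found on it.
# The paths below are invented for illustration only.
LINKS = {
    "https://vancouver.ca/": [
        "https://vancouver.ca/parks.aspx",
        "https://example.org/news",
    ],
    "https://vancouver.ca/parks.aspx": [
        "https://vancouver.ca/",
        "https://vancouver.ca/parks/map.aspx",
    ],
    "https://vancouver.ca/parks/map.aspx": [],
    "https://example.org/news": ["https://example.org/more"],
}


def in_scope(url, allowed_hosts):
    """Assumed scoping rule: a URL is in scope if its host is on the allowed list."""
    return urlparse(url).hostname in allowed_hosts


def crawl(seeds, allowed_hosts, links=LINKS):
    """Breadth-first walk from the seeds, keeping only in-scope URLs."""
    queue = deque(seeds)   # URLs waiting to be visited, starting with the seeds
    seen = set(seeds)      # URLs already queued, so nothing is visited twice
    captured = []          # URLs "saved", in the order they were visited

    while queue:
        url = queue.popleft()
        captured.append(url)  # a real crawler would store the response here
        for link in links.get(url, []):
            if link not in seen and in_scope(link, allowed_hosts):
                seen.add(link)
                queue.append(link)
    return captured


if __name__ == "__main__":
    for url in crawl(["https://vancouver.ca/"], {"vancouver.ca"}):
        print(url)
```

Starting from the single seed, the sketch visits the three vancouver.ca pages and skips the example.org links, because they fall outside the assumed scope.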

Association of Moving Image Archivists Conference 2016

This past November I attended the 2016 conference of the Association of Moving Image Archivists (AMIA). It took place in Pittsburgh, Pennsylvania – the City of Bridges. Here are a few of the highlights.

The Roberto Clemente Bridge – one of 446 in Pittsburgh! Photograph by Jana Grazley

HACK DAY

A pre-conference tradition since 2013 is the AMIA/DLF (Digital Library Federation) Hack Day, wherein participants collaborate on short projects to develop solutions to various problems associated with moving image preservation and access. Hack Day is a free event focused on practical outcomes and skill-sharing amongst developers and non-developers. This year’s projects included: