Web Archiving Roundup: October 2019

The Internet Archive and Center for Open Science announce collaboration to preserve open science data, funded by the IMLS National Leadership Grants for Libraries program.

On August 12, Harvard Library Preservation Services hosted a regional meetup of Archive-it subscribers. You can learn about the meetup on the Harvard Library Communications blog.

The first workshop of the Continuing Education to Advance Web Archiving (CEDWARC) is taking place at George Washington University on October 28.  It is the first in a series of workshops to train library and archive professionals to use web archiving tools to answer research questions and enable new web archiving services based on these tools.

The Archives Unleashed Project has put out a call for participation for an Archives Unleashed Datathon at Columbia University Libraries on March 26-27, 2020. The project team invites archivists, researchers, librarians, computer scientists, and web archiving enthusiasts to join us over the course of two days to collaboratively work with web collections and explore cutting-edge research tools through hands-on experience. Proposals are due November 1, 2019.

Presentations from the Web Archiving & Data Services international partners meeting on September 20, 2019 at iPRES 2019 can be viewed here.

Call for papers opens on October 14, 2019 for the International Internet Preservation Consortium (IIPC) General Assembly and the Web Archiving Conference (WAC) in Montréal, from May 11 ̶ 13, 2020.

Article on web archiving, “Please, My Digital Archive. It’s Very Sick.” by Tanner Howard was published by the Lapham’s Quarterly on September 4, 2019.

Webrecorder had a bunch of updates! First, Webrecorder released Autopilot, a new feature that can perform actions on the current web page loaded in Webrecorder, similar to a human user: clicking buttons, scrolling down, expanding sections, and so on. Webrecorder also released a Desktop app. The app is a fully self-contained version of the online service built for a desktop environment with minimal modifications: a single-user Webrecorder that can run on a computer without requiring a connection to any centralized service. Users can now have the full web archiving capabilities of Webrecorder on their own machines without having to install Docker or use the command-line.

Finally, WABAC (Web Archive Browsing Advanced Client), a project by Webrecorder, was released. More information on the WABAC and client-side replay technology can be found in a blog post by Ilya Kreymer on the Webrecorder blog and DSHR’s blog.

Upcoming US meetups: DLF Forum & NDSA Digital Preservation, October 14-17 in Tampa, FL and Web Archiving Texas on November 6 at Baylor University.

Do you have #webarchiving news?  Tweet at us @WebArch_RT and we will be sure to feature it in next month’s roundup!

 

Advertisements

Web Archiving Roundup: July, 2019

The Web Archiving and Metadata Digital Object Sections will hold a joint event during the SAA Annual Meeting in Austin, TX. Join us on Saturday, August 3rd for a debate on descriptive metadata and web archiving.

The 2019 Archive-It Partner Meeting coincides with SAA’s Annual Meeting, registration is still open.

Graphic Designer Sam Henri Gold has been archiving Apple ads from the 1970s to the present, you can take a look at the archive directly from the article.

ArchiveSpark 3.0 is now available, take a look a the updates in GitHub.

Check out this article about a High School student’s experience working for the Archives Unleashed team.

The latest issue of the Newsletter from the ESRC National Centre for Research Methods includes an article on research challenges using web archives for social research.

Registration is still open for the Specialized Data Curation Workshop hosted by the Data Curation Network at Washington University in St. Louis.

The Digital Preservation Coalition is crowd-sourcing a list of endangered digital materials. Nominations close on Friday August 30th, 2019.

Web Archiving Roundup: June, 2019

This month the Sunlight Foundation, an organization advocating for open government, published an article on the weaknesses in Federal Agencies’ archival practices.

Catch the livestream recording from the IIP Web Archiving Conference.

You can now take a look at the slides from Ian Milligan & Nick Ruest’s presentation on the Archives Unleashed Cloud Project at IIPC Web Archiving Conference.

Both paper and presentation slides on Judging Visual Correspondence in Web Archives are now available. This presentation was done by Brenda Reyes Ayala at the ACM/IEE Joint Conference on Digital Libraries (JCDL).

The Web Science and Digital Libraries Research Group at Old Dominion University has posted a trip report on the ACM/IEEE Joint Conference on Digital Libraries (JCDL).

Rhizome recently announced an opportunity for Webrecorder users and community members to financially support Webrecorder, some of the benefits include full range of tools and expanded storage for web collections.

Check out this i-D magazine article on the case for archiving our digital lives.

Web Archiving Roundup: May, 2019

UPDATE – Join the ALCTS Metadata Interest Group Meeting during ALA Annual 2019 for a presentation and Q&A on the Library of Congress Web Archiving Program on Sunday, June 23, 2019, 9:00-10:00AM at the Marriott Marquis.

Now accepting nominations for the SAA Web Archiving Section’s 2019-2020 Steering Committee: https://www2.archivists.org/groups/web-archiving-section/now-accepting-2019-2020-steering-committee-nominations.

Registration for the IIPC Web Archiving Conference ends May 24. The conference will be hosted by the National and University Library of Croatia in Zagreb, which coincides with the 15h anniversary of the Croatian Web Archive (HAW).

For members of the Digital Preservation Coalition, the DPC Web Archiving & Preservation Task Force is inviting delegates to a meeting on July 18, in London. The meeting is free for DPC members, registration ends July 10.

IIPC Content Development Group is asking for contributions to their Climate Change Collection, and their Artificial Intelligence Collection.

Ben Els, Digital Curator at the National Library of Luxembourg, gives us a glimpse not the effort to capture the Luxembourg elections.

Seth Denbo, Director of Scholarly Communication and Digital Initiatives at the American Historical Association, strikes a cord on the challenges of scale in an article titled Data Overload.

The Atlantic has an article on the implication of AI vacuum cleaners from tech companies.

You can now read the paper presented at the 2018 World Library and Information Congress by the Library of Congress, the paper is titled Institutions as Social Media Collector: Lessons Learned from the Library of Congress.

The National Library of the Netherlands has recently launched a collection of archived websites from the Chinese Community in the Netherlands.

ECAL (École cantonale d’art de Lausanne) has launched a website called Information Mesh celebrating the 30th anniversary of the World Wide Web.

Web Archiving Roundup: April, 2019

The Library of Congress Digital Collections Development Coordinator positioncloses soon (May 1st)!

Archive-It is hosting a training webinar on Web Archiving Systems API (WASAPI). Zoom in on Wednesday, May 29that 11am Pacific, 2pm Eastern.

This one came through my Archive-It listserv this morning; a webinar about WASAPI on May 29:  https://support.archive-it.org/hc/en-us/community/posts/360043527372-New-training-webinar-The-Web-Archiving-Systems-API-WASAPI-.Register here.

Registration for the Society of American Archivists Annual Meetingis now open. 

The ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL) has published a list of accepted workshopsand tutorials. 

You can now watch Michael L. Nelson’s keynote “Web Archives at the Nexus of Good Fakes and Flawed Originals”at CNIs April meeting. 

Rhizome presents “The Art Happens Here: Net Art’s Archival Poetics,” an exhibition using web archives as art. If you happen to be in New York, you can see this exhibit at the New Museum until May 26.

The National Library of China will archive 200 billion Weibo postsin a project to preserve China’s digital heritage.

Katie Cuyler, Librarian at University of Alberta Libraries, on archiving Alberta’s climate change datain danger to disappear with the change of government. You can access the collection here.

Jason Scott, from TEXTFILES.COM, recently published The MySpace Dragon Hoard, a collection of 450,000 mp3s from MySpace between 2008-2010. These songs were gathered prior to the recent MySpace data loss.

Check out Karl Blumenthal’s trick to crawl special emojis from Twitter.

Some interesting food for thought from Stephen Dowling’s BBC article.

Web Archiving Roundup: March, 2019

Help the SAA Electronic Records Section find more about the most useful resources for the electronic records community. You can find the survey and a bit more about their project here.

Registration for Archivematica Camp in Vancouver, June 24-26, is still open.

Early bird registration for IIPC Web Archiving Conference is now open. You can also take a look at the program.

The International Journal of Digital Humanities has an article on web archiving initiatives in Europe. The article is titled Web Archives as Data Resource for Digital Scholars.

The Ivy Plus Libraries Confederation have launched the 2018 Brazilian Presidential Transition Web Archive.

Library of Congress Web Archives blog post from Jesse Johnston, Senior Digital Collections Specialist at LOC, gives a walkthrough into sorting through a set of US Government PDFs.

Celebrate the 30th anniversary of the world wide web exploring internet archives through emulated legacy browsers with Rhizome!

Another fun article to celebrate the web’s 30th anniversary looking at Australia’s ugly 90’s websites.

The National Library of Ireland recently announced their 2018 Web Archiving collection.

Web Archiving Roundup: February, 2019

You can still register for AASLH’s webinar Web Archiving: What, Why, and How, the webinar will take place on February 28 @3:00pm EST.

Archive-It will host an advanced training session on February 26 at 11:00 AM Pacific Time (US & Canada), the session will focus on Archive-It as a Reference Tool.

The National Videogame Foundation in collaboration with Bath Spa University and funded by the British Academy and Leverhulme Trust, released a White Paper titled Game Over? Curating, Preserving and Exhibiting Videogames.

Richard Ovenden, Bodley’s Librarian, has an article in The Economist about digital preservation.

Ilya Kreymer, Webrecorder Lead Developer, shares his Code4Lib 2019 presentation slides.

A new release of web crawler project Heritrix 3 is now available.

Middlebury Facebook group Middlebury Memes for Crunchy Teens to be archived by Special Collections.

PhD candidate Rhiannon Lewis writes a response to the DPC’s Briefing Day on web archiving for community and individual archives.

New version of Web Archiving Integration Layer (WAIL) for macOS is now available.

Shawn M. Jones writes a blog post for the Web Science and Digital Libraries Research Group at Old Dominion University regarding Google+ shutting down.

Stanford Libraries receives a $25 million grant to preserve Silicon Valley Archives

PANDORA, Australia’s Web Archive, initially established by the National Library of Australia celebrates its 10 year anniversary.

Web Archiving Roundup: January, 2019

Here is your first Web Archiving Roundup of 2019!

Web Archiving Roundup: December 17, 2018

Here’s your Web Archiving Roundup for December, 2018:

Continue reading “Web Archiving Roundup: December 17, 2018”

Web Archiving Roundup: November 21, 2018

Here’s your Web Archiving Roundup for November, 2018: