Web Archiving Roundup: May, 2019

UPDATE – Join the ALCTS Metadata Interest Group Meeting during ALA Annual 2019 for a presentation and Q&A on the Library of Congress Web Archiving Program on Sunday, June 23, 2019, 9:00-10:00AM at the Marriott Marquis.

Now accepting nominations for the SAA Web Archiving Section’s 2019-2020 Steering Committee: https://www2.archivists.org/groups/web-archiving-section/now-accepting-2019-2020-steering-committee-nominations.

Registration for the IIPC Web Archiving Conference ends May 24. The conference will be hosted by the National and University Library of Croatia in Zagreb, which coincides with the 15h anniversary of the Croatian Web Archive (HAW).

For members of the Digital Preservation Coalition, the DPC Web Archiving & Preservation Task Force is inviting delegates to a meeting on July 18, in London. The meeting is free for DPC members, registration ends July 10.

IIPC Content Development Group is asking for contributions to their Climate Change Collection, and their Artificial Intelligence Collection.

Ben Els, Digital Curator at the National Library of Luxembourg, gives us a glimpse not the effort to capture the Luxembourg elections.

Seth Denbo, Director of Scholarly Communication and Digital Initiatives at the American Historical Association, strikes a cord on the challenges of scale in an article titled Data Overload.

The Atlantic has an article on the implication of AI vacuum cleaners from tech companies.

You can now read the paper presented at the 2018 World Library and Information Congress by the Library of Congress, the paper is titled Institutions as Social Media Collector: Lessons Learned from the Library of Congress.

The National Library of the Netherlands has recently launched a collection of archived websites from the Chinese Community in the Netherlands.

ECAL (École cantonale d’art de Lausanne) has launched a website called Information Mesh celebrating the 30th anniversary of the World Wide Web.

Advertisements

Web Archiving Roundup: April, 2019

The Library of Congress Digital Collections Development Coordinator positioncloses soon (May 1st)!

Archive-It is hosting a training webinar on Web Archiving Systems API (WASAPI). Zoom in on Wednesday, May 29that 11am Pacific, 2pm Eastern.

This one came through my Archive-It listserv this morning; a webinar about WASAPI on May 29:  https://support.archive-it.org/hc/en-us/community/posts/360043527372-New-training-webinar-The-Web-Archiving-Systems-API-WASAPI-.Register here.

Registration for the Society of American Archivists Annual Meetingis now open. 

The ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL) has published a list of accepted workshopsand tutorials. 

You can now watch Michael L. Nelson’s keynote “Web Archives at the Nexus of Good Fakes and Flawed Originals”at CNIs April meeting. 

Rhizome presents “The Art Happens Here: Net Art’s Archival Poetics,” an exhibition using web archives as art. If you happen to be in New York, you can see this exhibit at the New Museum until May 26.

The National Library of China will archive 200 billion Weibo postsin a project to preserve China’s digital heritage.

Katie Cuyler, Librarian at University of Alberta Libraries, on archiving Alberta’s climate change datain danger to disappear with the change of government. You can access the collection here.

Jason Scott, from TEXTFILES.COM, recently published The MySpace Dragon Hoard, a collection of 450,000 mp3s from MySpace between 2008-2010. These songs were gathered prior to the recent MySpace data loss.

Check out Karl Blumenthal’s trick to crawl special emojis from Twitter.

Some interesting food for thought from Stephen Dowling’s BBC article.

Web Archiving Roundup: March, 2019

Help the SAA Electronic Records Section find more about the most useful resources for the electronic records community. You can find the survey and a bit more about their project here.

Registration for Archivematica Camp in Vancouver, June 24-26, is still open.

Early bird registration for IIPC Web Archiving Conference is now open. You can also take a look at the program.

The International Journal of Digital Humanities has an article on web archiving initiatives in Europe. The article is titled Web Archives as Data Resource for Digital Scholars.

The Ivy Plus Libraries Confederation have launched the 2018 Brazilian Presidential Transition Web Archive.

Library of Congress Web Archives blog post from Jesse Johnston, Senior Digital Collections Specialist at LOC, gives a walkthrough into sorting through a set of US Government PDFs.

Celebrate the 30th anniversary of the world wide web exploring internet archives through emulated legacy browsers with Rhizome!

Another fun article to celebrate the web’s 30th anniversary looking at Australia’s ugly 90’s websites.

The National Library of Ireland recently announced their 2018 Web Archiving collection.

Web Archiving Roundup: February, 2019

You can still register for AASLH’s webinar Web Archiving: What, Why, and How, the webinar will take place on February 28 @3:00pm EST.

Archive-It will host an advanced training session on February 26 at 11:00 AM Pacific Time (US & Canada), the session will focus on Archive-It as a Reference Tool.

The National Videogame Foundation in collaboration with Bath Spa University and funded by the British Academy and Leverhulme Trust, released a White Paper titled Game Over? Curating, Preserving and Exhibiting Videogames.

Richard Ovenden, Bodley’s Librarian, has an article in The Economist about digital preservation.

Ilya Kreymer, Webrecorder Lead Developer, shares his Code4Lib 2019 presentation slides.

A new release of web crawler project Heritrix 3 is now available.

Middlebury Facebook group Middlebury Memes for Crunchy Teens to be archived by Special Collections.

PhD candidate Rhiannon Lewis writes a response to the DPC’s Briefing Day on web archiving for community and individual archives.

New version of Web Archiving Integration Layer (WAIL) for macOS is now available.

Shawn M. Jones writes a blog post for the Web Science and Digital Libraries Research Group at Old Dominion University regarding Google+ shutting down.

Stanford Libraries receives a $25 million grant to preserve Silicon Valley Archives

PANDORA, Australia’s Web Archive, initially established by the National Library of Australia celebrates its 10 year anniversary.

Web Archiving Roundup: January, 2019

Here is your first Web Archiving Roundup of 2019!

Web Archiving Roundup: December 17, 2018

Here’s your Web Archiving Roundup for December, 2018:

Continue reading “Web Archiving Roundup: December 17, 2018”

Web Archiving Roundup: November 21, 2018

Here’s your Web Archiving Roundup for November, 2018:

Webinar : An Introduction to Web Archive APIs

Jefferson Bailey, Director, Web Archiving Programs at the Internet Archive will be presenting the first webinar of 2017 for the Web Archiving Section of the Society of American Archivists.

Description: This webinar will provide a basic introduction to the many existing, and emergent, APIs specific to web archives and web archiving. Topics covered will include an overview of the role of APIs in the web archiving lifecycle, examples of APIs that exist for querying public web archives, and examples of collection and content specific APIs for use by curators and researchers. The webinar will demonstrate some basic examples for querying APIs and associated tools. Lastly, the webinar will present the work of the IMLS-funded WASAPI project (Web Archiving Systems APIs) which is developing APIs for the exchange of preservation web data and exploring models for API-based systems interoperability in web archiving.

Day:  March 8, 2017

Time:  1pmEST/12pm Central/10am PST

Where: Online via WebEx

If you are interested in attending the webinar, we ask that you RSVP via this online form so that we can plan accordingly.  We will send registered attendees a link to access the webinar in advance of March 8, 2017.

Web Archiving Roundtable professional development

May 21, 2-3pm (EDT)/11am-noon (PDT), two co-authors,  Alexis Antracoli, Records Management Archivist at Drexel University and Kristen Yarmey, Associate Professor and  Digital Services Librarian at the University of Scranton will share their experiences and engage in discussion about their web archiving projects.  The work they will be talking about is covered in “Capture All the URLs: First Steps in Web Archiving” (http://palrap.pitt.edu/ojs/index.php/palrap/article/view/67).

Kristen will discuss her and her colleagues’ first steps in web archiving at  the University of Scranton, including making the case to campus stakeholders, finding funding, choosing Archive-It as well as selecting content and seeds to capture. Alexis will talk about establishing policies and implementing QA procedures.  Both Alexis and Kristen will provide insights on stumbling blocks, lessons learned, and future plans.  Plenty of time will be allotted for questions and discussion.

The link to access the webinar (via Blackboard Collaborate) is: https://learn.dcollege.net/webapps/bb-collaborate-BBLEARN/launchSession/guest?uid=9822dd73-36fe-433f-9451-ae654c703387.  It might be necessary to install some software before getting full access to the webinar but once the required steps are taken, one will see  a phone number and access code that will allow access to the audio portion of the session.

May 22, noon-1pm (EDT)/9-10am (PDT), Martin Klein, Programmer/Analyst at UCLA Research Library, will present on an article he co-authored, “Scholarly Context Not Found: One in Five Articles Suffers from Reference Rot” (DOI:10.1371/journal.pone.0115253).  This presentation was originally scheduled for May 15 but Google Hangout failed us that day so we are trying a new platform (Adobe Connect).  The link to access the webinar is: https://webconf.vc.dfn.de/e-economy

The goal for this presentation is to share the insights in this article with archivists with an interest in web archiving but who might not feel like they have enough technical background to follow some of the finer points of the paper.  We are looking forward to having the information in this paper shared widely including with people who might not get a chance to read the whole paper.  Martin’s talk will be 35-45 minutes followed by 15-20 minutes for questions.

We encourage you to sign in to these webinars about 10 minutes before they are scheduled to start to ensure all system requirements are met.

Archiving the Social Web at MARAC this Friday

If you’re attending MARAC, check out this session on Friday, April 26, 4:15-5:45 p.m.

Download the program here: http://www.marac.info/upcoming-conferences

DESCRIPTION:

Web 2.0 technologies are creating new opportunities for social networking. Some records created by these technologies have archival or research value requiring their preservation. However, there are many challenges associated with this process. This session will explore how some archives have made an effort to preserve Facebook, Twitter and YouTube records so they are accessible to researchers.

CHAIR:

Bonnie Weddle, New York State Archives

SPEAKERS:

Kristine Hanna, Internet Archive

Michael Martin, New York State Archives

Lynda Schmitz Fuhrig, Smithsonian Institution Archives