Happy Valentine’s Day! Here’s your web archiving roundup for February 14, 2016:
- GDELT + Internet Archive’s Collaboration To Archive The World’s Online Journalism: GDELT, global news coverage, and the Internet Archive’s “No More 404” program.
- A new, free tool that’s like x-ray glasses for political ads: The Internet Archive’s Political TV Ad Archive will house all the presidential ads expected to air in eight battleground states during the primaries. Plus, fact-checking!
- Announcing Archive-It 5.0! What’s new in Archive-It’s version 5.0.
- State of the WARC–Our Digital Preservation Survey Results: The takeaways from Archive-It’s June 2015 survey of local digital preservation activities involving WARC files.
- Emulating Digital Art Works: A critique of Oya Rieger and Tim Murray’s recent white paper, Preserving and Emulating Digital Art Objects.
- Compute Canada Support: “Web Archives for Longitudinal Knowledge”—Breaking down the silos in Canadian web archiving.
- On the Road: Some Upcoming Lectures and Talks—Ian Milligan’s upcoming slate of lectures on digital humanities/digital history/web archiving.
- To ZIP or not to ZIP, that is the (web archiving) question: What trade-offs are made when we compress (or don’t compress) web archive files?
- January 2016 Federal Cloud Computing Summit: An overview.
Here’s your web archiving roundup for January 22, 2016!
- Guest post–Ilya Kreymer on oldweb.today: Ilya Kreymer explains how oldweb.today works.
- The Internet is for Cats: If the most important content genre on the Internet is cat videos, how did the Internet work back when there was no video?
- Political TV ad archive preserves lies for the ages: The Internet Archive will help you call out politicians who stretch the truth.
- BowieNet: How David Bowie’s ISP foresaw the future of the internet.
- The Top 10 Blog Posts of 2015 on The Signal: In case you missed them, here are the most popular posts from the Library of Congress’s digital preservation blog.
- Rhizome Awarded $600,000 by The Andrew W. Mellon Foundation to build Webrecorder, a tool to archive the dynamic web.
- Web Archives, Performance & Capture: Christie Peterson shares her talk from Web Archives 2015.
- ‘From Clay to the Cloud’ examines human record: Museum exhibit urges us to consider the cultural record we create through the Internet and how that record is preserved.
- Survey: How Do You Approach Web Archiving?—Do you have fifteen minutes to tell the National Digital Stewardship Alliance about your organization’s web archiving activities?
Happy new year, Roundtablers! Here is the weekly web archiving roundup for January 10, 2016!
- Review of WS-DL’s 2015: The Web Science and Digital Libraries Research Group revisit their accomplishments in 2015.
- CNI Fall 2015 Membership Meeting Trip Report: An overview of the Coalition for Networked Information’s 2015 Fall Meeting.
- Memento–Help Us Route URI Lookups to the Right Archives: An IIPC funded Archive Profiling project attempts to create a high level summary of the holdings of each web archive.
- IIPC Co-Chair Cathy Hartman Retires: The IIPC bids a fond farewell to Cathy Hartman.
- Aggregating Web Archives: Even small Web archives can make a contribution.
- Why Not Store It All? Website bloat and the dangers of digital storage.
Here’s your weekly web archiving roundup for December 20, 2015!
- Web Archiving–An Overview: The Metropolitan New York Library Council announces the first in a series of webinars on web archiving.
- These Old-School Internet Browsers Are Like Real-Life Time Machines: A new tool lets you experience the glory—and embarrassment—of the internet of yore.
- Browsing the ancient Web with an ancient browser: Nicholas Taylor shares some findings after browsing oldweb.today.
- Questions of ethics at Web Archives 2015: Despite diverse perspectives on web archiving, ethics seemed to be a persistent subject.
The results are in: Rachel Trent is the new Web Archiving Roundtable Secretary for 2015-2016. Congratulations, Rachel!
Here’s the weekly web archiving roundup for December 13, 2015:
- The Internet Archive is hosting a telethon! An actual Telethon, hosted and run by Internet Archive employees, in front of a live audience!
- From Dataverse to Gephi: Network Analysis on our Data, A Step-by-Step Walkthrough: Releasing data is only useful if we show people how they can use it.
- Acquiring at Digital Scale–Harvesting the StoryCorps.me Collection: Meeting the challenge of acquiring tens of thousands of interviews at a time thanks to the ability to harvest them via the web.
- The Internet Is for Humans, Not Robots: A new study finds people outnumber bots online for the first time in four years. But a closer inspection of the data reveals a more complicated picture of what’s happening on the web.
- Evaluating the Temporal Coherence of Composite Mementos: Only one in five archived web pages existed as presented.
Here is the weekly web archiving roundup for December 5, 2015:
- Data Storage on DNA Can Keep It Safe for Centuries: Recent advances suggest there may be a new way to store the exploding amount of computer data–and for centuries, rather than decades.
- Building an archive on the Moon (and doing science, too): In theory, an extraterrestrial data archive will pay for some unique science.
- Recreate the old-school internet with this web browser emulator: Oldweb.today not only shows ancient websites, but lets you visit them with ancient browsers.
- Why It’s So Important To Understand What’s In Our Web Archives: It is simply impossible to archive the “entire internet” and perfectly preserve every change to every page in existence.
- IHR workshop on web archiving: An Introduction to Web Archiving for Historians.
- People, communities and platforms–Digital cultural heritage and the web: Trevor Owens’s opening keynote for the National Digital Forum in New Zealand.
Here is the weekly web archiving roundup for November 29, 2015:
- Massive, 4,000-page Infocom design and biz archive posted online: This is big news, in the realm of game design studies.
- 2016 IIPC General Assembly & Web Archiving Conference: The IIPC is seeking proposals for presentations and workshops for the 2016 IIPC Web Archiving Conference and for the IIPC General Assembly.
- Five Takeaways from AOIR 2015: Eyewitness report from the annual Association of Internet Researchers (AOIR) conference.
- Institutional vs. Twitter Seed Lists for Web Archives: What would a web archive created using the tweets of users look like compared to a formal collection, curated by a subject librarian? And how much of it would be in the Wayback Machine?
- Using Warcbase with a Spark Notebook: What it is, and how to set it up.
Weekly web archiving roundup for November 22, 2015:
- How much of the Internet does the Wayback Machine really archive? Surprisingly little is known about what exactly is in the Internet Archive’s Wayback Machine.
- You get what you get and you don’t get upset: David Rosenthal’s take on the IIPC mail alias discussion of the Forbes article about the Wayback Machine.
- The Provenance of Web Archives: The UK Web Archive considers how we should document our crawls.
- The Irony of Writing Online About Digital Preservation: Last month, The Atlantic published a lengthy article about information that is lost on the web. That story itself is in jeopardy.
- Workshop on Missing Warc Features: Considering a session on crawl artifacts that don’t yet fit in WARCs for #iipcGA16, and looking for co-presenters.
Weekly web archiving roundup for November 14, 2015:
- Plan Your Digital Legacy, and Update Often: “As more wealth moves into the cloud, good luck tracking this stuff.”
- How the Internet Archive maintains an information super highway: An interview with the director of media and access at Internet Archive, Alexis Rossi, and Vicky Brasseur, a volunteer maintaining the Internet Archive S3 API documentation.
- New DAS Web Archiving Webinar Begins Streaming November 16: This course introduces the core concepts of web archiving and provides a firm foundation for further expansion of skill sets.
- WAS to Archive-It Migration Update: The Web Archiving Service (WAS) migration to Internet Archive’s Archive-It Service reached two major milestones last week.
- Thoughts from Partner staff about web archiving: An overview of a recent OCLC Research Library Partnership survey investigating the needs of the web archiving community.
- Post Firewall–“Mining the Internet Graveyard: Rethinking the Historians’ Toolkit”: Newly open access article argues that the advent of a massive quantity of born-digital historical sources necessitates a rethinking of the historians’ toolkit.