Web archiving roundup: February 14, 2016

Happy Valentine’s Day! Here’s your web archiving roundup for February 14, 2016:

  • GDELT + Internet Archive’s Collaboration To Archive The World’s Online Journalism: GDELT, global news coverage, and the Internet Archive’s “No More 404” program.
  • A new, free tool that’s like x-ray glasses for political ads: The Internet Archive’s Political TV Ad Archive will house all the presidential ads expected to air in eight battleground states during the primaries. Plus, fact-checking!
  • Announcing Archive-It 5.0! What’s new in Archive-It’s version 5.0.
  • State of the WARC–Our Digital Preservation Survey Results: The takeaways from Archive-It’s June 2015 survey of local digital preservation activities involving WARC files.
  • Emulating Digital Art Works: A critique of Oya Rieger and Tim Murray’s recent white paper, Preserving and Emulating Digital Art Objects.
  • Compute Canada Support: “Web Archives for Longitudinal Knowledge”Breaking down the silos in Canadian web archiving.
  • On the Road: Some Upcoming Lectures and TalksIan Milligan’s upcoming slate of lectures on digital humanities/digital history/web archiving.
  • To ZIP or not to ZIP, that is the (web archiving) question: What trade-offs are made when we compress (or don’t compress) web archive files?
  • January 2016 Federal Cloud Computing Summit: An overview.
Advertisements

Web archiving roundup: January 22, 2016

Here’s your web archiving roundup for January 22, 2016!

  • Guest post–Ilya Kreymer on oldweb.todayIlya Kreymer explains how oldweb.today works.
  • The Internet is for CatsIf the most important content genre on the Internet is cat videos, how did the Internet work back when there was no video?
  • Political TV ad archive preserves lies for the agesThe Internet Archive will help you call out politicians who stretch the truth.
  • BowieNet: How David Bowie’s ISP foresaw the future of the internet.
  • The Top 10 Blog Posts of 2015 on The SignalIn case you missed them, here are the most popular posts from the Library of Congress’s digital preservation blog.
  • Rhizome Awarded $600,000 by The Andrew W. Mellon Foundation to build Webrecorder, a tool to archive the dynamic web.
  • Web Archives, Performance & CaptureChristie Peterson shares her talk from Web Archives 2015.
  • ‘From Clay to the Cloud’ examines human record: Museum exhibit urges us to consider the cultural record we create through the Internet and how that record is preserved.
  • Survey: How Do You Approach Web Archiving?Do you have fifteen minutes to tell the National Digital Stewardship Alliance about your organization’s web archiving activities?

 

Weekly web archiving roundup: January 10, 2016

Happy new year, Roundtablers! Here is the weekly web archiving roundup for January 10, 2016!

  • Review of WS-DL’s 2015: The Web Science and Digital Libraries Research Group revisit their accomplishments in 2015.
  • CNI Fall 2015 Membership Meeting Trip Report: An overview of the Coalition for Networked Information’s 2015 Fall Meeting.
  • Memento–Help Us Route URI Lookups to the Right Archives: An IIPC funded Archive Profiling project attempts to create a high level summary of the holdings of each web archive.
  • IIPC Co-Chair Cathy Hartman Retires: The IIPC bids a fond farewell to Cathy Hartman.
  • Aggregating Web Archives: Even small Web archives can make a contribution.
  • Why Not Store It All? Website bloat and the dangers of digital storage.

Weekly web archiving roundup: December 20, 2015

Here’s your weekly web archiving roundup for December 20, 2015!

  • Web Archiving–An Overview: The Metropolitan New York Library Council announces the first in a series of webinars on web archiving.
  • These Old-School Internet Browsers Are Like Real-Life Time Machines: A new tool lets you experience the glory—and embarrassment—of the internet of yore.
  • Browsing the ancient Web with an ancient browser: Nicholas Taylor shares some findings after browsing oldweb.today.
  • Questions of ethics at Web Archives 2015: Despite diverse perspectives on web archiving, ethics seemed to be a persistent subject.

Weekly web archiving roundup: December 13, 2015

Here’s the weekly web archiving roundup for December 13, 2015:

  • The Internet Archive is hosting a telethon! An actual Telethon, hosted and run by Internet Archive employees, in front of a live audience!
  • From Dataverse to Gephi: Network Analysis on our Data, A Step-by-Step WalkthroughReleasing data is only useful if we show people how they can use it.
  • Acquiring at Digital Scale–Harvesting the StoryCorps.me CollectionMeeting the challenge of acquiring tens of thousands of interviews at a time thanks to the ability to harvest them via the web.
  • The Internet Is for Humans, Not RobotsA new study finds people outnumber bots online for the first time in four years. But a closer inspection of the data reveals a more complicated picture of what’s happening on the web.
  • Evaluating the Temporal Coherence of Composite MementosOnly one in five archived web pages existed as presented.

Weekly web archiving roundup: December 5, 2015

Here is the weekly web archiving roundup for December 5, 2015:

  • Data Storage on DNA Can Keep It Safe for Centuries: Recent advances suggest there may be a new way to store the exploding amount of computer data–and for centuries, rather than decades.
  • Building an archive on the Moon (and doing science, too): In theory, an extraterrestrial data archive will pay for some unique science.
  • Recreate the old-school internet with this web browser emulator: Oldweb.today not only shows ancient websites, but lets you visit them with ancient browsers.
  • Why It’s So Important To Understand What’s In Our Web Archives: It is simply impossible to archive the “entire internet” and perfectly preserve every change to every page in existence.
  • IHR workshop on web archiving: An Introduction to Web Archiving for Historians.
  • People, communities and platforms–Digital cultural heritage and the web: Trevor Owens’s opening keynote for the National Digital Forum in New Zealand.

Weekly web archiving roundup: November 29, 2015

Here is the weekly web archiving roundup for November 29, 2015:

  • Massive, 4,000-page Infocom design and biz archive posted online: This is big news, in the realm of game design studies.
  • 2016 IIPC General Assembly & Web Archiving Conference: The IIPC is seeking proposals for presentations and workshops for the 2016 IIPC Web Archiving Conference and for the IIPC General Assembly.
  • Five Takeaways from AOIR 2015: Eyewitness report from the annual Association of Internet Researchers (AOIR) conference.
  • Institutional vs. Twitter Seed Lists for Web Archives: What would a web archive created using the tweets of users look like compared to a formal collection, curated by a subject librarian? And how much of it would be in the Wayback Machine?
  • Using Warcbase with a Spark Notebook: What it is, and how to set it up.

Weekly web archiving roundup: November 22, 2015

Weekly web archiving roundup for November 22, 2015:

  • How much of the Internet does the Wayback Machine really archive? Surprisingly little is known about what exactly is in the Internet Archive’s Wayback Machine.
  • You get what you get and you don’t get upset: David Rosenthal’s take on the IIPC mail alias discussion of the Forbes article about the Wayback Machine.
  • The Provenance of Web Archives: The UK Web Archive considers how we should document our crawls.
  • The Irony of Writing Online About Digital Preservation: Last month, The Atlantic published a lengthy article about information that is lost on the web. That story itself is in jeopardy.
  • Workshop on Missing Warc Features: Considering a session on crawl artifacts that don’t yet fit in WARCs for #iipcGA16, and looking for co-presenters.

Weekly web archiving roundup: November 14, 2015

Weekly web archiving roundup for November 14, 2015:

  • Plan Your Digital Legacy, and Update Often: “As more wealth moves into the cloud, good luck tracking this stuff.”
  • How the Internet Archive maintains an information super highway: An interview with the director of media and access at Internet Archive, Alexis Rossi, and Vicky Brasseur, a volunteer maintaining the Internet Archive S3 API documentation.
  • New DAS Web Archiving Webinar Begins Streaming November 16: This course introduces the core concepts of web archiving and provides a firm foundation for further expansion of skill sets.
  • WAS to Archive-It Migration Update: The Web Archiving Service (WAS) migration to Internet Archive’s Archive-It Service reached two major milestones last week.
  • Thoughts from Partner staff about web archiving: An overview of a recent OCLC Research Library Partnership survey investigating the needs of the web archiving community.
  • Post Firewall–“Mining the Internet Graveyard: Rethinking the Historians’ Toolkit”: Newly open access article argues that the advent of a massive quantity of born-digital historical sources necessitates a rethinking of the historians’ toolkit.