Library and Archives Canada Coffee Chat: Video and Slides

On Tuesday, May 10, 2022 the SAA Web Archiving Section was honored to host Library and Archives Canada’s Tom Smyth discussing web and social media preservation.

A video of the coffee chat and presenter slides will be available for a limited time.

Video

Library and Archives Canada Coffee Chat

Presenter Slides

Web and Social Media Preservation Program at the National Library and Archives Canada

Introducing the DocNow App

This week’s post was written by Zakiya Collier, Community Manager at Documenting the Now.

This week the Documenting the Now project announces the release of DocNow, an application for appraising, collecting, and gathering consent for Twitter content. DocNow reimagines the relationship between content creators and social media analysts (archivists and researchers) by addressing two of the most challenging issues of social media archiving practice—the issues of consent and appraisal.

The Documenting the Now Project is happy to release version 1.0 of our open-source tool freely for anyone to use. Read all about the app, what it does, and how to engage with the DocNow project team for support and providing feedback.

Over the last seven years, Documenting the Now has helped to foster an environment where a more broad sector of cultural memory workers can learn about web archiving tools and practices and can become involved with web archiving networks. This has largely been achieved by practicing transparency and inviting people who have traditionally been left out of established web content archiving networks into the project to participate, namely students, activists, and archivists who represent marginalized communities and who work in community-centered organizations, HBCUs, public libraries, community-based archives, and tribal libraries and archives.

Documenting the Now was a response to the need among scholars, activists, archivists, and other memory workers for new tools that would provide easily-accessible and user-friendly means to collect, visualize, analyze, and preserve web and social media content to better document public events. In addition, it aimed to respond to questions and concerns related to ethics, safety, intellectual property, and access issues for the collection, preservation, and dissemination of Twitter data in particular.

Documenting the Now has also developed community-centered web and social media archiving tools that both prioritize care for content creators and robust functionality for users:

  • Twarc – a command line tool and Python library for collecting tweet data from Twitter’s official API
  • Hydrator – a desktop application for turning Tweet ID datasets back into tweet data to use in your research
  • Social Humans – a label system to specify the terms of consent for social media content
  • The Catalog – a community-sourced clearinghouse to access and share tweet identifier datasets

In continuing to support and develop tools that embody ethical practices for social media archiving, the DocNow app joins this suite of tools. DocNow is an application for appraising, collecting, and gathering consent for Twitter content and includes several new features including:

  • Trends tab to view trending topics across the globe in real time
  • Explore tab to view content by users, media, URLs, and related hashtags all on one screen
  • Live testing and refining of collecting parameters on recent tweets
  • Tweets per hour calculator to easily identify Twitter bot accounts
  • Search and Collect tweets back in time via Search API and forwards with Stream API
  • Activate toggle to start collecting tweets and send a notification tweet to encourage transparency and communication in Twitter data collection
  • Collections tab to share information about your collection with the public
  • “Find Me” and Insights Overview features to specify and gather consent using Social Humans labels
  • Download Tweet ID archive for sharing following Twitter’s terms of service

The DocNow app also works in concert with other Documenting the Now tools, creating for users, a 4-step social media archiving journey:

Step 1: Collect content with the DocNow App by activating a search. Set collection limits and explore insights as your collection grows.
Step 2: Download your archive from the DocNow App, which includes a Tweet Viewer, Tweet IDs, and media files.
Step 3: Hydrate your Tweet IDs from the archive’s tweets.csv file back into full
tweets using DocNow’s Hydrator desktop application.
Step 4: Describe your collection and share your Tweet IDs with other researchers by adding them to the DocNow Catalog.

Ways to Use DocNow
There are 3 different ways to use DocNow including joining the community instance, running DocNow locally on a computer, and installing an instance of DocNow in the cloud. The Community Instance is a great way to get familiar with the tool before committing to running an instance but those with development skills may want to administer their own instance of DocNow locally or in the cloud.

  1. Join Documenting the Now‘s hosted community instance
  2. Run DocNow locally on your machine
  3. Install your own instance in the cloud

For help with installation and getting started, the Documenting the Now team will host community conversations. Dates will be announced soon! More information about the DocNow App can be found here.

Documenting the Now is seeking community input on all of our features as we continue to develop DocNow. Please join our slack channel by going to our website or email us at info@docnow.io.

US Government Web Archive Coffee Chat: Video and Slides

On Tuesday, March 29, 2022 the SAA Web Archiving Section was honored to host Lauren Baker and Meghan Lyon, Library of Congress; Dory Bower, Government Publishing Office; Lynda Schmitz Fuhrig, Smithsonian Institution Archives; Christie Moffatt, National Library of Medicine at the National Institute of Health. Section chair Melissa Wertheimer moderated the panel.

A video of the coffee chat and presenter slides will be available for a limited time.

Video

US Government Web Archive Coffee Chat

Presenter Slides

Web Collecting at the National Library of Medicine, presented by Christie Moffatt

Web and Social Media Archiving, presented by Lynda Schmitz Fuhrig

Library of Congress Web Archiving, presented by Meghan Lyon & Lauren Baker

Web Archiving Roundup: June, 2019

This month the Sunlight Foundation, an organization advocating for open government, published an article on the weaknesses in Federal Agencies’ archival practices.

Catch the livestream recording from the IIP Web Archiving Conference.

You can now take a look at the slides from Ian Milligan & Nick Ruest’s presentation on the Archives Unleashed Cloud Project at IIPC Web Archiving Conference.

Both paper and presentation slides on Judging Visual Correspondence in Web Archives are now available. This presentation was done by Brenda Reyes Ayala at the ACM/IEE Joint Conference on Digital Libraries (JCDL).

The Web Science and Digital Libraries Research Group at Old Dominion University has posted a trip report on the ACM/IEEE Joint Conference on Digital Libraries (JCDL).

Rhizome recently announced an opportunity for Webrecorder users and community members to financially support Webrecorder, some of the benefits include full range of tools and expanded storage for web collections.

Check out this i-D magazine article on the case for archiving our digital lives.

Web Archiving Roundup: May, 2019

UPDATE – Join the ALCTS Metadata Interest Group Meeting during ALA Annual 2019 for a presentation and Q&A on the Library of Congress Web Archiving Program on Sunday, June 23, 2019, 9:00-10:00AM at the Marriott Marquis.

Now accepting nominations for the SAA Web Archiving Section’s 2019-2020 Steering Committee: https://www2.archivists.org/groups/web-archiving-section/now-accepting-2019-2020-steering-committee-nominations.

Registration for the IIPC Web Archiving Conference ends May 24. The conference will be hosted by the National and University Library of Croatia in Zagreb, which coincides with the 15h anniversary of the Croatian Web Archive (HAW).

For members of the Digital Preservation Coalition, the DPC Web Archiving & Preservation Task Force is inviting delegates to a meeting on July 18, in London. The meeting is free for DPC members, registration ends July 10.

IIPC Content Development Group is asking for contributions to their Climate Change Collection, and their Artificial Intelligence Collection.

Ben Els, Digital Curator at the National Library of Luxembourg, gives us a glimpse not the effort to capture the Luxembourg elections.

Seth Denbo, Director of Scholarly Communication and Digital Initiatives at the American Historical Association, strikes a cord on the challenges of scale in an article titled Data Overload.

The Atlantic has an article on the implication of AI vacuum cleaners from tech companies.

You can now read the paper presented at the 2018 World Library and Information Congress by the Library of Congress, the paper is titled Institutions as Social Media Collector: Lessons Learned from the Library of Congress.

The National Library of the Netherlands has recently launched a collection of archived websites from the Chinese Community in the Netherlands.

ECAL (École cantonale d’art de Lausanne) has launched a website called Information Mesh celebrating the 30th anniversary of the World Wide Web.

Web Archiving Roundup: February, 2019

You can still register for AASLH’s webinar Web Archiving: What, Why, and How, the webinar will take place on February 28 @3:00pm EST.

Archive-It will host an advanced training session on February 26 at 11:00 AM Pacific Time (US & Canada), the session will focus on Archive-It as a Reference Tool.

The National Videogame Foundation in collaboration with Bath Spa University and funded by the British Academy and Leverhulme Trust, released a White Paper titled Game Over? Curating, Preserving and Exhibiting Videogames.

Richard Ovenden, Bodley’s Librarian, has an article in The Economist about digital preservation.

Ilya Kreymer, Webrecorder Lead Developer, shares his Code4Lib 2019 presentation slides.

A new release of web crawler project Heritrix 3 is now available.

Middlebury Facebook group Middlebury Memes for Crunchy Teens to be archived by Special Collections.

PhD candidate Rhiannon Lewis writes a response to the DPC’s Briefing Day on web archiving for community and individual archives.

New version of Web Archiving Integration Layer (WAIL) for macOS is now available.

Shawn M. Jones writes a blog post for the Web Science and Digital Libraries Research Group at Old Dominion University regarding Google+ shutting down.

Stanford Libraries receives a $25 million grant to preserve Silicon Valley Archives

PANDORA, Australia’s Web Archive, initially established by the National Library of Australia celebrates its 10 year anniversary.

Web Archiving Roundup: October 23, 2018

Yes, we’re back! (you’re welcome)

It’s been a while since our last roundup so we thought it would be a good idea to do a quick recap. The SAA Web Archiving Section recently held elections for the Steering Committee for vice chair, secretary, education coordinator, communications manager, and student member. The newly appointed members are:

Emily Ward, Digital Archivist, East Baton Rouge Parish Library
Vice Chair/Chair-Elect

Julia Corrin, University Archivist, Carnegie Mellon University
Education Coordinator

Kelsey O’Connell, Digital Archivist, Northwestern University
Secretary

Elisa Landaverde, Metadata Librarian, Michigan State University
Communications Manager

Louise LoBello, Archives Student, Rutgers University
Student Member

We are currently working on updating our microsite so stay tuned for our new look.

And now for our October roundup, here are some web archiving highlights:

Weekly web archiving roundup: September 10, 2015

Weekly web archiving roundup for the week of September 10, 2015:

Letter to Facebook

On Wednesday, August 5 SAA President Kathleen Roe sent a letter to Mark Zuckerberg and the Facebook Board of Directors urging them to “add functionality that allows a page owner to download the contents of his/her page(s). This would allow archival institutions to accept donations of this material…” This letter was drafted and edited by the Web Archiving Roundtable’s membership and steering committee. Please read the letter attached below.

Letter to Facebook_SAA