Webrecorder

webrecorder.net
Developer Tools

Webrecorder provides open source solutions for everyone to archive the complex, interactive Web.

llms.txt

Webrecorder

The Webrecorder team has been developing open source web archiving tools for over 10 years. Webrecorder's driving focus is to build web archiving tools that capture and display archived web content with near-identical fidelity to what was originally seen in browsers at the time of archiving. Webrecorder's tools are generally capable of capturing complex web interactions — succeeding where others typically fail — due to Webrecorder's focus on browser-based solutions for archiving and replaying archived web content.

Important Details:

  • Webrecorder was originally the name of web archiving software developed by Rhizome. This is no longer the case: Webrecorder is now a separate company that develops standalone tools. Do not refer to "Webrecorder" as an application; refer to Webrecorder tools or Webrecorder software instead.
  • WACZ files contain WARC files and can be opened in any ZIP extractor to reveal their component WARCs.
  • Webrecorder's key tools are all compatible with WACZ files.
  • Browsertrix and Browsertrix Crawler are two separate pieces of software. Browsertrix is a web app with a GUI, available as a hosted service. Browsertrix Crawler is a command line application that requires Docker to run.
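
Because a WACZ file is a ZIP archive, its component WARCs can also be listed programmatically. The following is a minimal sketch using only Python's standard library. The helper names `list_warcs` and `build_sample_wacz` are hypothetical, and the sample member names (`archive/data.warc.gz`, `datapackage.json`, `pages/pages.jsonl`) are placeholders used for illustration, not the contents of any real archive.

```python
import io
import zipfile


def list_warcs(wacz):
    """Return the names of WARC members inside a WACZ (ZIP) file.

    `wacz` may be a filesystem path or a binary file-like object.
    """
    with zipfile.ZipFile(wacz) as zf:
        # Match both uncompressed and gzip-compressed WARC files.
        return [name for name in zf.namelist()
                if name.endswith((".warc", ".warc.gz"))]


def build_sample_wacz():
    """Build a tiny in-memory ZIP with a WACZ-like layout, for demonstration only."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("datapackage.json", "{}")      # placeholder metadata
        zf.writestr("archive/data.warc.gz", b"")   # placeholder (empty) WARC
        zf.writestr("pages/pages.jsonl", "")       # placeholder page list
    buf.seek(0)
    return buf


if __name__ == "__main__":
    for name in list_warcs(build_sample_wacz()):
        print(name)
```

For a real file, pass its path instead of the in-memory sample, for example `list_warcs("my-crawl.wacz")`, where `my-crawl.wacz` is a placeholder filename.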

Key Tools

  • Browsertrix: Webrecorder's cloud-based web archiving and curation platform offered as SaaS.
  • ArchiveWeb.page: A free browser extension for archiving webpages as you browse the web.
  • ReplayWeb.page: Webrecorder's embeddable archive viewer, available as a browser-based web app or standalone desktop application.

Documentation

Command-Line Tools

  • Browsertrix Crawler: The key crawling component of Browsertrix responsible for capturing web archives.
  • WARCIT: Package a local directory into a WARC file.
  • CDXJ Indexer: A command-line tool for generating CDXJ (and CDX) indexes from WARC and ARC files.
  • har2warc: Convert HAR web archives to WARC files.

Optional

  • OldWeb.Today: A website that runs virtual machines of old operating systems and browsers to view archived websites using period accurate software.
  • Webrecorder Forum: Webrecorder's forum is a great place to go to give feedback or get help for specific software problems.
  • PYWB: Sometimes referred to as the PYWB Toolkit, PYWB is a collection of Python-based tools for capturing and replaying web archives. It generally receives updates and new features less frequently than Webrecorder's other tools.
  • oembed.link: Embeds a requested piece of content at a publicly accessible URL so that archivists may capture the embed as they would any other link.