    Repositories list

    • JavaScript
      GNU Affero General Public License v3.0
      8721 Updated Dec 16, 2024
    • One webpage for every book ever published!
      Python
      GNU Affero General Public License v3.0
      1.4k5.3k818155 Updated Dec 15, 2024
    • React components to render differences between captures at the Wayback Machine
      JavaScript
      GNU General Public License v3.0
      83211 Updated Dec 14, 2024
    • brozzler (Public)
      brozzler - distributed browser-based web crawler
      Python
      Apache License 2.0
      996763513 Updated Dec 13, 2024
    • warcprox (Public)
      WARC writing MITM HTTP/S proxy
      Python
      55384206 Updated Dec 13, 2024
    • iiif (Public)
      The official Internet Archive IIIF service
      JavaScript
      GNU General Public License v3.0
      422173 Updated Dec 13, 2024
    • TypeScript
      GNU Affero General Public License v3.0
      00112 Updated Dec 13, 2024
    • Zeno (Public)
      State-of-the-art web crawler 🔱
      HTML
      GNU Affero General Public License v3.0
      1284265 Updated Dec 13, 2024
    • gocrawlhq (Public)
      Go client for Crawl HQ v3
      Go
      0001 Updated Dec 13, 2024
    • The Internet Archive Donation Form
      TypeScript
      04030 Updated Dec 12, 2024
    • gifcities (Public)
      gifcities.org web app
      Go
      GNU Affero General Public License v3.0
      0310 Updated Dec 11, 2024
    • TypeScript
      GNU Affero General Public License v3.0
      15213 Updated Dec 11, 2024
    • hind (Public)
      Hashistack-IN-Docker (single container with nomad + consul + caddy)
      Shell
      GNU Affero General Public License v3.0
      75600 Updated Dec 11, 2024
    • rclone (Public)
      [vault fork] of "rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Yandex Files
      Go
      MIT License
      4.3k200 Updated Dec 10, 2024
    • dyno (Public)
      JavaScript
      1400 Updated Dec 10, 2024
    • A repository of cleanup bots implementing the openlibrary-client
      Python
      Other
      4963278 Updated Dec 9, 2024
    • WARC files uploader for the Internet Archive
      Go
      GNU Affero General Public License v3.0
      0100 Updated Dec 9, 2024
    • PHP
      GNU Affero General Public License v3.0
      3312902 Updated Dec 9, 2024
    • Python
      152523 Updated Dec 6, 2024
    • Voice Apps (Actions on Google, Alexa Skill) of Internet Archive. Just say: "Ok Google, Ask Internet Archive to Play Jazz" or "Alexa, Ask Internet Archive to play Instrumental Music"
      JavaScript
      42469516 Updated Dec 6, 2024
    • iaux (Public)
      Monorepo for Archive.org UX development and prototyping.
      JavaScript
      GNU Affero General Public License v3.0
      866788143 Updated Dec 5, 2024
    • TypeScript
      GNU Affero General Public License v3.0
      0000 Updated Dec 5, 2024
    • The Internet Archive BookReader
      JavaScript
      GNU Affero General Public License v3.0
      4201k13487 Updated Dec 4, 2024
    • IAUX Typescript WebComponent Template
      TypeScript
      GNU Affero General Public License v3.0
      47311 Updated Dec 3, 2024
    • A Modal Manager WebComponent
      TypeScript
      GNU Affero General Public License v3.0
      11112 Updated Dec 3, 2024
    • heritrix3 (Public)
      Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
      Java
      Other
      7622.9k355 Updated Nov 30, 2024
    • Sparkling (Public)
      Internet Archive's Sparkling Data Processing Library
      Scala
      MIT License
      21110 Updated Nov 27, 2024
    • TypeScript
      2200 Updated Nov 27, 2024
    • Components for the IA Wayback Machine to render legacy media and data in a human-friendly fashion
      Python
      0000 Updated Nov 25, 2024
    • www (Public)
      archive.org website prototype - using only javascript static files
      JavaScript
      GNU Affero General Public License v3.0
      0200 Updated Nov 24, 2024