The original was posted on /r/selfhosted by /u/biolds on 2025-01-31 08:30:07+00:00.


Hey everyone! We’re excited to announce the release of SOSSE v1.12.0, the latest version of our open-source web archiving software, crawler, and search engine.

For those unfamiliar, SOSSE (Selenium Open Source Search Engine) lets you:

  • 🔍 Search web page content, including JavaScript-rendered pages
  • 🕵️ Crawl sites at regular intervals and detect content updates
  • 📥 Download files in bulk from web pages
  • 📑 Archive pages with local assets for offline access
  • 🔔 Monitor websites and generate Atom feeds for new content
  • 🔒 Authenticate to access private content

📖 Full docs:

🐙 GitHub:

🦊 GitLab:

💬 Join us on Discord:

📢 We Need Your Input!

We’re running a short survey to help us prioritize new features and gauge interest in professional support. If you’ve used SOSSE or are interested in trying it, please take a moment to fill it out:

➡️ https://framaforms.org/202502-sosse-survey-1738309561

Your feedback is invaluable! Let us know what you think about v1.12.0! 🚀