Scraperr: A self-hosted web scraping solution with XPath-based extraction, queue management, domain spidering, and data export.
A powerful self-hosted web scraping solution that allows you to scrape websites without writing a single line of code.
📚 Check out the docs for a comprehensive quickstart guide and detailed information.
✨ Key Features:
🚀 Getting Started:
make up
Refer to the docs for Helm deployment: https://scraperr-docs.pages.dev/guides/helm-deployment
⚖️ Legal and Ethical Guidelines:
When using Scraperr, please remember to:
robots.txt: Always check a website's robots.txt file to verify which pages permit scraping.
Disclaimer: Scraperr is intended for use only on websites that explicitly permit scraping. The creator accepts no responsibility for misuse of this tool.
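As a rough illustration of the robots.txt check described above, here is a minimal sketch using Python's standard-library urllib.robotparser. It is not part of Scraperr. The helper name is_allowed, the user agent "my-scraper", and the sample robots.txt content are all assumptions made for this example.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url.

    Hypothetical helper for illustration; not part of Scraperr.
    """
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt content, used in place of a live download.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""

if __name__ == "__main__":
    for url in (
        "https://example.com/blog/post",
        "https://example.com/private/data",
    ):
        print(url, is_allowed(SAMPLE_ROBOTS_TXT, "my-scraper", url))
```

To check a live site instead, RobotFileParser also provides set_url() and read() to download the file before calling can_fetch(). A permissive robots.txt does not by itself mean the site's terms of service allow scraping.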
💬 Join the Community:
Get support, report bugs, and chat with other users and contributors.
📄 License:
This project is licensed under the MIT License. See the LICENSE file for details.
👏 Contributions:
Development made easier with the webapp template.
To get started, simply run make build up-dev.