Scraperr: A self-hosted web scraping solution with XPath-based extraction, queue management, domain spidering, and data export.
Jina AI offers best-in-class embeddings, rerankers, web crawler scraper, deepsearch, and small LMs for multilingual and multimodal data.