Scraperr
Self-hosted no-code web scraping platform

Scraperr is a self-hosted web scraping solution that lets you scrape websites from a web interface without writing code. It focuses on repeatable scraping jobs with structured results, exports, and optional crawling within a domain.
Key Features
- No-code web UI for creating and managing scraping jobs
- XPath-based extraction for precise element targeting
- Queue management to submit and run multiple scraping jobs
- Optional domain spidering to crawl and scrape pages within a site
- Custom request headers provided as JSON
- Media downloads for images, videos, and other assets
- Results visualization in a structured table view
- Export scraped data to CSV and Markdown
- Completion notifications via supported channels
Use Cases
- Collect product, directory, or listing data for internal analysis
- Crawl and extract structured content from documentation or knowledge sites
- Download and catalog media assets from permitted web sources
Limitations and Considerations
- Uses browser automation; large crawls can be resource-intensive and may require careful rate limiting
- Scraping capability and reliability depend on target site complexity and anti-bot measures
Scraperr fits teams and individuals who want a practical, UI-driven scraper they can run on their own infrastructure. It is well-suited for scheduled or repeated data collection workflows where exports and job management matter.
Categories:
Tags:
Tech Stack:
Similar Services

n8n
Workflow automation platform with visual builder and code support
Self-hostable workflow automation platform combining a visual builder with JavaScript/Python code steps, 400+ integrations, and AI-assisted automation.

Ansible
Agentless IT automation and configuration management engine
Open source, agentless automation engine for configuration management, app deployment, orchestration, and infrastructure provisioning using YAML playbooks over SSH.

NocoDB
No-code spreadsheet interface for SQL databases with APIs
Open-source Airtable alternative that turns Postgres/MySQL/SQLite into a no-code spreadsheet UI with views, permissions, integrations, and REST APIs.

Huginn
Open-source platform for self-hosted automation agents
Huginn is an open-source automation platform that runs agents to monitor web data, process events, and trigger actions — self-hosted and extensible.


Apache Airflow
Platform to author, schedule, and monitor workflows as code
Apache Airflow is a workflow orchestration platform to define, schedule, and monitor data pipelines and other batch jobs using Python-defined DAGs.

Appsmith
Open-source low-code platform for internal tools and dashboards
Build and deploy internal tools, admin panels, and dashboards with a low-code UI builder that connects to databases and APIs and supports JavaScript logic and Git workflo...
Kubernetes
Docker
TypeScript
Python