Maxun
Open source no-code web scraping and data extraction robots
Maxun is an open-source platform for building no-code “robots” that navigate websites like a real user and turn web content into structured data, clean markdown, or API outputs. It is designed for quick web automation and repeatable extraction workflows, and offers both recorder-based and LLM-assisted extraction.
Key Features
- No-code recorder mode to capture browsing actions and reuse them as extraction robots
- LLM-powered extraction mode for describing desired fields in natural language
- Multiple robot types: extract structured data, scrape pages to markdown/HTML, crawl sites, and run automated web searches
- Generate REST-style endpoints from extraction robots to turn websites into structured APIs
- Scheduling for recurring runs and ongoing data collection
- Support for common dynamic patterns like pagination and infinite scroll
- Resilience features aimed at recovering from website layout changes
- SDK for programmatic control of robots and automation workflows
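As a rough illustration of the pagination pattern mentioned in the feature list, the sketch below follows "next page" cursors until none remain. It is a generic sketch and does not reflect Maxun's implementation or SDK. The names `collect_paginated` and `fetch_page`, and the cursor convention, are assumptions made for the example.

```python
from typing import Callable, Hashable, List, Optional, Tuple

# Hypothetical page shape: (items on this page, cursor of the next page or None).
Page = Tuple[List[dict], Optional[Hashable]]


def collect_paginated(
    fetch_page: Callable[[Hashable], Page],
    start: Hashable,
    max_pages: int = 100,
) -> List[dict]:
    """Follow next-page cursors from `start`, gathering every item.

    Stops when there is no next cursor, when a cursor repeats (a loop),
    or when `max_pages` pages have been fetched.
    """
    items: List[dict] = []
    seen = set()
    cursor: Optional[Hashable] = start
    while cursor is not None and cursor not in seen and len(seen) < max_pages:
        seen.add(cursor)
        page_items, cursor = fetch_page(cursor)
        items.extend(page_items)
    return items


# Usage with an in-memory stand-in for a paginated site (assumed data).
FAKE_SITE = {
    1: ([{"id": 1}], 2),
    2: ([{"id": 2}], 3),
    3: ([{"id": 3}], None),
}
all_items = collect_paginated(lambda c: FAKE_SITE[c], start=1)
```

The repeated-cursor check and the page cap guard against sites whose "next" link points back to a page already visited.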
Use Cases
- Competitive and market research by tracking prices, listings, and product changes
- Lead generation and enrichment by extracting contact details and company data
- Feeding AI workflows with clean markdown content for RAG and document processing
Limitations and Considerations
- Web automation reliability can vary based on target site defenses (bot detection, CAPTCHAs) and frequent UI changes
- LLM-based extraction quality depends on the selected model and prompt context, and may require validation
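Since LLM-extracted fields may need validation, a simple post-processing check can flag incomplete or malformed records before they reach downstream systems. The sketch below is one possible approach and is not part of Maxun. The field names, the `validate_record` and `split_valid` helpers, and the deliberately loose email pattern are all assumptions for illustration.

```python
import re
from typing import Dict, Iterable, List, Tuple

# Deliberately loose pattern (an assumption): something@something.tld
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def validate_record(record: Dict[str, object], required: Iterable[str]) -> List[str]:
    """Return a list of problems found in one extracted record (empty if none)."""
    problems: List[str] = []
    for field in required:
        value = record.get(field)
        if value is None or (isinstance(value, str) and not value.strip()):
            problems.append(f"missing field: {field}")
    email = record.get("email")
    if isinstance(email, str) and email.strip() and not EMAIL_RE.match(email.strip()):
        problems.append(f"malformed email: {email!r}")
    return problems


def split_valid(
    records: Iterable[Dict[str, object]], required: Iterable[str]
) -> Tuple[List[dict], List[Tuple[dict, List[str]]]]:
    """Separate records into clean ones and (record, problems) pairs for review."""
    required = list(required)
    valid, rejected = [], []
    for record in records:
        problems = validate_record(record, required)
        if problems:
            rejected.append((record, problems))
        else:
            valid.append(record)
    return valid, rejected


# Usage with made-up extraction output (assumed field names).
extracted = [
    {"company": "Acme", "email": "sales@acme.example"},
    {"company": "", "email": "not-an-email"},
]
valid, rejected = split_valid(extracted, required=["company", "email"])
```

Rejected records keep their list of problems, so they can be sent for manual review or re-extraction instead of being dropped silently.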
Maxun fits teams that need repeatable web data collection without building custom scrapers from scratch, while still offering an SDK for deeper integration. It can scale from quick one-off extractions to scheduled pipelines that power internal systems and AI applications.
Tech Stack:
Similar Services

n8n
Workflow automation platform with visual builder and code support
Self-hostable workflow automation platform combining a visual builder with JavaScript/Python code steps, 400+ integrations, and AI-assisted automation.

Ansible
Agentless IT automation and configuration management engine
Open source, agentless automation engine for configuration management, app deployment, orchestration, and infrastructure provisioning using YAML playbooks over SSH.

NocoDB
No-code spreadsheet interface for SQL databases with APIs
Open-source Airtable alternative that turns Postgres/MySQL/SQLite into a no-code spreadsheet UI with views, permissions, integrations, and REST APIs.

Huginn
Open-source platform for self-hosted automation agents
Huginn is an open-source automation platform that runs agents to monitor web data, process events, and trigger actions — self-hosted and extensible.


Apache Airflow
Platform to author, schedule, and monitor workflows as code
Apache Airflow is a workflow orchestration platform to define, schedule, and monitor data pipelines and other batch jobs using Python-defined DAGs.

Appsmith
Open-source low-code platform for internal tools and dashboards
Build and deploy internal tools, admin panels, and dashboards with a low-code UI builder that connects to databases and APIs and supports JavaScript logic and Git workflo...
JavaScript
Docker
TypeScript
Node.js