Maxun

Open source no-code web scraping and data extraction robots

14.2k stars
1.1k forks
Last commit: 5d ago
Repo age: 3y

Maxun is an open source platform for building no-code “robots” that navigate websites like a real user and turn web content into structured data, clean markdown, or API outputs. It’s designed for quick web automation and repeatable extraction workflows, with options for both recorder-based and LLM-assisted extraction.

Key Features

  • No-code recorder mode to capture browsing actions and reuse them as extraction robots
  • LLM-powered extraction mode for describing desired fields in natural language
  • Multiple robot types: extract structured data, scrape pages to markdown/HTML, crawl sites, and run automated web searches
  • Generate REST-style endpoints from extraction robots to turn websites into structured APIs
  • Scheduling for recurring runs and ongoing data collection
  • Support for common dynamic patterns like pagination and infinite scroll
  • Resilience features aimed at recovering from website layout changes
  • SDK for programmatic control of robots and automation workflows

Use Cases

  • Competitive and market research by tracking prices, listings, and product changes
  • Lead generation and enrichment by extracting contact details and company data
  • Feeding AI workflows with clean markdown content for RAG and document processing

Limitations and Considerations

  • Web automation reliability can vary based on target site defenses (bot detection, CAPTCHAs) and frequent UI changes
  • LLM-based extraction quality depends on the selected model and prompt context, and may require validation
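
The validation step mentioned above can be sketched as a small post-processing check on extracted records. This is only an illustration under stated assumptions: the field names (`title`, `price`, `url`), the `REQUIRED_FIELDS` schema, and both helper functions are hypothetical and are not part of Maxun or its SDK. It assumes extraction results arrive as a list of plain dictionaries.

```python
from typing import Any

# Hypothetical schema (an assumption for this sketch): field name -> expected type.
REQUIRED_FIELDS: dict[str, type] = {"title": str, "price": float, "url": str}


def validate_record(record: dict[str, Any]) -> list[str]:
    """Return the problems found in one extracted record (empty list if none)."""
    problems = []
    for field, expected_type in REQUIRED_FIELDS.items():
        value = record.get(field)
        if value is None or value == "":
            # Field absent, or extracted as an empty string.
            problems.append(f"missing field: {field}")
        elif not isinstance(value, expected_type):
            problems.append(
                f"{field}: expected {expected_type.__name__}, "
                f"got {type(value).__name__}"
            )
    return problems


def split_valid(
    records: list[dict[str, Any]],
) -> tuple[list[dict[str, Any]], list[tuple[dict[str, Any], list[str]]]]:
    """Separate clean records from those that need manual review."""
    valid, rejected = [], []
    for record in records:
        problems = validate_record(record)
        if problems:
            rejected.append((record, problems))
        else:
            valid.append(record)
    return valid, rejected


rows = [
    {"title": "Widget", "price": 9.99, "url": "https://example.com/a"},
    {"title": "", "price": "9.99", "url": "https://example.com/b"},
]
valid, rejected = split_valid(rows)
print(len(valid), "valid;", len(rejected), "flagged for review")
```

Keeping rejected records alongside their reasons, rather than silently dropping them, makes it easier to tell whether a failure comes from the model output or from a change on the target site.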

Maxun fits teams that need repeatable web data collection without building custom scrapers from scratch, while still offering an SDK for deeper integration. It can scale from quick one-off extractions to scheduled pipelines that power internal systems and AI applications.

Similar Services

n8n

Workflow automation platform with visual builder and code support

169.5k stars
53.7k forks
Last commit: 23h ago

Self-hostable workflow automation platform combining a visual builder with JavaScript/Python code steps, 400+ integrations, and AI-assisted automation.

Alternative to: Zapier, +17 more

Ansible

Agentless IT automation and configuration management engine

67.7k stars
24.2k forks
Last commit: 22h ago

Open source, agentless automation engine for configuration management, app deployment, orchestration, and infrastructure provisioning using YAML playbooks over SSH.

Alternative to: Red Hat Ansible Automation Platform, +4 more

NocoDB

No-code spreadsheet interface for SQL databases with APIs

61.5k stars
4.6k forks
Last commit: 1d ago

Open source Airtable alternative that turns Postgres/MySQL/SQLite into a no-code spreadsheet UI with views, permissions, integrations, and REST APIs.

Alternative to: Airtable, +10 more

Huginn

Open-source platform for self-hosted automation agents

48.5k stars
4.2k forks
Last commit: 24d ago

Huginn is an open source, self-hosted, extensible automation platform that runs agents to monitor web data, process events, and trigger actions.

Alternative to: IFTTT, +17 more

Apache Airflow

Platform to author, schedule, and monitor workflows as code

43.9k stars
16.3k forks
Last commit: 19h ago

Apache Airflow is a workflow orchestration platform to define, schedule, and monitor data pipelines and other batch jobs using Python-defined DAGs.

Alternative to: Astronomer, +5 more

Appsmith

Open-source low-code platform for internal tools and dashboards

38.9k stars
4.4k forks
Last commit: 2d ago

Build and deploy internal tools, admin panels, and dashboards with a low-code UI builder that connects to databases and APIs and supports JavaScript logic and Git workflo...

Alternative to: Retool, +14 more