txtdot
Lightweight HTTP proxy that extracts article text, images and links, removes ads and heavy scripts, proxies media and compresses images to save bandwidth and improve load times.

txtdot is an HTTP proxy that simplifies web pages by extracting readable article text, links and images while stripping ads and heavy client-side scripts. It is designed to reduce bandwidth usage and deliver a clean, readable view of pages server-side.
Key Features
- Server-side page simplification using a readability-based parser to extract main article content, links and images
- Media proxy that serves and rewrites images and media assets through the server
- Image compression and processing via a server-side image pipeline to reduce payload size
- No client-side JavaScript required for rendered pages, producing fast, low-resource frontends
- API endpoints and plugin system for customization and integration (SDK and plugins architecture)
- Optional integration with search backends and search frontend support
- Lightweight TypeScript/Node.js implementation intended for Docker-based deployment
Use Cases
- Provide a reading-mode proxy for low-bandwidth or mobile users to improve page load and reduce data usage
- Serve privacy-friendly, script-free article views for kiosks, public terminals, or content archiving
- Integrate simplified page content into downstream services or indexes for search and analytics
Limitations and Considerations
- The original v1 development is marked as discontinued; newer development continues on a separate branch, so feature stability and maintenance may vary
- Pages that rely heavily on client-side interactivity or dynamic JavaScript rendering may lose functionality when simplified
txtdot is useful when you need a server-side reading-mode proxy that prioritizes bandwidth savings, privacy, and simplified content delivery. It is extensible via plugins and suited for environments where client resources or network capacity are constrained.
