I, Librarian

Web-based PDF and reference manager for collaborative research

325 stars
31 forks
Last commit: 2mo ago
Repo age: 6y old

I, Librarian is a web-based application for organizing, annotating and sharing collections of PDF papers and office documents. It targets individual researchers and small-to-medium research groups, providing centralized storage, in-browser PDF annotation and full-text search with OCR support for scanned documents.

Key Features

  • Centralized library management with multi-user access and project-based collaboration.
  • In-browser PDF viewer with multicolor highlighting, pinned/shared notes and exportable annotations.
  • Full-text search across metadata, PDF text and annotations, with multilingual OCR for scanned documents.
  • Import and metadata harvesting from scientific sources (such as arXiv, PubMed, NASA, IEEE and Crossref) and citation export in formats such as BibTeX and EndNote.
  • Multiple deployment options: hosted service, Docker deployment or manual install; optional integrations such as SSO (OpenID/SAML/LDAP).

Use Cases

  • Research labs or departments that need a shared, searchable repository of papers and collaborative annotations.
  • Individual academics or students who want a personal reference manager with in-browser annotation and full-text search.
  • Institutions that need controlled access to a centrally hosted PDF library with audit and group features.

Limitations and Considerations

  • Self-hosted installations require a PHP-capable web server and a database backend. The official instructions reference Apache with PHP 8 or later, plus optional external tools: LibreOffice for Office import and Tesseract OCR for OCR. Those two features work only when the corresponding tool is installed and configured.
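
As a rough illustration of the dependency note above, the following Python sketch checks whether the optional external tools can be found on the PATH before a self-hosted install. It is not part of I, Librarian. The executable names (`soffice` for LibreOffice, `tesseract` for Tesseract OCR) are assumed common defaults, and `OPTIONAL_TOOLS`, `find_tools` and `report` are hypothetical names introduced only for this example.

```python
import shutil

# Hypothetical mapping of feature -> executable name. The binary names are
# assumed defaults, not taken from the I, Librarian documentation.
OPTIONAL_TOOLS = {
    "Office import (LibreOffice)": "soffice",
    "OCR (Tesseract)": "tesseract",
}


def find_tools(tools, which=shutil.which):
    """Return {feature: path or None} for each executable name."""
    return {feature: which(binary) for feature, binary in tools.items()}


def report(found):
    """Turn the lookup result into one human-readable line per feature."""
    lines = []
    for feature, path in found.items():
        status = f"found at {path}" if path else "not found on PATH"
        lines.append(f"{feature}: {status}")
    return lines


if __name__ == "__main__":
    for line in report(find_tools(OPTIONAL_TOOLS)):
        print(line)
```

A tool that is reported as missing does not prevent installation; under the description above it would only mean that the matching optional feature is unavailable until the tool is installed and configured.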

I, Librarian is available as a hosted SaaS or as a GPL-3.0 free edition for self-hosting; the project repository and deployment artifacts (Dockerfile, Caddyfile) are publicly maintained. It is focused on research-oriented PDF management and team collaboration.

Similar Services

Stirling PDF

Self-hosted PDF editing, conversion, OCR, and automation platform

74.6k stars
6.3k forks
Last commit: 7h ago

Open-source PDF platform to edit, convert, OCR, sign, redact, and automate PDF workflows via a web UI and REST API.

Alternative to: Adobe Acrobat (+19 more)

Paperless-ngx

Document management system with OCR, search, and automated filing

36.9k stars
2.3k forks
Last commit: 1d ago

Paperless-ngx is an open-source document management system that ingests scans and files, runs OCR, and turns them into a searchable, taggable document archive.

Alternative to: DocuWare (+6 more)

Reactive Resume

Privacy-focused, open-source resume builder

35.4k stars
3.9k forks
Last commit: 1d ago

Open-source resume builder for creating, customizing, exporting and publishing resumes with templates, PDF export, public sharing and optional OpenAI assistance.

Alternative to: Resume.io (+5 more)

CyberChef

Browser-based toolkit for data decoding, encoding and analysis

34.1k stars
3.9k forks
Last commit: 1d ago

CyberChef is a web-based “cyber” toolkit for encoding/decoding, encryption/decryption, compression, hashing, parsing, and data transformation using drag-and-drop recipes.

ArchiveBox

Open-source self-hosted web archiving and snapshotting tool

26.9k stars
1.5k forks
Last commit: 1d ago

Self-hosted tool to collect and preserve webpages, media, and bookmarks in durable formats (HTML, PDF, WARC, MP4) with a CLI, web UI, and search.

Alternative to: Internet Archive Wayback Machine (+3 more)

ebook2audiobook

Convert eBooks into audiobooks with TTS and optional voice cloning

18.3k stars
1.5k forks
Last commit: 5d ago

Self-hostable tool to convert non-DRM eBooks into audiobooks with chapter support, metadata, multilingual TTS engines, and optional voice cloning via a web UI or CLI.

Alternative to: Speechify (+7 more)