What is Papeer?
Papeer is a specialized web scraping tool designed for the e-ink era, built by developer lapwat and first released in September 2021. Written in Go, this command-line utility addresses a specific need: converting web content into formats optimized for e-readers like Kindle, reMarkable tablets, and other digital reading devices.
The tool's primary strength lies in its ability to extract clean, readable content from websites while preserving essential formatting like bold text, italics, links, and images. Unlike general-purpose scrapers, Papeer focuses on creating distraction-free reading experiences by automatically removing ads, navigation menus, and other web clutter using the go-readability library.
With 346 stars and 25 forks on GitHub as of March 2026, Papeer has carved out a niche among developers and digital reading enthusiasts who want to convert web articles, documentation, and entire websites into portable ebook formats.
Getting Started
Installation is straightforward with multiple options available. For Go developers, the simplest method is using go install:
go install github.com/lapwat/papeer@latest
Alternatively, you can download pre-compiled binaries from the GitHub releases page for your specific platform. The tool requires no additional dependencies for basic functionality.
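With either installation method, the binary has to be reachable on your PATH before the commands in this article will work. The snippet below is a minimal sketch of that check, not part of Papeer itself: `on_path` is a hypothetical helper name. The only behavior it relies on is that `go install` writes binaries to GOBIN, or to GOPATH/bin when GOBIN is unset.

```shell
# Hypothetical helper: succeed if the given command is reachable on PATH.
on_path() {
  command -v "$1" >/dev/null 2>&1
}

if on_path papeer; then
  echo "papeer found at $(command -v papeer)"
else
  # go install writes to GOBIN, or to GOPATH/bin when GOBIN is unset.
  echo "papeer not found; check that your Go bin directory is on PATH" >&2
fi
```

If the check fails after `go install`, adding the directory printed by `go env GOPATH` plus `/bin` to PATH is usually the fix.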
If you need MOBI output (optional, since Kindle now supports EPUB), you'll also need kindlegen. On Linux, install it with:
TMPDIR=$(mktemp -d -t papeer-XXXXX)
curl -L https://github.com/lapwat/papeer/releases/download/kindlegen/kindlegen_linux_2.6_i386_v2_9.tar.gz > $TMPDIR/kindlegen.tar.gz
tar xzvf $TMPDIR/kindlegen.tar.gz -C $TMPDIR
chmod +x $TMPDIR/kindlegen
sudo mv $TMPDIR/kindlegen /usr/local/bin
rm -rf $TMPDIR
Once installed, verify the installation by running:
papeer --help
Usage & Practical Examples
Papeer's command-line interface is built around two main commands: list for previewing content structure and get for actual scraping.
Single Page Scraping
The simplest use case involves scraping a single web page:
papeer get https://example.com/article
This creates a Markdown file with the cleaned content. To specify different output formats:
papeer get https://example.com/article --format=epub
papeer get https://example.com/article --format=html --output="my-article"
Website Documentation Scraping
One of Papeer's most powerful features is scraping entire documentation sites. First, use the list command to preview the table of contents:
papeer list https://12factor.net/ --selector='section.concrete>article>h2>a'
This displays a numbered list of all pages that would be scraped. Once satisfied with the structure, run the actual scraping:
papeer get https://12factor.net/ --selector='section.concrete>article>h2>a' --format=epub
The tool creates a single ebook with each documentation page as a chapter, along with a table of contents.
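The preview-then-scrape workflow can be wrapped in a small shell function so that the same URL and selector are used for both steps. This is a minimal sketch rather than anything Papeer ships: `preview_then_get` is a hypothetical helper name, and it uses only the `list` and `get` commands with the `--selector` and `--format` flags shown in this section.

```shell
# Hypothetical helper: show the table of contents, then scrape only on confirmation.
preview_then_get() {
  url=$1
  selector=$2

  # Step 1: preview the pages matched by the selector.
  papeer list "$url" --selector="$selector" || return 1

  # Step 2: ask before downloading anything.
  printf 'Scrape these pages into an EPUB? [y/N] '
  read -r answer
  case $answer in
    [yY]*) papeer get "$url" --selector="$selector" --format=epub ;;
    *) echo "Aborted." ;;
  esac
}

# Usage (not run here):
#   preview_then_get https://12factor.net/ 'section.concrete>article>h2>a'
```

Passing the selector once avoids the easy mistake of previewing with one selector and scraping with a slightly different one.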
Advanced Options
Papeer provides fine-grained control over the scraping process:
# Limit to first 10 chapters
papeer get https://docs.example.com --selector='nav a' --limit=10
# Skip first 5 chapters, reverse order
papeer get https://blog.example.com --selector='.post-link' --offset=5 --reverse
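# Add delays between requests (respectful scraping)
papeer get https://example.com --selector='a.chapter' --delay=1000
# Or download with 2 concurrent threads instead
papeer get https://example.com --selector='a.chapter' --threads=2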
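These options can also be combined to split a large site into several smaller ebooks. The sketch below is hypothetical: `get_in_batches` is not a Papeer command, the `part-N` file names are illustrative, and it assumes `--offset` is applied before `--limit`, which this article does not confirm. The chapter count would come from a prior `papeer list` run.

```shell
# Hypothetical helper: fetch a site in fixed-size batches, one EPUB per batch.
# Assumes --offset is applied before --limit (unverified assumption).
get_in_batches() {
  url=$1
  selector=$2
  total=$3   # number of chapters, e.g. counted from `papeer list`
  size=$4    # chapters per ebook

  # Refuse a non-positive batch size, which would loop forever.
  [ "$size" -gt 0 ] || return 2

  offset=0
  while [ "$offset" -lt "$total" ]; do
    papeer get "$url" --selector="$selector" \
      --offset="$offset" --limit="$size" \
      --format=epub --output="part-$offset" --delay=1000 || return 1
    offset=$((offset + size))
  done
}

# Usage (not run here): 25 chapters in batches of 10 gives part-0, part-10, part-20.
#   get_in_batches https://docs.example.com 'nav a' 25 10
```

Smaller batches keep each ebook manageable on the device and make it cheaper to retry if one run fails partway through.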


