by 19-84
A PostgreSQL-backed archive generator that creates browsable HTML archives from link aggregator platforms including Reddit, Voat, and Ruqqus.
# Add to your Claude Code skills
git clone https://github.com/19-84/redd-archiver⭐ If you find this project useful, please star the repo! It helps others discover the tool and motivates continued development.
Transform compressed data dumps into browsable HTML archives with flexible deployment options. Redd-Archiver supports offline browsing via sorted index pages OR full-text search with Docker deployment. Features mobile-first design, multi-platform support, and enterprise-grade performance with PostgreSQL full-text indexing.
Supported Platforms: | Platform | Format | Status | Available Posts | Data | |----------|--------|--------|----------------|------| | Reddit | .zst JSON Lines (Pushshift) | ✅ Full support | 2.38B posts (40,029 subreddits, through Dec 31 2025) | Download | | Voat | SQL dumps | ✅ Full support | 3.81M posts, 24.1M comments (22,637 subverses, complete archive) | | | | .7z JSON Lines | ✅ Full support | 500K posts (6,217 guilds, complete archive) | |
No comments yet. Be the first to share your thoughts!
Tracked content: 2.384 billion posts across 68,883 communities (Reddit full Pushshift dataset through Dec 31 2025, Voat/Ruqqus complete archives)
Version 1.0 features multi-platform archiving, REST API with 30+ endpoints, MCP server for AI integration, and PostgreSQL-backed architecture for large-scale processing.
Archive internet history before it disappears - Deploy in 2 minutes, no domain required.
Try the live demo: Browse Example Archive →
→ QUICKSTART.md - Step-by-step deployment:
Why now? Communities get banned, platforms shut down, discussions vanish. Start preserving today.
→ First time here? QUICKSTART.md - Deploy in 2-15 minutes
→ Quick answers? FAQ - Common questions answered in 30 seconds
→ Need help? Troubleshooting - Fix common issues
→ Using the API? API Reference - 30+ REST endpoints
→ How it works? Architecture - Technical deep-dive
→ Deployment guides:
→ Advanced:
Archive content from multiple link aggregator platforms in a single unified archive:
| Platform | Format | CLI Flag | URL Prefix |
|----------|--------|----------|------------|
| Reddit | .zst JSON Lines | --subreddit | /r/ |
| Voat | SQL dumps | --subverse | /v/ |
| Ruqqus | .7z JSON Lines | --guild | /g/ |
29 MCP tools auto-generated from OpenAPI for AI assistants:
{
"mcpServers": {
"reddarchiver": {
"command": "uv",
"args": ["--directory", "/path/to/mcp_server", "run", "python", "server.py"],
"env": { "REDDARCHIVER_API_URL": "http://localhost:5000" }
}
}
}
See MCP Server Documentation for complete setup guide.

Main landing page showing archive overview with statistics for 9,592 posts across Reddit, Voat, and Ruqqus. Features customizable branding (site name, project URL), responsive cards, activity metrics, and content statistics. (Works offline)

Post listing with sorting options (score, comments, date), pagination, and badge coloring. Includes navigation and theme toggle. (Works offline - sorted by score/comments/date)

Individual post displaying nested comment threads with collapsible UI, user flair, and timestamps. Comments include anchor links for direct navigation from user pages. (Works offline)
Fully optimized for mobile devices with touch-friendly navigation and responsive layout.

PostgreSQL full-text search with Google-style operators. Supports filtering by subreddit, author, date range, and score. (Requires Docker deployment)

Search results with highlighted excerpts using PostgreSQL ts_headline(). Sub-second response times with GIN indexing. (Server-based, Tor-compatible)
Sample Archive: Multi-platform archive featuring programming and technology communities from Reddit, Voat, and Ruqqus · See all screenshots →
Prerequisites: Python 3.7+, PostgreSQL 12+, 4GB+ RAM
Quick Install (Docker):
git clone https://github.com/19-84/redd-archiver.git
cd redd-archiver
# Create required directories
mkdir -p data output/.postgres-data logs tor-public
# Configure environment (IMPORTANT: change passwords!)
cp .env.example .env
nano .env # Edit POSTGRES_PASSWORD and DATABASE_URL
# Start services
docker compose up -d
# Generate archive (after downloading .zst files to data/)
python reddarc.py data/ \
--subreddit privacy \
--comments-file data/privacy_comments.zst \
--submissions-file data/privacy_submissions.zst \
--output output/
Detailed installation procedures (Docker, Ubuntu/Debian, macOS, Windows WSL2):
Quick workflow: Download data → Run archive generator → Deploy
# Generate archive (assumes .zst files in data/ directory)
python reddarc.py data