Mapache is a fast, secure, deduplicating, incremental backup tool written in Rust.
You can find more in-depth documentation.
mapache (Spanish for raccoon 🦝) is a high-performance, deduplicating backup
tool designed for speed, reliability, and uncompromising security. Inspired by
restic and built with Rust, it provides a modern
approach to incremental backups.
At its core, mapache operates on a content-addressable repository model. Every
file, directory, and piece of metadata is decomposed into binary objects
identified by their cryptographic hash. This architecture naturally
enables global deduplication: if multiple files share the same content—even
across different snapshots or machines — mapache stores that data only once.
To ensure storage efficiency and high I/O throughput, these objects are bundled
into "pack files" and tracked via a central index that allows for near-instant
lookups and atomic repository updates.
Each backup is captured as a "Snapshot" representing a complete, point-in-time
state of your file system. Unlike traditional backup tools that rely on complex
"full vs. incremental" chains, every mapache snapshot is technically
independent but shares underlying data blobs with others. This means you can
delete any old snapshot at any time without risking the integrity of newer
ones. All data, from file contents to directory structures, is compressed with
zstd and protected by AES-GCM-SIV authenticated encryption, ensuring your
repository remains a "black box" to anyone without the master key.
Mapache is a feature-complete backup solution. While the architecture is designed for reliability and has extensive test coverage, it is a relatively new project. As with any tool managing critical data, users should perform their own validation before relying on it for primary backups.
- Deduplication: FastCDC (Content-Defined Chunking) identifies shifted data to minimize storage footprint.
- Security: Mandatory AES-GCM-SIV encryption and Argon2 KDF — your data is never stored or transmitted in the clear.
- Compression: Zstd compression with adjustable levels to balance backup speed and storage usage.
- Backends: Native support for Local FS, SFTP, and S3.
- Portable: A single, statically linked binary with zero external dependencies.
- Verifiable: Verify all snapshots, packs, and blobs to make sure your data can be restored at any time.
- TOML Config: Centralized repository settings via a
.tomlconfiguration file, overridable with CLI flags. - Bundle Files: Self-contained
.mapachebundle format with deduplication, encryption, and FUSE mount support for secure data transfer. - Flexible Retention: Policy-based snapshot retention with hourly, daily, weekly, monthly, yearly rules, plus host and tag filtering.
This is a non-exhaustive set of benchmarks run on my development hardware. They serve as a baseline for comparing performance between versions, using restic v0.18.1 as a base.
Test environment: Fedora 44, AMD Ryzen 9 3900X (24 threads), SanDisk Extreme PRO NVMe.
Each result is the average of 3 runs following a warmup run, all on local storage. Both tools are run with default settings and 8 readers (read-concurrency) for backup.
Mapache has traditionally been slower with datasets made of many small files, so this benchmark test addresses that area specifically.
Workloads:
- kernel — Linux kernel source tree (~1.6 GB, 99'131 objects)
- enron — Enron email corpus (~1.4 GB, 520'901 objects)
| Tool | Operation | Time | CPU | Peak RAM | Repo Size |
|---|---|---|---|---|---|
| mapache | backup | 2.34s | 1194% | 566 MB | 304.1 MB |
| restic | backup | 3.66s | 1246% | 612 MB | 308.8 MB |
| mapache | restore | 9.96s | 117% | 268 MB | — |
| restic | restore | 10.57s | 133% | 255 MB | — |
| Tool | Operation | Time | CPU | Peak RAM | Repo Size |
|---|---|---|---|---|---|
| mapache | backup | 5.54s | 1162% | 594 MB | 717.0 MB |
| restic | backup | 9.48s | 1198% | 413 MB | 724.9 MB |
| mapache | restore | 32.90s | 127% | 373 MB | — |
| restic | restore | 41.62s | 114% | 445 MB | — |
To compile mapache from source, ensure you have the Rust toolchain installed,
then build and install the binary:
# Build the optimized release binary
cargo build --release
# Install it to your cargo bin path
cargo install --path corecargo build compiles binaries with some dynamically linked dependencies. While
this is fine for testing and development on the same hardware, if you need a
statically linked binary (which I strongly recommend for portability), run
make release-static or use the binaries provided in the Releases page for a
specific released version.
Note for Linux users: The
mountcommand requires FUSE development headers (e.g.,libfuse-dev). To build without FUSE support, use--no-default-features.
# Local directory
mapache init -r /path/to/repo
# SFTP server
mapache init -r sftp://user@host/backup-folder
# S3 Bucket
mapache init -r s3://my-bucket/backup-foldermapache snapshot ~/Documents -r /path/to/repomapache log -c -r /path/to/repomapache restore --target /tmp/restore-folder -r /path/to/repomapache 0.1.0 was the first public stable release. It was meant to be a first stable prototype with all core features. This version validated the architecture.
v0.2.0 marks the finalization of the Archiver. The complete engine was refactored to make the backend code async. An initial S3 backend implementation was added and the SFTP backend was reimplemented with a rust-native async crate. The async refactor had the additional challenge of tuning parallelism to trim down memory usage while maintaining performance.
Redesigned the restorer into a high-performance, pack-centric engine with background prefetching and concurrent restoration to significantly improve I/O efficiency.
-
restoreredesign - Multi-platform static builds (Linux x64/ARM, Windows, macOS)
- Return codes for commands
- Security hardening (secure join, zeroize, FUSE permissions)
Focused on new tooling, configuration, and performance optimizations.
- Bundle command — Self-contained
.mapachebundle files with deduplication, encryption, and FUSE mount capability. - TOML config file — Centralized repository settings with
--configflag. - Enhanced retention —
--host,--keep-hourly,--keep-minfor theforgetcommand. - S3 multipart uploads — For files >= 128 MiB.
- Access time preservation —
--with-atimeflag for snapshot command. - JSON output for
clean,stats, and other commands. index.
In the future, I want to polish all rough edges, like adding json output and
error codes to all commands. Maybe a TUI, but that's a wholly different
endeavor.
Other planned features (non-exhaustive):
- incremental restore
- master key rotation