semble_rs is a Rust port and superset of MinishLab/semble built for AI coding agents. It returns the exact code chunks an agent needs, prints a token-cheap codebase tree instead of ls -R, and compresses 3 MB CI logs into 35 KB. One single binary, no daemon, no API keys, no GPU. Hybrid BM25 + Model2Vec static embeddings with code-aware reranking, plus a dependency graph, AST chunking, and a digest pipeline for build / test / CI output.

Quickstart

# Install Rust if needed, then:
git clone https://github.com/johunsang/semble_rs.git && cd semble_rs
cargo install --path .

The binary lands at ~/.cargo/bin/semble_rs. On first run, the default embedding model minishlab/potion-code-16M (~60 MB) is downloaded from HuggingFace.

# Map the codebase (replaces ls -R)
semble_rs tree ./my-project --symbols

# Find code by what it does (replaces grep + cat)
semble_rs search "how is auth handled" ./my-project --outline

# Compress build / CI output before reading it
cargo build 2>&1 | semble_rs digest
gh run view <id> --log-failed | semble_rs digest

For agent integration (Claude Code, Codex, Cursor), see Agent integration.

Main Features

Fast: indexes the local repo (22 files) in ~150 ms, ~10 s on 1,600 files. Static embedder — no transformer forward pass at query time.
Token-efficient: tree collapses ls -R by 4×–747×; --outline is -47% vs full output; digest reaches on real GitHub Actions logs.

semble_rs

Quickstart

Main Features

Related Skills

Search

Output modes

`find-related`

`plan`

`--model`

Tree

Digest

Dependency graph

`find-pattern`

Encode

Agent integration