Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

TL;DR

Semble is a code search tool designed for agents that uses approximately 98% fewer tokens than traditional grep methods. It offers rapid, accurate code retrieval on CPU without external dependencies. This development could significantly improve code search efficiency for AI agents and developers.

Semble, a new code search library tailored for AI agents, claims to reduce token usage by approximately 98% compared to traditional grep-based searches, while delivering faster, accurate results on CPU without external dependencies.

Semble is designed to enable agents such as Claude Code, Codex, and OpenCode to perform instant, precise code searches within large codebases. It indexes repositories in around 250 milliseconds and answers queries in roughly 1.5 milliseconds, all on CPU, with no need for API keys or external services. Benchmarks indicate its retrieval quality is comparable to specialized transformer models, but with significantly lower resource requirements.

The tool can be integrated as an MCP server or used via command-line, supporting local paths and git URLs. It provides features like natural language search queries (e.g., ‘How is authentication handled?’) and related code discovery, making it a practical enhancement for code exploration in AI workflows.

Why It Matters

This development is important because it offers a highly efficient, cost-effective way for AI agents to access and understand codebases, potentially accelerating development workflows and reducing computational costs. Its token efficiency and speed could make code search more accessible and scalable for large projects and multiple users.

FOXWELL NT301 OBD2 Scanner Live Data Professional Mechanic OBDII Diagnostic Code Reader Tool for Check Engine Light

【Read Fault Codes】About the read code funtion needs to be in the ignition on state and if the…

As an affiliate, we earn on qualifying purchases.

Background

Traditional code search methods like grep read entire files, consuming many tokens and time, especially in large repositories. Existing transformer-based models provide accurate search results but are resource-intensive. Semble aims to bridge this gap by offering comparable accuracy with minimal token usage and faster performance, addressing a key bottleneck in AI-assisted coding workflows.

“Semble returns only the relevant chunks, using approximately 98% fewer tokens than grep+read, while maintaining high retrieval quality.”

— Semble developers

“Indexing and searching a full codebase takes under a second, which is a significant improvement for developers and AI agents alike.”

— Hacker News user

ADREAMER AI GPT Mouse, Voice Input Search, Voice and Word Translation, Form Document Output, Code Generation, SWOT Analysis, AI Drawing, AI Writing, Rechargeable, for Win7/8/10/11 Mac OS.

【Intelligent AI Interaction, Answers to Every Question】 With an advanced built – in AI assistant, it's like having…

As an affiliate, we earn on qualifying purchases.

What Remains Unclear

It is not yet clear how Semble performs across a wide variety of codebases or in different programming languages beyond initial benchmarks. Long-term stability and integration support are still to be tested in diverse environments.

Code by Note, Bk 1: Find the Patterns by Reading the Notes, Coloring Book (Color by Note, Bk 1)

Used Book in Good Condition

As an affiliate, we earn on qualifying purchases.

What’s Next

Next steps include broader adoption by AI developers, further benchmarking across various repositories, and potential integration into more agent workflows. Updates may include feature enhancements and expanded language support.

Generative AI for Software Development: Building Software Faster and More Effectively

As an affiliate, we earn on qualifying purchases.

Key Questions

How does Semble compare in accuracy to existing code search tools?

Benchmarks show Semble achieves approximately 99% of the retrieval quality of specialized transformer models, with significantly lower token usage and faster response times.

Can Semble be used with any codebase or programming language?

Semble supports local repositories and git URLs, but detailed language support is not specified. Its performance in languages other than those tested remains to be confirmed.

Is Semble easy to set up and integrate into existing workflows?

Yes, it runs on CPU with no external dependencies. It can be added as an MCP server or called via CLI, with straightforward setup instructions for popular agents like Claude Code, Codex, and OpenCode.

What are the limitations or potential drawbacks of Semble?

Its performance across diverse codebases and in different languages is still being evaluated. Long-term stability and compatibility with various agent architectures are also to be determined.

Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

Up next

How API-First Thinking Improves Engineering Speed

Author

Geek Salad Team

Share article

Why It Matters

FOXWELL NT301 OBD2 Scanner Live Data Professional Mechanic OBDII Diagnostic Code Reader Tool for Check Engine Light

Background

ADREAMER AI GPT Mouse, Voice Input Search, Voice and Word Translation, Form Document Output, Code Generation, SWOT Analysis, AI Drawing, AI Writing, Rechargeable, for Win7/8/10/11 Mac OS.

What Remains Unclear

Code by Note, Bk 1: Find the Patterns by Reading the Notes, Coloring Book (Color by Note, Bk 1)

What’s Next

Generative AI for Software Development: Building Software Faster and More Effectively

Key Questions

How does Semble compare in accuracy to existing code search tools?

Can Semble be used with any codebase or programming language?

Is Semble easy to set up and integrate into existing workflows?

What are the limitations or potential drawbacks of Semble?

How Contract Testing Helps Distributed Teams

Uv is fantastic, but its package management UX is a mess

You Don’t Love Systemd Timers Enough

Choosing the Right Database: SQL Vs Nosql

Order A Burned CD Of Your Own Public GitHub Repo

Technology Operations Signal Monitor: PeerTube Is A Free, Decentralized And Federated Video Platform

14 Best Sofa Deals in 2026

6 Best Antique Style Furniture in 2026

Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

Up next

Author

Geek Salad Team

Share article

Why It Matters

FOXWELL NT301 OBD2 Scanner Live Data Professional Mechanic OBDII Diagnostic Code Reader Tool for Check Engine Light

Background

ADREAMER AI GPT Mouse, Voice Input Search, Voice and Word Translation, Form Document Output, Code Generation, SWOT Analysis, AI Drawing, AI Writing, Rechargeable, for Win7/8/10/11 Mac OS.

What Remains Unclear

Code by Note, Bk 1: Find the Patterns by Reading the Notes, Coloring Book (Color by Note, Bk 1)

What’s Next

Generative AI for Software Development: Building Software Faster and More Effectively

Key Questions

How does Semble compare in accuracy to existing code search tools?

Can Semble be used with any codebase or programming language?

Is Semble easy to set up and integrate into existing workflows?

What are the limitations or potential drawbacks of Semble?

You May Also Like