MinishLab/semble is a Python-based code search library designed to provide fast and accurate code snippet retrieval for developers and agents, significantly reducing the time and tokens required for code search compared to traditional methods.
Source: README View on GitHub →This project is gaining attention due to its significant reduction in token usage and search time compared to grep+read, making it an attractive solution for developers and agents requiring efficient code search capabilities. Its unique technical choice of using a hybrid search approach, which combines the speed of grep with the accuracy of a code-specialized transformer, stands out in the code search landscape.
Source: README, BenchmarksSemble provides fast indexing and search capabilities, with an average indexing time of ~250 ms and query response time of ~1.5 ms, while maintaining high accuracy comparable to code-specialized transformer models.
Source: READMESemble returns only the relevant code snippets, using approximately 98% fewer tokens than grep+read, which is particularly beneficial for resource-constrained environments.
Source: READMESemble operates on CPU without the need for API keys, GPUs, or external services, making it easy to deploy and use without additional infrastructure.
Source: READMESemble can run as an MCP server, allowing agents like Claude Code, Cursor, Codex, and OpenCode to directly search any codebase, enhancing the efficiency of code exploration for these agents.
Source: READMESemble supports both local and remote codebases, allowing users to search code from local directories or directly from git repositories.
Source: READMEThe architecture of Semble is inferred to be modular, with clear separation between indexing, searching, and the interface for agents. It likely uses a combination of data structures optimized for code search and a hybrid approach to leverage both grep's speed and transformer models' accuracy. The project's dependency on various libraries for embeddings, indexing, and file handling suggests a complex yet well-structured codebase.
Source: Code tree + dependency filesCenter: project; inner ring: core feature modules; outer ring: key dependencies. Auto-generated from core_features and tech_stack.key_deps.
model2vecvicinitynumpybm25spathspectree-sittertree-sitter-language-packSemble is suitable for developers and agents working with codebases of any size, particularly those requiring efficient code search capabilities. It is useful in scenarios such as debugging, code review, or when building tools that require code understanding, such as AI agents or code search engines.
Source: READMEv0.1.7 (2026-05-12): Fixed savings aggregation and bumped version.
Source: GitHub ReleasesMinishLab/semble is a promising project for those seeking an efficient and accurate code search solution. Its unique combination of speed and accuracy, along with its ease of deployment, makes it a valuable tool for developers and agents working with codebases of any size.