The CLI supports Python 3.11~3.13. You may also need a fairly recent c++/rust compiler because the core components of the vector database (ChromaDB) contains c++ and rust code.
The recommended way of
installation is through uv, which will
create a virtual environment for the package itself that doesn't mess up with
your system Python or project-local virtual environments.
After installing uv, run:
uv tool install "vectorcode<1.0.0"in your shell. To specify a particular version of Python, use the --python
flag. For example, uv tool install vectorcode --python python3.11. For hardware
accelerated embedding, refer to the relevant section.
If you want a CPU-only installation without CUDA dependencies required by
default by PyTorch, run:
uv tool install "vectorcode<1.0.0" --index https://download.pytorch.org/whl/cpu --index-strategy unsafe-best-matchIf you need to install multiple dependency group (for LSP or MCP), you can use the following syntax:
uv tool install "vectorcode[lsp,mcp]<1.0.0"Note
The command only install VectorCode and SentenceTransformer, the default
embedding engine. If you need to install an extra dependency, you can use
uv tool install vectorcode --with <your_deps_here>
To install from source, either git clone this repository and run uv tool install <path_to_vectorcode_repo>, or use pipx:
pipx install git+https://github.com/Davidyz/VectorCodeThe motivation behind the change from pipx to uv tool is mainly the
performance. The caching mechanism in uv makes it a lot faster than pipx for a
lot of operations. If you installed VectorCode via pipx, you can continue to
use pipx to manage your VectorCode installation. If you wish to switch to
uv, you need to uninstall VectorCode using pipx and then use uv to install
it as described above. All your VectorCode configurations and database files
will work out of the box on your new install.
Chromadb is the vector database used by VectorCode to store and retrieve the code embeddings. Although it is already bundled with VectorCode and you can absolutely use VectorCode just fine, it is recommended to set up a standalone local server (they provides detailed instructions through docker and systemd), because this will significantly reduce the IO overhead and avoid potential race condition.
If you're setting up a standalone ChromaDB server, I recommend sticking to v0.6.3, because VectorCode is not ready for the upgrade to ChromaDB 1.0 yet.
Windows support is not officially tested at this moment. This PR tracks my progress trying to provide better experiences for windows users.
If your environment doesn't support numpy version 2.0+, the default,
unconstrained numpy may not work for you. In this case, you can
try installing the package by uv tool install 'vectorcode[legacy]', which enforces
numpy v1.x. If this doesn't help, please open an issue with your OS, CPU
architecture, python version and the vectorcode virtual environment
(uv tool run --from=vectorcode python -m ensurepip && uv tool run --from=vectorcode python -m pip freeze).
A community-maintained Nix package is available here. If you're using nix to install a standalone Chromadb server, make sure to stick to 0.6.3.
If you install via Nix and run into an issue, please try to reproduce with the
PyPi package (install via uv or pipx). If it's not reproducible on the
non-nix package, I may close the issue immediately.
cd into your project root repo, and run:
vectorcode initThis will initialise the project for VectorCode and create a .vectorcode
directory in your project root. This is where you keep your configuration file
for VectorCode, if any.
After that, you can start vectorising files for the project.
vectorcode vectorise src/**/*.pyVectorCode doesn't track file changes, so you need to re-vectorise edited files. You may automate this by a git pre-commit hook, etc. See the advanced usage section for examples to set them up.
Ideally, you should try to vectorise all source code in the repo, but for large
repos you may experience slow queries. If that happens, try to vectorcode drop
the project and only vectorise files that are important or informative.
And now, you're ready to make queries that will retrieve the relevant documents:
vectorcode query reranker -n 3This will try to find the 3 most relevant documents in the embedding database
that are related to the query reranker. You can pass multiple query words:
vectorcode query embedding reranking -n 3or if you want to query a sentence, wrap them in quotation mark:
vectorcode query "How to configure reranker model"If things are going right, you'll see some paths being printed, followed by their content. These are the selected documents that are relevant to the query.
If you want to wipe the embedding for the repository (to use a new embedding function or after an upgrade with breaking changes), use
vectorcode dropTo see a full list of CLI options and tricks to optimise the retrieval, keep
reading or use the --help flag.
To maintain the accuracy of the vector search, it's important to keep your
embeddings up-to-date. You can simply run the vectorise subcommand on a file
to refresh the embedding for that file. Apart from that, the CLI provides a
vectorcode update subcommand, which updates the embeddings for all files that
are currently indexed by VectorCode for the current project.
If you want something more automagic, check out the advanced usage section about setting up git hooks to trigger automatic embedding updates when you commit/checkout to a different tag.
Please try the following and see if any of these fix your issue:
dropthe collection and re-index it, because there may be changes in the way embeddings are stored in the database;- upgrade/re-install the CLI (via
pipxor however you installed VectorCode).
For each project, VectorCode creates a collection (similar to tables in
traditional databases) and puts the code embeddings in the corresponding
collection. In the root directory of a project, you may run vectorcode init.
This will initialise the repository with a subdirectory
project_root/.vectorcode/. This will mark this directory as a project root, a
concept that will later be used to construct the collection. You may put a
config.json file in project_root/.vectorcode. This file may be used to store
project-specific settings such as embedding functions and database entry point
(more on this later). If you already have a global configuration file at
~/.config/vectorcode/config.json, it will be copied to
project_root/.vectorcode/config.json when you run vectorcode init. When a
project-local config file is present, the global configuration file is ignored
to avoid confusion.
The same logics apply to file specs, which tells VectorCode what
file it should (or shouldn't) vectorise. If you created a file spec
~/.config/vectorcode/vectorcode.include or
~/.config/vectorcode/vectorcode.exclude, they will be copied to the
project-local config directory (project_root/.vectorcode). They also serve as
the fallback value if no project-local specs are present.
If you skip vectorcode init, VectorCode will look for a directory that
contains .git/ subdirectory and use it as the project root. In this case, the
default global configuration will be used. If .git/ does not exist, VectorCode
falls back to using the current working directory as the project root.
To keep the embeddings up-to-date, you may find it useful to set up some git
hooks. The init subcommand provides a --hooks flag which helps you manage
hooks when working with a git repository. You can put some custom hooks in
~/.config/vectorcode/hooks/ and the vectorcode init --hooks command will
pick them up and append them to your existing hooks, or create new hook scripts
if they don't exist yet. The custom hook files should be named the same as they
would be under the .git/hooks directory. For example, a pre-commit hook would
be named ~/.config/vectorcode/hooks/pre-commit.
By default, there are 2 pre-defined hooks:
- A pre-commit hook that vectorises the modified files.
- A post-checkout hook that:
- vectorises the full repository if it's an initial commit/clone and a
vectorcode.includespec is available (either locally in the project or globally); - vectorises the files changed by the checkout.
- vectorises the full repository if it's an initial commit/clone and a
Both hooks will only be triggered on repositories that have a .vectorcode
directory in them.
Since 0.6.4, VectorCode adapted a json5 parser
for loading configuration. VectorCode will now look for config.json5 in
configuration directories, and if it doesn't find one, it'll look for
config.json too. Regardless of the filename extension, the json5 syntax will
be accepted. This allows you to leave trailing comma in the config file, as well
as writing comments (//). This can be very useful if you're experimenting with
the configs.
The JSON configuration file may hold the following values:
-
embedding_function: string, one of the embedding functions supported by Chromadb (find more here and here). For example, Chromadb supports Ollama aschromadb.utils.embedding_functions.OllamaEmbeddingFunction, and the corresponding value forembedding_functionwould beOllamaEmbeddingFunction. Default:SentenceTransformerEmbeddingFunction; -
embedding_params: dictionary, stores whatever initialisation parameters your embedding function takes. ForOllamaEmbeddingFunction, if you setembedding_paramsto:{ "url": "http://127.0.0.1:11434/api/embeddings", "model_name": "nomic-embed-text" }Then the embedding function object will be initialised as
OllamaEmbeddingFunction(url="http://127.0.0.1:11434/api/embeddings", model_name="nomic-embed-text"). Default:{}; -
embedding_dims: integer ornull, the number of dimensions to truncate the embeddings to. Make sure your model supports Matryoshka Representation Learning (MRL) before using this. Learn more about MRL here. When set tonull(or unset), the embeddings won't be truncated; -
db_url: string, the url that points to the Chromadb server. VectorCode will start an HTTP server for Chromadb at a randomly picked free port onlocalhostif your configuredhttp://host:portis not accessible. Default:http://127.0.0.1:8000; -
db_path: string, Path to local persistent database. If you didn't set up a standalone Chromadb server, this is where the files for your database will be stored. Default:~/.local/share/vectorcode/chromadb/; -
db_log_path: string, path to the directory where the built-in chromadb server will write the log to. Default:~/.local/share/vectorcode/; -
chunk_size: integer, the maximum number of characters per chunk. A larger value reduces the number of items in the database, and hence accelerates the search, but at the cost of potentially truncated data and lost information. Default:2500. To disable chunking, set it to a negative number; -
overlap_ratio: float between 0 and 1, the ratio of overlapping/shared content between 2 adjacent chunks. A larger ratio improves the coherence of chunks, but at the cost of increasing number of entries in the database and hence slowing down the search. Default:0.2. Starting from 0.4.11, VectorCode will use treesitter to parse languages that it can automatically detect. It uses pygments to guess the language from filename, and tree-sitter-language-pack to fetch the correct parser.overlap_ratiohas no effects when treesitter works. If VectorCode fails to find an appropriate parser, it'll fallback to the legacy naive parser, in which caseoverlap_ratioworks exactly in the same way as before; -
query_multiplier: integer, when you use thequerycommand to retrievendocuments, VectorCode will checkn * query_multiplierchunks and return at mostndocuments. A larger value ofquery_multiplierguarantees the return ofndocuments, but with the risk of including too many less-relevant chunks that may affect the document selection. Default:-1(any negative value means selecting documents based on all indexed chunks); -
reranker: string, the reranking method to use. Currently supportsNaiveReranker(sort chunks by the "distance" between the embedding vectors) andCrossEncoderReranker(using sentence-transformers cross-encoder ). -
reranker_params: dictionary, similar toembedding_params. The options passed to the reranker class constructor. ForCrossEncoderReranker, these are the options passed to theCrossEncoderclass. For example, if you want to use a non-default model, you can use the following:{ "reranker_params": { "model_name_or_path": "your_model_here" } } -
db_settings: dictionary, works in a similar way toembedding_params, but for Chromadb client settings so that you can configure authentication for remote Chromadb; -
hnsw: a dictionary of hnsw settings that may improve the query performances or avoid runtime errors during queries. It's recommended to re-vectorise the collection after modifying these options, because some of the options can only be set during collection creation. Example (and default):"hnsw": { "hnsw:M": 64, }
-
filetype_map:dict[str, list[str]], a dictionary where keys are language name and values are lists of Python regex patterns that will match file extensions. This allows overriding automatic language detection and specifying a treesitter parser for certain file types for which the language parser cannot be correctly identified (e.g.,.phtmlfiles containing both php and html). Example configuration:"filetype_map": { "php": ["^phtml$"] }
-
chunk_filters:dict[str, list[str]], a dictionary where the keys are language name and values are lists of Python regex patterns that will match chunks to be excluded from being vectorised. This only applies to languages supported by treesitter chunker. By default, no filters will be added. Example configuration:"chunk_filters": { "python": ["^[^a-zA-Z0-9]+$"], // multiple patterns will be merged (unioned) // or you can use wildcard to match any languages that has no dedicated filters: "*": ["^[^a-zA-Z0-9]+$"], }
-
encoding: string, alternative encoding used for this project. By default this project uses utf8. When this is set, VectorCode will decode files with the specified encoding, unless you choose to override this with the--encodingcommand line flag. You can also set this to_auto, which uses charset-normalizer to automatically detect the encoding, but this is not very accurate, especially on small files.
See the wiki for an example of the default configuration.
Run vectorcode vectorise <path_to_your_file> or vectorcode vectorise <directory> -r. There are a few extra tweaks you may use:
- chunk size: embedding APIs may truncate long documents so that the documents
can be handled by the embedding models. To solve this, VectorCode implemented
basic chunking features that chunks the documents into smaller segments so
that the embeddings are more representative of the code content. To adjust the
chunk size when vectorising, you may either set the
chunk_sizeoption in the JSON configuration file, or use--chunk_size/-cparameter of thevectorisecommand to specify the maximum number of characters per chunk; - overlapping ratio: when the chunk size is set to
$c$ and overlapping ratio set to$o$ , the maximum number of repeated content between 2 adjacent chunks will be$c \times o$ . This prevents loss of information due to the key characters being cut into 2 chunks. To configure this, you may either setoverlap_ratioin JSON configuration file or use--overlap/-oparameter.
Note that, the documents being vectorised is not limited to source code. You can even try documentation/README, or files that are in the filesystem but not in the project directory (yes I'm talking about neovim lua runtimes).
This command also respects .gitignore. It by default skips files in
.gitignore. To override this, run the vectorise command with -f/--force
flag.
There's also a update subcommand, which updates the embedding for all the indexed
files and remove the embeddings for files that no longer exist.
As a shorthand, you can create a file at project_root/.vectorcode/vectorcode.include.
This file should follow the same syntax as a
gitignore file. Files matched by this
specs will be vectorised when you run vectorcode vectorise without specifying
files. This file has lower priority than .gitignore, but you can override this
by the -f flag. It also doesn't assume --recursive, so if you want to add a
whole directory to this file, you can use dir/**, which matches all content
of dir/ recursively.
Note that the
includespec only kicks in when you don't supply file paths when callingvectorcode vectorise. If you want a rule to be effective whenever you vectorise some files, you should use theexcludespecs explained below.
Similarly, you can also create a project_root/.vectorcode/vectorcode.exclude
file to denote any files that you want to exclude. This is useful when you have
some files that should be tracked by git, but are not necessary to be indexed by
VectorCode.
These specs can be useful if you want to automatically update the embeddings on certain conditions. See the wiki for an example to use it with git hooks.
If you're working with nested repos, you can pass --recursive/-r so that
the vectorise command will honour the .gitignores and vectorcode.excludes
in the nested repos.
To retrieve a list of documents from the database, you can use the following command:
vectorcode query "your query message"The command can take an arbitrary number of query words, but make sure that full-sentences are enclosed in quotation marks. Otherwise, they may be interpreted as separated words and the embeddings may be inaccurate. The returned results are sorted by their similarity to the query message.
You may also specify how many documents should be retrieved with -n/--number
parameter (default is 1). This is the maximum number of documents that may be
returned. Depending on a number of factors, the actual returned documents may be
less than this number but at least 1 document will be returned.
You may also set a multiplier for the queries. When VectorCode sends queries to
the database, it receives chunks, not document. It then uses some scoring
algorithms to determine which documents are the best fit. The multiplier, set by
command-line flag --multiplier or -m, defines how many chunks VectorCode
will request from the database. The default is -1, which means to retrieve all
chunks. A larger multiplier guarantees the return of n documents, but with the risk
of including too many less-relevant chunks that may affect the document selection.
The query subcommand also supports customising chunk size and overlapping
ratio because when the query message is too long it might be necessary to chunk
it. The parameters follow the same syntax as in vectorise command.
The CLI defaults to return the relative path of the documents from the project
root. To use absolute path, add the --absolute flag.
If you wish to limit the output to "path only" or "document (content) only", you
can achieve this by using the --include flag:
vectorcode query foo bar --include path
This will only include the path in the output. This is effective for both
normal CLI usage and --pipe mode.
For some applications, it may be overkill to use the full document as context
and all you need is the chunks. You can do this by using --include chunk or
--include chunk path in the command. This will return chunks from the
document, and in pipe mode the objects will also include the line numbers of
the first and last lines in the chunk. Note that chunk and document cannot be used at
the same time, and the number of query result (the -n parameter) will refer to
the number of retrieved chunks when you use --include chunk. For the sake of
completeness, the first and last lines of a chunk will be completed to include
the whole lines if the chunker broke the text from mid-line.
You can use vectorcode ls command to list all collections in your ChromaDB.
This is useful if you want to check whether the collection has been created for
the current project or not. The output will be a table with 4 columns:
- Project Root: path to the directory where VectorCode vectorised;
- Collection Size: number of chunks in the database;
- Number of Files: number of files that have been indexed;
- Embedding Function: name of embedding function used for this collection.
This can only discover collections that are stored in the same ChromaDB instance.
You can use vectorcode drop command to remove a collection from Chromadb. This
is useful if you want to clean up your Chromadb database, or if the project has
been deleted, and you don't need its embeddings any more.
You may run vectorcode check command to check whether VectorCode is properly
installed and configured for your project. This currently supports only 1 check:
config: checks whether a project-local configuration directory exists. Prints the project-root if successful, otherwise returns a non-zero exit code.
Running vectorcode check config is faster than running vectorcode query some_message and then getting an empty results.
For empty collections and collections for removed projects, you can use the
vectorcode clean command to remove them at once.
vectorcode files lsprints a list of files that are indexed in the project.vectorcode files rm file1 file2removes the embeddings that belong to the specified files from the project.
Both commands will honor the --project_root flag.
When something doesn't work as expected, you can enable logging by setting the
VECTORCODE_LOG_LEVEL variable to one of ERROR, WARN (WARNING), INFO or
DEBUG. For the CLI that you interact with in your shell, this will output logs
to STDERR and write a log file to ~/.local/share/vectorcode/logs/. For LSP
and MCP servers, because STDIO is used for the RPC, the logs will only be
written to the log file, not STDERR.
For example:
VECTORCODE_LOG_LEVEL=INFO vectorcode vectorise file1.py file2.luaDepending on the MCP/LSP client implementation, you may need to take extra steps to make sure the environment variables are captured by VectorCode.
When you pass --debug parameter to the CLI, VectorCode will track the call
stacks with cprofile. The
stats will be saved to the log directory mentioned above. You may use an
external stats viewer (like snakeviz)
to load the profiling stats for a better viewing experience.
VectorCode can work with coredumpy
to snapshot an exception so that developers can inspect the error
asynchronously. To use this, you'd need to install the vectorcode[debug]
dependency group: uv tool install vectorcode[debug].
VectorCode supports shell completion for bash/zsh/tcsh. You can use vectorcode -s {bash,zsh,tcsh}
or vectorcode --print-completion {bash,zsh,tcsh} to print the completion script
for your shell of choice.
This section covers hardware acceleration when using sentence transformer as the embedding backend.
For Nvidia users this should work out of the box. If not, try setting the following options in the JSON config file:
{
"embedding_params": {
"backend": "torch",
"device": "cuda"
}
}For Intel users, sentence transformer
supports OpenVINO
backend for supported GPU. Run uv install 'vectorcode[intel]' which will
bundle the relevant libraries when you install VectorCode. After that, you will
need to configure SentenceTransformer to use openvino backend. In your
config.json, set backend key in embedding_params to "openvino":
{
"embedding_params": {
"backend": "openvino"
}
}This will run the embedding model on your GPU. This is supported even for some integrated GPUs.
When using the default embedding function, any options inside the
"embedding_params" will go to the class constructor of SentenceTransformer,
so you can always take a look at
their documentation
for detailed information regardless of your platform.
To develop a tool that makes use of VectorCode, you may find the --pipe/-p
flag helpful. It formats the output into JSON and suppress other outputs so that
you can grab whatever's in the STDOUT and parse it as a JSON document. In
fact, this is exactly what I did when I wrote the neovim plugin.
For the query command, here's the format printed in the pipe mode:
[
{
"path": "path_to_your_code.py",
"document": "import something"
},
{
"path": "path_to_another_file.py",
"document": "print('hello world')"
}
]Basically an array of dictionaries with 2 keys: "path" for the path to the
document, and "document" for the content of the document.
If you used --include chunk path parameters, the array will look like this:
[
{
"path": "path_to_your_code.py",
"chunk": "foo",
"start_line": 1,
"end_line": 1,
"chunk_id": "chunk_id_1"
},
{
"path": "path_to_another_file.py",
"chunk": "bar",
"start_line": 1,
"end_line": 1,
"chunk_id": "chunk_id_2"
}
]Keep in mind that both start_line and end_line are inclusive. The chunk_id
is a random string that can be used as a unique identifier to distinguish
between chunks. These are the same IDs used in the database.
The output is in JSON format. It contains a dictionary with the following fields:
"add": number of added documents;"update": number of updated documents;"removed": number of removed documents;"skipped": number of skipped documents (because it's empty or its hash matches the metadata saved in the database);"failed": number of documents that failed to be vectorised. This is usually due to encoding issues.
A JSON array of collection information of the following format will be printed:
{
"project_root": str,
"user": str,
"hostname": str,
"collection_name": str,
"size": int,
"num_files": int,
"embedding_function": str
}"project_root": the path to theproject-root;"user": your *nix username, which are automatically added when vectorising to avoid collision;"hostname": your *nix hostname. The purpose of this field is the same as the"user";"collection_name": the unique identifier for the project used in the database;"size": number of chunks stored in the database;"num_files": number of files that have been vectorised in the project.
A JSON array of strings (the absolute paths to the files in the collection).
There's an experimental implementation of VectorCode CLI, which accepts requests
of workspace/executeCommand
from STDIO. This allows the CLI to keep the embedding model loaded in the
memory/VRAM, and therefore speed up the query by avoiding the IO overhead of
loading the models.
The experimental language server can be installed via the lsp dependency
group:
pipx install 'vectorcode[lsp]'
## or if you have an existing `vectorcode` install:
pipx inject vectorcode 'vectorcode[lsp]' --forceThe LSP request for the workspace/executeCommand is defined as follows:
{
command: str
arguments: list[Any]
}
For the vectorcode-server, the only valid value for the command key is
"vectorcode", and arguments is any other remaining components of a valid CLI
command. For example, to execute vectorcode query -n 10 reranker, the request
would be:
{
command: "vectorcode",
arguments: ["query", "-n", "10", "reranker"]
}
The vectorcode-server optionally accepts a --project_root parameter, which
specifies the default project root for this process. If not specified, it
will:
- try to find a project root by root anchors (
.vectorcodeor.git) starting from the current working directory; - if 1 fails, but the first request contains a
--project_rootparameter, it will use that as the default project root for this process; - if 2 fails too, the process throws an error.
Note that:
- For easier parsing,
--pipeis assumed to be enabled in LSP mode; - A
vectorcode.lockfile will be created in yourdb_pathdirectory if you're using the bundled chromadb server. Please do not delete it while a vectorcode process is running; - The LSP server supports
vectorise,query,lsandfilessubcommands. The other subcommands may be added in the future; - If the
--project_rootparameter is not provided, the LSP server will try to use the workspace folders provided by the LSP client as the project root (if available).
Model Context Protocol (MCP) is an open protocol that standardizes how applications provide context to LLMs. VectorCode provides an experimental implementation that provides the following features:
ls: list local collections, similar to thelssubcommand in the CLI;query: query from a given collection, similar to thequerysubcommand in the CLI;vectorise: vectorise files into a given project;files_ls: show files that have been indexed for the current project;files_rm: remove some files from the database for a project.
To try it out, install the vectorcode[mcp] dependency group and the MCP server
is available in the shell as vectorcode-mcp-server.
The MCP server entry point (vectorcode-mcp-server) provides some CLI options
that you can use to customise the default behaviour of the server. To view the
supported options, run vectorcode-mcp-server -h in your shell.
If you want to integrate VectorCode in your LLM application, you may want to
write some prompt that tells the LLM how to use this tool. Apart from the
function signatures, a list of instructions used by the MCP server is included in
this package. This can be retrieved by running the vectorcode prompts command.
This commands optionally accepts names of other subcommands as arguments. It'll
print a list of pre-defined prompts that are suitable for the specified
subcommands. You may run vectorcode prompts --help for the supported options.