index

Modules

embedding: Compute normalized text embeddings and print cosine similarity.
hf_download: Download GGUF files from HuggingFace Hub.
llama: D bindings and wrappers for llama.cpp.
multimodal: Multimodal inference CLI: feed an image (and optional text) to a vision model.
save_load_state: Demonstrate context-state save and load for reproducible generation.
simple: Minimal text-completion example. Usage: simple -m model.gguf [-n n_predict] [-ngl n_gpu_layers] [prompt]
test_bindings: Undocumented in source.
tokenize_example: Print each token id and its string piece for the given text.