llama.cpp is an open-source project that enables efficient inference of Meta’s LLaMA and other compatible models using pure C/C++, with the main goal of enabling LLM inference with minimal setup and state-of-the-art performance on a wide variety of hardware. It offers optimizations for various architectures, supports integer quantization for faster inference and reduced memory usage, and includes custom CUDA kernels, Vulkan, SYCL, and OpenCL backend support. Wave specifically supports the llama.cpp HTTP Server, which makes the models available locally to run from other applications like Wave.Documentation Index
Fetch the complete documentation index at: https://legacydocs.waveterm.dev/llms.txt
Use this file to discover all available pages before exploring further.
Installation
Please visit llama.cpp’s GitHub page for instructions on downloading and installing llama.cpp, as well as a full list of supported models.Configuration
After installing and configuring the llama.cpp server, you can start using it in Wave by setting a single parameter:aibaseurl. This parameter can be set either through the UI or from the command line, but please note that the parameter names are slightly different depending on the method you choose.
Parameters
- AI Base URL: Set this parameter to the base URL or endpoint that Wave AI should query. For llama.cpp servers running locally, use http://localhost:8080. Please note that the port number
8080may be different depending on your specific installation. For remote llama.cpp server instances, replacelocalhostwith the appropriate hostname or IP address of the server where llama.cpp is running. If the port number is different from the default8080, update it accordingly in the URL.
Configuring via the UI
To configure llama.cpp from Wave’s user interface, navigate to the “Settings” menu and set theAI Base URL parameter as described in the previous section.
Configuring via the CLI
To configure llama.cpp using the command line, set theaibaseurl parameter using the /client:set command, as shown in the example below.
Usage
Once you have installed and configured your llama.cpp server, you can start using it in Wave. There are two primary ways to interact with your newly configured LLM: Interactive Mode and by using the /chat command.- Interactive Mode: To enter Interactive Mode, click the “Wave AI” button in the command box or use the
ctrl + spaceshortcut. This will open an interactive chat session where you can have a continuous conversation with the AI assistant powered by your llama.cpp model. - /chat: Alternatively, you can use the /chat command followed by your question to get a quick answer from your llama.cpp model directly in the terminal.
Troubleshooting
If you encounter issues while using llama.cpp with Wave AI, consider the following troubleshooting steps:- Connection failures: If Wave AI fails to connect to your llama.cpp server or returns an error message, verify that the llama.cpp server is running is running and accessible from the system where Wave is installed. Check the llama.cpp logs for any error messages or indications of why the connection might be failing.
- Timeouts: If you’re unable to complete a query or incur frequent timeouts, try adjusting the
aitimeoutparameter to a higher value. This will give your llama.cpp server more time to process and respond to your requests, especially if you are running it on a system with limited hardware resources. - Incorrect base URL or port: Ensure that the
aibaseurlparameter points to the correct URL and port number where the llama.cpp server is running. If you have changed the default port or are running llama.cpp on a remote server, update the URL accordingly. - Unexpected behavior or inconsistent results: If you encounter unexpected behavior or inconsistent results when using the llama.cpp server with Wave AI, try resetting the
aibaseurlandaimodelparameters to their default values and reconfiguring llama.cpp from scratch. This can help rule out any configuration issues that might be causing problems.
Reset Wave AI
At any time if you find that you wish to return to the default Wave AI experience, you can reset theaibaseurl and aimodel parameters to their default state by using the following commands.

