Update llama.cpp-models.md

2024-09-20 18:45:09 +02:00 · 2023-05-16 00:49:32 -03:00 · 2023-05-16 00:49:32 -03:00 · cd9be4c2ba
commit cd9be4c2ba
parent 26cf8c2545
1 changed files with 11 additions and 0 deletions
--- a/docs/llama.cpp-models.md
+++ b/docs/llama.cpp-models.md
@ -16,11 +16,22 @@ Enabled with the `--n-gpu-layers` parameter. If you have enough VRAM, use a high
 Note that you need to manually install `llama-cpp-python` with GPU support. To do that:
 #### Linux
 ```
 pip uninstall -y llama-cpp-python
 CMAKE_ARGS="-DLLAMA_CUBLAS=on" FORCE_CMAKE=1 pip install llama-cpp-python --no-cache-dir
 ```
 #### Windows
 ```
 pip uninstall -y llama-cpp-python
 set CMAKE_ARGS="-DLLAMA_CUBLAS=on"
 set FORCE_CMAKE=1
 pip install llama-cpp-python --no-cache-dir
 ```
 Here you can find the different compilation options for OpenBLAS / cuBLAS / CLBlast: https://pypi.org/project/llama-cpp-python/
 ## Performance