# Ollama GPU Support

## GPU Support Overview

Ollama supports GPU acceleration through two primary backends:

- NVIDIA CUDA: for NVIDIA GPUs, using the CUDA drivers and libraries
- AMD ROCm: for AMD GPUs, using the ROCm drivers and libraries

Apple GPUs are also supported, via the Metal API (see below). Ollama generally supports machines with 8GB of memory (preferably VRAM). For Docker-specific GPU configuration, see Docker Deployment; for troubleshooting GPU issues, see Troubleshooting.

## CUDA Compute Capability

The minimum CUDA compute capability supported by Ollama appears to be 5.2. If Ollama runs in CPU-only mode and completely ignores your GPU (an NVIDIA GeForce GT710, for example), an insufficient compute capability is the most likely cause. The symptom in the logs looks like this:

```
level=INFO source=gpu.go:221 msg="looking for compatible GPUs"
level=INFO source=gpu.go:386 msg="no compatible GPUs were discovered"
```

even when `nvidia-smi` shows that the server has a GPU (an NVIDIA RTX 4000 SFF Ada, in one report). On supported hardware, the logs should instead confirm that the GPU was detected and initialized, for example an NVIDIA GeForce RTX 4050 Laptop GPU initialized with CUDA (v12). If you still don't see the GPU being used when you ask the model questions, check `nvidia-smi` while a prompt is running: it should show that `ollama.exe` (on Windows) is using the card.

## AVX Instructions

Ollama also checks the CPU's instruction set. If journalctl reports that the "CPU does not have AVX or AVX2", Ollama responds by "disabling GPU support", so a pre-AVX CPU can force CPU-only mode even when a supported GPU is installed.

## Suspend/Resume on Linux

On Linux, after a suspend/resume cycle, Ollama will sometimes fail to discover your NVIDIA GPU and fall back to running on the CPU. You can work around this driver bug by reloading the NVIDIA UVM driver:

```
sudo rmmod nvidia_uvm && sudo modprobe nvidia_uvm
```

## AMD Radeon

Ollama now supports AMD graphics cards (announced March 14, 2024, initially in preview on Windows and Linux): all the features of Ollama can be accelerated by AMD graphics cards on Linux and Windows. Before that, Mac and Linux machines were both supported, but on Linux an NVIDIA GPU was required for GPU acceleration.

AMD support is built on the ROCm library, which does not support all AMD GPUs. The currently supported AMD architectures are gfx1030, gfx1100, gfx1101, gfx1102, and gfx906; the full list of supported cards is in docs/gpu.md of the ollama/ollama repository. An unsupported architecture is detected but not used: Ollama may report, for instance, that it found an AMD GPU with the gfx1103 architecture and then fall back to CPU because that architecture is not supported.

## Docker

If you have an AMD GPU that supports ROCm, you can simply run the ROCm build of the Ollama image, passing the AMD GPU device nodes through to the container:

```
docker run -d --restart always --device /dev/kfd --device /dev/dri -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama:rocm
```

Once the container is running, you can pull and chat with a model inside it, for example with `docker exec ollama ollama run llama3.2`. If you store models in a host directory such as /home/ollama instead of a named volume, mount it with the `:z` option so SELinux permissions are handled.

In some Linux distributions, SELinux can also prevent containers from accessing the AMD GPU devices themselves. On the host system you can run `sudo setsebool container_use_devices=1` to allow containers to use devices.

## Forcing a Close LLVM Target

If your AMD GPU's architecture is not supported by ROCm, in some cases you can force the system to try a similar LLVM target that is close. For example, the Radeon RX 5400 is gfx1034 (also known as 10.3.4); ROCm does not currently support this target, but the nearby gfx1030 (10.3.0) is the closest supported one.
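Ollama exposes this override through the `HSA_OVERRIDE_GFX_VERSION` environment variable, which takes the target in x.y.z form. A minimal sketch for the RX 5400 case above, assuming gfx1030 (10.3.0) is close enough to your card's gfx1034:

```sh
# Treat the unsupported gfx1034 (Radeon RX 5400) as the supported gfx1030 target.
# The override is written in x.y.z form, so gfx1030 becomes 10.3.0.
HSA_OVERRIDE_GFX_VERSION="10.3.0" ollama serve
```

With the ROCm Docker image shown above, the same override can be passed to the container by adding `-e HSA_OVERRIDE_GFX_VERSION=10.3.0` to the `docker run` command line.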
## Metal (Apple GPUs)

Ollama supports GPU acceleration on Apple devices via the Metal API.

## Notes

- AMD GPUs are supported on Windows and Linux with ROCm.
- Models can be run in both 'generate' and 'embedding' modes if supported.
- The default context length is 4096 tokens.
- Consider using lower quantization (4-bit/8-bit) for better performance on limited hardware.
- Power consumption estimates account for GPU utilization patterns during LLM inference.
- Ollama can utilize multiple GPUs, even when they are not the same chip.

## Choosing a GPU

Choosing the right GPU for LLMs on Ollama depends on your model size, VRAM requirements, and budget. Workstation and consumer GPUs like the RTX A4000 and RTX 4090 are powerful and cost-effective, while enterprise solutions like the A100 and H100 offer unmatched performance for massive models.

## GPU Selection

If you have multiple NVIDIA GPUs in your system and want to limit Ollama to a subset of them, set `CUDA_VISIBLE_DEVICES` to a comma-separated list of GPUs. Numeric IDs can be used, but their order may change, so UUIDs are more reliable; you can discover your GPUs' UUIDs by running `nvidia-smi -L`. Likewise, if you have multiple AMD GPUs and want to limit Ollama to a subset of them, set `ROCR_VISIBLE_DEVICES` to a comma-separated list of GPUs; you can see the device list with `rocminfo`. In either case, if you want Ollama to ignore the GPUs and force CPU usage, use an invalid GPU ID (for example, "-1").
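A minimal sketch of these selection variables (the NVIDIA UUID below is a placeholder, not a real device ID):

```sh
# List NVIDIA GPUs with their stable UUIDs (numeric indices can change order).
nvidia-smi -L

# Limit Ollama to a single NVIDIA GPU, addressed by UUID (placeholder value).
CUDA_VISIBLE_DEVICES=GPU-0a1b2c3d-4e5f-6789-abcd-ef0123456789 ollama serve

# Limit Ollama to the first AMD GPU reported by rocminfo.
ROCR_VISIBLE_DEVICES=0 ollama serve

# Force CPU-only inference by passing an invalid GPU ID.
CUDA_VISIBLE_DEVICES=-1 ollama serve
```

Note that these variables must be visible to the Ollama server process (for example, in the systemd unit or via `docker run -e ...`), not just in the client shell where you run prompts.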