Difference between revisions of "Nvidia"

From NixOS Wiki
Jump to: navigation, search
("things like Vulkan, CUDA and OpenCL" work in static mode (just tested all of them), removed from cons)
m (rollback unauthorized mass edits)
Tag: Rollback
 
(109 intermediate revisions by 54 users not shown)
Line 1: Line 1:
== Card type ==
+
= NixOS Manual =
 +
First check out the [https://nixos.org/manual/nixos/stable/#sec-x11-graphics-cards-nvidia method] in the NixOS Manual.
  
* MXM / output-providing card (shows as ''VGA Controller'' in ''lspci''), i.e. graphics card in desktop computer or in some laptops
+
= Installing Nvidia Drivers on NixOS =
* muxless/non-MXM Optimus cards have no display outputs and show as ''3D Controller'' in ''lspci'' output, seen in most modern consumer laptops
 
  
== Non-optimus mode ==
+
NixOS uses a functional package management approach, which necessitates specific procedures for driver installation. When considering NVIDIA GPU drivers in a Linux environment, the installation process can be more complex compared to AMD and Intel. This complexity arises primarily because NVIDIA's official drivers are closed source and not typically bundled with many distributions. This document outlines the technical steps required to install NVIDIA GPU drivers on NixOS, factoring in both the unique nature of NixOS and the proprietary status of NVIDIA's drivers.
  
'''You need MXM card'''. Follow [https://nixos.org/nixos/manual/#sec-x11-graphics-cards-nvidia NVIDIA Graphics Cards] section in official manual.
+
== Enable Unfree Software Repositories ==
  
In case of laptop you may also need to use a BIOS option to select which card to use for the internal display.
+
Make sure to allow Unfree Software. The unfree NVIDIA packages include <code>nvidia-x11</code>, <code>nvidia-settings</code>, and <code>nvidia-persistenced</code>.
  
== Optimus ==
+
== Determining the Correct Driver Version ==
  
'''Only for laptops'''. There are currently two solutions available under NixOS:
+
You will next need to determine the appropriate driver version for your card. The following options are available:
  
=== Bumblebee ===
+
<syntaxhighlight lang="nix">
 +
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.stable;
 +
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.beta;
 +
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.production;  # (installs 550)
 +
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.vulkan_beta;
 +
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.legacy_470;
 +
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.legacy_390;
 +
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.legacy_340;
 +
</syntaxhighlight>
  
Unofficial solution.
+
Out of the above, `stable` and `beta` will work for the latest RTX cards and some lower cards so long as they're not considered "legacy" by Nvidia. For "legacy" cards, you can consult the [https://www.nvidia.com/en-us/drivers/unix/legacy-gpu/ Nvidia official legacy driver list] and check whether your device is supported by the 470, 390 or 340 branches. If so, you can use the corresponding <code>legacy_470</code>, <code>legacy_390</code> or <code>legacy_340</code> driver. For a full list of options, consult the [https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/os-specific/linux/nvidia-x11/default.nix nvidia-x11 module repository].
 
* Pros:  
 
** works out of the box, just start the game via the ''primusrun'' or ''optirun'' wrapper
 
** works under existing X11 server
 
  
* Cons:
+
{{note|As of early March 2024 the <code>production</code> driver has been updated from <code>535</code> to <code>550</code>. This is a breaking change for some people, especially those on Wayland.
** some performance penalties, running under wayland decreases performance even more
+
To resolve this follow the steps under [https://nixos.wiki/wiki/Nvidia#Running_the_new_RTX_SUPER_on_nixos_stable Running the new RTX SUPER on nixos stable]}}
** things like Vulkan, CUDA and OpenCL are problematic to configure
 
  
Use option <syntaxHighlight lang="nix">hardware.bumblebee.enable = true;</syntaxHighlight>
+
Once you've determined the correct driver version, note it down; you'll need it in the next step.
  
=== Nvidia PRIME ===
+
== Modifying NixOS Configuration ==
Official solution by nvidia.
 
  
==== dynamic mode ====
+
Ensure that the following is in your NixOS configuration file (customizing as you prefer):
  
Like with Bumblebee, in this mode nvidia card is only turned on by need, other time it stays in power saving mode.
+
{{warning|If you are using a laptop, the below configuration update is not sufficient to get your Nvidia card running! Once you've entered it, please continue reading, as there are important adjustments that must then be made to the configuration''' before your laptop graphics will work properly.}}
  
* Pros:
+
{{file|/etc/nixos/configuration.nix|nix|<nowiki>
** Has better performance than Bumblebee
+
{ config, lib, pkgs, ... }:
** Vulkan, CUDA and OpenCL work, though CUDA needs an additional device creation rule https://github.com/NixOS/nixpkgs/blob/05e375d7103ac51e2da917965c37246c99f1ae4f/nixos/modules/hardware/video/nvidia.nix#L72
+
{
** Using a pure wayland setup is supported
+
 
 +
  # Enable OpenGL
 +
  hardware.opengl = {
 +
    enable = true;
 +
    driSupport = true;
 +
    driSupport32Bit = true;
 +
  };
 +
 
 +
  # Load nvidia driver for Xorg and Wayland
 +
  services.xserver.videoDrivers = ["nvidia"];
  
* Cons:
+
  hardware.nvidia = {
** There are still some slight issues noted at the top of script
+
 
** Launches new X11 server (and so it's preferable to run the WM/DE alongside the program)
+
    # Modesetting is required.
 +
    modesetting.enable = true;
 +
 
 +
    # Nvidia power management. Experimental, and can cause sleep/suspend to fail.
 +
    # Enable this if you have graphical corruption issues or application crashes after waking
 +
    # up from sleep. This fixes it by saving the entire VRAM memory to /tmp/ instead
 +
    # of just the bare essentials.
 +
    powerManagement.enable = false;
 +
 
 +
    # Fine-grained power management. Turns off GPU when not in use.
 +
    # Experimental and only works on modern Nvidia GPUs (Turing or newer).
 +
    powerManagement.finegrained = false;
 +
 
 +
    # Use the NVidia open source kernel module (not to be confused with the
 +
    # independent third-party "nouveau" open source driver).
 +
    # Support is limited to the Turing and later architectures. Full list of
 +
    # supported GPUs is at:
 +
    # https://github.com/NVIDIA/open-gpu-kernel-modules#compatible-gpus
 +
    # Only available from driver 515.43.04+
 +
    # Currently alpha-quality/buggy, so false is currently the recommended setting.
 +
    open = false;
 +
 
 +
    # Enable the Nvidia settings menu,
 +
# accessible via `nvidia-settings`.
 +
    nvidiaSettings = true;
 +
 
 +
    # Optionally, you may need to select the appropriate driver version for your specific GPU.
 +
    package = config.boot.kernelPackages.nvidiaPackages.stable;
 +
  };
 +
  ...
 +
}
 +
</nowiki>}}
 +
 
 +
== Laptop Configuration: Hybrid Graphics (Nvidia Optimus PRIME) ==
 +
 
 +
In order to correctly finish configuring your Nvidia graphics driver, you must follow the below steps, which differ depending on whether or not you are using a hybrid graphics setup or not. A laptop with hybrid graphics possesses both an integrated GPU (often from the central processor) and a discrete, more powerful Nvidia GPU, typically for performance-intensive tasks. This dual-GPU setup allows for power-saving during basic tasks and higher graphics performance when needed.
 +
 
 +
'''Nvidia Optimus PRIME''' is a technology developed by Nvidia to optimize the power consumption and performance of laptops equipped with their GPUs. It seamlessly switches between the integrated graphics, usually from Intel, for lightweight tasks to save power, and the discrete Nvidia GPU for performance-intensive tasks like gaming or video editing. By dynamically balancing graphics power and battery life, Optimus provides the best of both worlds, ensuring that users get longer battery life without sacrificing graphical performance.
 +
 
 +
=== Configuring Optimus PRIME: Bus ID Values (Mandatory) ===
 +
 
 +
Before we can continue, we must mandatorily first determine the Bus ID values for both your Nvidia and Intel GPUs. This step will be essential regardless of which configuration you later adopt.
 +
 
 +
First, install the <code>lshw</code> package in order to be able to use the <code>lshw</code> command, then run:
 +
 
 +
<code>sudo lshw -c display</code>
 +
 
 +
You will likely get something like this:
 +
 
 +
<syntaxhighlight lang="bash">
 +
*-display               
 +
description: i915drmfb
 +
physical id: 0
 +
bus info: pci@0000:0e:00.0
 +
logical name: /dev/fb0
 +
version: a1
 +
width: 64 bits
 +
clock: 33MHz
 +
capabilities: pm msi pciexpress bus_master cap_list rom fb
 +
configuration: depth=32 driver=nvidia latency=0 mode=2560x1600 visual=truecolor xres=2560 yres=1600
 +
resources: iomemory:600-5ff iomemory:620-61f irq:220 memory:85000000-85ffffff memory:6000000000-61ffffffff memory:6200000000-6201ffffff ioport:5000(size=128) memory:86000000-8607ffff
 +
 
 +
*-display
 +
product: i915drmfb
 +
physical id: 2
 +
bus info: pci@0000:00:02.0
 +
logical name: /dev/fb0
 +
version: 04
 +
width: 64 bits
 +
clock: 33MHz
 +
capabilities: pciexpress msi pm bus_master cap_list rom fb
 +
configuration: depth=32 driver=i915 latency=0 resolution=2560,1600
 +
resources: iomemory:620-61f iomemory:400-3ff irq:221 memory:622e000000-622effffff memory:4000000000-400fffffff ioport:6000(size=64) memory:c0000-dffff memory:4010000000-4016ffffff memory:4020000000-40ffffffff
 +
</syntaxhighlight>
 +
 
 +
Note the two values under "bus info" above, which may differ from laptop to laptop. Our Nvidia Bus ID is <code>0e:00.0</code> and our Intel Bus ID is <code>00:02.0</code>. Watch out for the formatting; convert them from hexadecimal to decimal, remove the padding (leading zeroes), replace the dot with a colon, then add them like this:
  
'''1.''' Configure NixOS to allow Intel and Nvidia X.Org drivers coexist together (only needed because NixOS forces LD_LIBRARY_PATH variable propagation), and also to disable nvidia card by default.
 
 
{{file|/etc/nixos/configuration.nix|nix|<nowiki>
 
{{file|/etc/nixos/configuration.nix|nix|<nowiki>
 
{
 
{
  # disable card with bbswitch by default since we turn it on only on demand!
+
hardware.nvidia.prime = {
  hardware.nvidiaOptimus.disable = true;
+
# Make sure to use the correct Bus ID values for your system!
  # install nvidia drivers in addition to intel one
+
intelBusId = "PCI:0:2:0";
  hardware.opengl.extraPackages = [ pkgs.linuxPackages.nvidia_x11.out ];
+
nvidiaBusId = "PCI:14:0:0";
  hardware.opengl.extraPackages32 = [ pkgs.linuxPackages.nvidia_x11.lib32 ];
+
};
 
}
 
}
 
</nowiki>}}
 
</nowiki>}}
  
'''2.''' Download [https://github.com/dukzcry/crap/tree/master/primerun primerun script].
+
=== Optimus PRIME Option A: Offload Mode ===
 +
 
 +
''Offload Mode is available in NixOS 20.09 and higher, and requires an Nvidia card of the Turing generation or newer, and an Intel chipset from the Coffee Lake generation or newer.''
 +
 
 +
Offload mode puts your Nvidia GPU to sleep and lets the Intel GPU handle all tasks, except if you call the Nvidia GPU specifically by "offloading" an application to it. For example, you can run your laptop normally and it will use the energy-efficient Intel GPU all day, and then you can offload a game from Steam onto the Nvidia GPU to make the Nvidia GPU run that game only. For many, this is the most desirable option.
 +
 
 +
Offload mode is enabled by running your programs with specific environment variables. Here's a sample script called <code>nvidia-offload</code> that you can run wrapped around your executable, for example <code>nvidia-offload glxgears</code>:
  
Like with Bumblebee, start the game via ''primerun.sh''.
+
{{file|nvidia-offload|bash|<nowiki>
 +
export __NV_PRIME_RENDER_OFFLOAD=1
 +
export __NV_PRIME_RENDER_OFFLOAD_PROVIDER=NVIDIA-G0
 +
export __GLX_VENDOR_LIBRARY_NAME=nvidia
 +
export __VK_LAYER_NV_optimus=NVIDIA_only
 +
exec "$@"
 +
</nowiki>}}
 +
 +
To enable offload mode, finish configuring your Nvidia driver by adding the following to your NixOS configuration file:
  
==== static mode ====
+
{{file|/etc/nixos/configuration.nix|nix|<nowiki>
 +
{
 +
hardware.nvidia.prime = {
 +
offload = {
 +
enable = true;
 +
enableOffloadCmd = true;
 +
};
 +
# Make sure to use the correct Bus ID values for your system!
 +
intelBusId = "PCI:0:2:0";
 +
nvidiaBusId = "PCI:14:0:0";
 +
};
 +
}
 +
</nowiki>}}
  
In this mode nvidia card is turned on constantly, having impact on laptop battery and health.
+
=== Optimus PRIME Option B: Sync Mode ===
  
* Pros:
+
Enabling PRIME sync introduces better performance and greatly reduces screen tearing, at the expense of higher power consumption since the Nvidia GPU will not go to sleep completely unless called for, as is the case in Offload Mode. It may also cause its own issues in rare cases. '''PRIME Sync and Offload Mode cannot be enabled at the same time.'''
** better performance than Bumblebee
 
** out of box experience
 
** works under existing X11 server
 
  
* Cons:
+
PRIME sync may also solve some issues with connecting a display in clamshell mode directly to the GPU.
** nvidia is turned on constantly
 
** no wayland support
 
  
 
{{file|/etc/nixos/configuration.nix|nix|<nowiki>
 
{{file|/etc/nixos/configuration.nix|nix|<nowiki>
 
{
 
{
   services.xserver.videoDrivers = [ "nvidia" ];
+
   hardware.nvidia.prime = {
  hardware.nvidia.optimus_prime.enable = true;
+
    sync.enable = true;
  # Bus ID of the NVIDIA GPU. You can find it using lspci
+
 
  hardware.nvidia.optimus_prime.nvidiaBusId = "PCI:1:0:0";
+
    # Make sure to use the correct Bus ID values for your system!
  # Bus ID of the Intel GPU. You can find it using lspci
+
    nvidiaBusId = "PCI:14:0:0";
  hardware.nvidia.optimus_prime.intelBusId = "PCI:0:2:0";
+
    intelBusId = "PCI:0:2:0";
    
+
   };
 
}
 
}
 
</nowiki>}}
 
</nowiki>}}
  
== non-NixOS case ==
+
=== Optimus Option C: Reverse Sync Mode (Experimental) ===
 +
 
 +
This feature is relatively new and may not work properly on all systems ([https://forums.developer.nvidia.com/t/the-all-new-outputsink-feature-aka-reverse-prime/129828 see discussion]). It is also only available on driver 460.39 or newer. Reverse sync also only works with <code>services.xserver.displayManager.setupCommands</code> compatible Display Managers (LightDM, GDM and SDDM).
 +
 
 +
{{file|/etc/nixos/configuration.nix|nix|<nowiki>
 +
{
 +
  hardware.nvidia.prime = {
 +
    reverseSync.enable = true;
 +
    # Enable if using an external GPU
 +
    allowExternalGpu = false;
 +
 
 +
    # Make sure to use the correct Bus ID values for your system!
 +
    intelBusId = "PCI:0:2:0";
 +
    nvidiaBusId = "PCI:14:0:0";
 +
  };
 +
}
 +
</nowiki>}}
  
* The [https://github.com/guibou/nixGL nixGL project] provides wrapper to use GL drivers outside of NixOS. You need to have nvidia drivers installed on your distro (for kernel modules). Then supply nvidia driver version you use on host system to nixGL.
+
== Useful Tips ==
* [https://github.com/dukzcry/crap/tree/master/primerun Optimus only]. Primerun will build nvidia kernel modules against your currently running kernel.
 
  
== CUDA ==
+
=== Check NixosHardware ===
 +
You should check the  [https://github.com/NixOS/nixos-hardware nixoshardware] GitHub repository. It is possible that someone already wrote a hardware configuration for your device and that usually takes care of drivers. If so, follow the upstream documentation to enable the required modules. 
  
There some possible ways to setup a development environment using CUDA on NixOS. This can accomplished in the following ways:
+
=== Multiple Boot Configurations ===
 +
Imagine you have a laptop that you mostly use in clamshell mode (docked, connected to an external display and plugged into a charger) but that you sometimes use on the go.
  
* By making a FHS user env
+
In clamshell mode, using PRIME Sync is likely to lead to better performance, external display support, etc., at the cost of potentially (but not always) lower battery life. However, when using the laptop on the go, you may prefer to use offload mode.
  
{{file|cuda-fsh.nix|nix|<nowiki>
+
NixOS supports "specialisations", which allow you to automatically generate different boot profiles when rebuilding your system. We can, for example, enable PRIME sync by default, but also create a "on-the-go" specialization that disables PRIME sync and instead enables offload mode:
{ pkgs ? import <nixpkgs> {} }:
+
 
 +
{{file|/etc/nixos/configuration.nix|nix|<nowiki>
 +
{
 +
 
 +
specialisation = {
 +
  on-the-go.configuration = {
 +
    system.nixos.tags = [ "on-the-go" ];
 +
    hardware.nvidia = {
 +
      prime.offload.enable = lib.mkForce true;
 +
      prime.offload.enableOffloadCmd = lib.mkForce true;
 +
      prime.sync.enable = lib.mkForce false;
 +
    };
 +
  };
 +
};
  
let fhs = pkgs.buildFHSUserEnv {
 
        name = "cuda-env";
 
        targetPkgs = pkgs: with pkgs;
 
              [ git
 
                gitRepo
 
                gnupg
 
                autoconf
 
                curl
 
                procps
 
                gnumake
 
                utillinux
 
                m4
 
                gperf
 
                unzip
 
                cudatoolkit
 
                linuxPackages.nvidia_x11
 
                libGLU_combined
 
xorg.libXi xorg.libXmu freeglut
 
                xorg.libXext xorg.libX11 xorg.libXv xorg.libXrandr zlib
 
ncurses5
 
stdenv.cc
 
binutils
 
                ];
 
          multiPkgs = pkgs: with pkgs; [ zlib ];
 
          runScript = "bash";
 
          profile = ''
 
                  export CUDA_PATH=${pkgs.cudatoolkit}
 
                  # export LD_LIBRARY_PATH=${pkgs.linuxPackages.nvidia_x11}/lib
 
  export EXTRA_LDFLAGS="-L/lib -L${pkgs.linuxPackages.nvidia_x11}/lib"
 
  export EXTRA_CCFLAGS="-I/usr/include"
 
            '';
 
          };
 
in pkgs.stdenv.mkDerivation {
 
  name = "cuda-env-shell";
 
  nativeBuildInputs = [ fhs ];
 
  shellHook = "exec cuda-env";
 
 
}
 
}
 
</nowiki>}}
 
</nowiki>}}
  
 +
(You can also add other settings here totally unrelated to Nvidia, such as power profiles, etc.)
 +
 +
After rebuilding and rebooting, you'll see in your boot menu under each Generation an "on-the-go" option, which will let you boot into the on-to-go specialisation for that generation.
 +
 +
=== Using GPUs on non-NixOS ===
 +
If you're using Nix-packaged software on a non-NixOS system, you'll need a workaround to get everything up-and-running. The [https://github.com/guibou/nixGL nixGL project] provides wrapper to use GL drivers on non-NixOS systems. You need to have GPU drivers installed on your distro (for kernel modules). With nixGL installed, you'll run <code>nixGL foobar</code> instead of  <code>foobar</code>.
 +
 +
Note that nixGL is not specific to Nvidia GPUs, and should work with just about any GPU.
 +
 +
=== CUDA and using your GPU for compute ===
 +
See the [[CUDA]] wiki page.
 +
 +
=== Using Steam in Offload Mode ===
 +
In order to automatically launch Steam in offload mode, you need to add the following to your <code>~/.bashrc</code>:
 +
<syntaxhighlight lang="bash">
 +
export XDG_DATA_HOME="$HOME/.local/share"
 +
</syntaxhighlight>   
 +
 +
Then, if you are using NixOS Steam, run:
 +
 +
<syntaxhighlight lang="bash">
 +
mkdir -p ~/.local/share/applications
 +
sed 's/^Exec=/&nvidia-offload /' /run/current-system/sw/share/applications/steam.desktop > ~/.local/share/applications/steam.desktop
 +
</syntaxhighlight>
 +
 +
For Flatpak Steam, run:
 +
 +
<syntaxhighlight lang="bash">
 +
mkdir -p ~/.local/share/applications
 +
sed 's/^Exec=/&nvidia-offload /' /var/lib/flatpak/exports/share/applications/com.valvesoftware.Steam.desktop > ~/.local/share/applications/com.valvesoftware.steam.desktop
 +
</syntaxhighlight>
 +
 +
Then restart your graphical environment session (or simply reboot).
 +
 +
=== Running the new RTX SUPER on nixos stable ===
 +
 +
The new RTX Super are not supported by the 545 driver. On Nixos stable, you want to use the 535 driver that come from unstable branch or the 550 (beta).
 +
To do that you need to manually call the driver you want in your config.
 +
Check on this link to choose the driver you want and change your config accordingly :
 +
https://github.com/NixOS/nixpkgs/blob/979a311fbd179b86200e412a3ed266b64808df4e/pkgs/os-specific/linux/nvidia-x11/default.nix#L36
 +
 +
{{file|/etc/nixos/nvidia.nix|nix|<nowiki>
 +
{ pkgs, config, libs, ... }:
 +
 +
{
 +
 +
# Enable OpenGL
 +
  hardware.opengl = {
 +
    enable = true;
 +
    driSupport = true;
 +
    driSupport32Bit = true;
 +
  };
 +
 +
# Load nvidia driver for Xorg and Wayland
 +
  services.xserver.videoDrivers = ["nvidia"];
 +
 
 +
  hardware.nvidia.modesetting.enable = true;
 +
  hardware.nvidia.powerManagement.enable = false;
 +
  hardware.nvidia.powerManagement.finegrained = false;
 +
  hardware.nvidia.open = false;
 +
  hardware.nvidia.nvidiaSettings = true;
 +
# Special config to load the latest (535 or 550) driver for the support of the 4070 SUPER
 +
  hardware.nvidia.package = let
 +
  rcu_patch = pkgs.fetchpatch {
 +
    url = "https://github.com/gentoo/gentoo/raw/c64caf53/x11-drivers/nvidia-drivers/files/nvidia-drivers-470.223.02-gpl-pfn_valid.patch";
 +
    hash = "sha256-eZiQQp2S/asE7MfGvfe6dA/kdCvek9SYa/FFGp24dVg=";
 +
  };
 +
in config.boot.kernelPackages.nvidiaPackages.mkDriver {
 +
    version = "535.154.05";
 +
    sha256_64bit = "sha256-fpUGXKprgt6SYRDxSCemGXLrEsIA6GOinp+0eGbqqJg=";
 +
    sha256_aarch64 = "sha256-G0/GiObf/BZMkzzET8HQjdIcvCSqB1uhsinro2HLK9k=";
 +
    openSha256 = "sha256-wvRdHguGLxS0mR06P5Qi++pDJBCF8pJ8hr4T8O6TJIo=";
 +
    settingsSha256 = "sha256-9wqoDEWY4I7weWW05F4igj1Gj9wjHsREFMztfEmqm10=";
 +
    persistencedSha256 = "sha256-d0Q3Lk80JqkS1B54Mahu2yY/WocOqFFbZVBh+ToGhaE=";
  
* By making a nix-shell
+
    #version = "550.40.07";
{{file|cuda-shell.nix|nix|<nowiki>
+
    #sha256_64bit = "sha256-KYk2xye37v7ZW7h+uNJM/u8fNf7KyGTZjiaU03dJpK0=";
{ pkgs ? import <nixpkgs> {} }:
+
    #sha256_aarch64 = "sha256-AV7KgRXYaQGBFl7zuRcfnTGr8rS5n13nGUIe3mJTXb4=";
 +
    #openSha256 = "sha256-mRUTEWVsbjq+psVe+kAT6MjyZuLkG2yRDxCMvDJRL1I=";
 +
    #settingsSha256 = "sha256-c30AQa4g4a1EHmaEu1yc05oqY01y+IusbBuq+P6rMCs=";
 +
    #persistencedSha256 = "sha256-11tLSY8uUIl4X/roNnxf5yS2PQvHvoNjnd2CB67e870=";
  
pkgs.stdenv.mkDerivation {
+
    patches = [ rcu_patch ];
  name = "cuda-env-shell";
+
};
  buildInputs = with pkgs;
 
                  [ git gitRepo gnupg autoconf curl
 
                    procps gnumake utillinux m4 gperf unzip
 
                    cudatoolkit linuxPackages.nvidia_x11
 
                    libGLU_combined
 
                    xorg.libXi xorg.libXmu freeglut
 
                    xorg.libXext xorg.libX11 xorg.libXv xorg.libXrandr zlib
 
                    ncurses5 stdenv.cc binutils
 
                  ];
 
  shellHook = ''
 
      export CUDA_PATH=${pkgs.cudatoolkit}
 
      # export LD_LIBRARY_PATH=${pkgs.linuxPackages.nvidia_x11}/lib:${pkgs.ncurses5}/lib
 
  export EXTRA_LDFLAGS="-L/lib -L${pkgs.linuxPackages.nvidia_x11}/lib"
 
  export EXTRA_CCFLAGS="-I/usr/include"
 
  '';        
 
 
}
 
}
 
</nowiki>}}
 
</nowiki>}}
  
== See also ==
+
== Troubleshooting ==
 +
 
 +
=== Booting to Text Mode ===
 +
If you encounter the problem of booting to text mode you might try adding the Nvidia kernel module manually with:
 +
 
 +
<syntaxHighlight lang="nix">
 +
boot.initrd.kernelModules = [ "nvidia" ];
 +
boot.extraModulePackages = [ config.boot.kernelPackages.nvidia_x11 ];
 +
</syntaxHighlight>
 +
 
 +
=== Screen Tearing Issues ===
 +
 
 +
First, try to switch to PRIME Sync Mode, as described above. If that doesn't work, try forcing a composition pipeline.
 +
 
 +
{{note|Forcing a full composition pipeline has been reported to reduce the performance of some OpenGL applications and may produce issues in WebGL. It also drastically increases the time the driver needs to clock down after load.}}
 +
 
 +
{{file|/etc/nixos/configuration.nix|nix|<nowiki>
 +
hardware.nvidia.forceFullCompositionPipeline = true;
 +
</nowiki>}}
 +
 
 +
=== Flickering Issues with Picom  ===
 +
{{file|~/.config/picom/picom.conf|conf|<nowiki>
 +
unredir-if-possible = false;
 +
backend = "xrender"; # try "glx" if xrender doesn't help
 +
vsync = true;
 +
</nowiki>}}
 +
 
 +
=== Graphical Corruption and System Crashes on Suspend/Resume ===
 +
<code>powerManagement.enable = true</code> can sometimes fix this, but is itself unstable and is known to cause suspend issues.
 +
 
 +
If you have a modern Nvidia GPU (Turing [https://en.wikipedia.org/wiki/Turing_(microarchitecture)#Products_using_Turing] or later), you may also want to investigate the <code>hardware.nvidia.powerManagement.finegrained</code> option: [https://download.nvidia.com/XFree86/Linux-x86_64/460.73.01/README/dynamicpowermanagement.html]
 +
 
 +
=== Black Screen or Nothing Works on Laptops ===
 +
The kernel module <code>i915</code> for intel or <code>amdgpu</code> for AMD may interfere with the Nvidia driver. This may result in a black screen when switching to the virtual terminal, or when exiting the X session. A possible workaround is to disable the integrated GPU by blacklisting the module, using the following configuration option (see also [https://discourse.nixos.org/t/nvidia-gpu-and-i915-kernel-module/21307/3]):
 +
 
 +
<syntaxHighlight lang="nix">
 +
# intel
 +
boot.kernelParams = [ "module_blacklist=i915" ];
 +
# AMD
 +
boot.kernelParams = [ "module_blacklist=amdgpu" ];
 +
</syntaxHighlight>
 +
 
 +
= Disable Nvidia dGPU completely =
  
* [https://github.com/grahamc/nixos-cuda-example nixos-cuda-example]
+
completely disable dGPU, saving battery. Probably not all configurations and module blacklists are required but this worked successfully
* [https://github.com/grahamc/nixos-cuda-example/pull/2 nix-shell envs for Cuda]
+
 
* [https://discourse.nixos.org/t/cuda-setup-on-nixos/1118 CUDA setup on NixOS]
+
{{file|/etc/nixos/configuration.nix|nix|<nowiki>
 +
boot.extraModprobeConfig = ''
 +
  blacklist nouveau
 +
  options nouveau modeset=0
 +
'';
 +
 
 +
services.udev.extraRules = ''
 +
  # Remove NVIDIA USB xHCI Host Controller devices, if present
 +
  ACTION=="add", SUBSYSTEM=="pci", ATTR{vendor}=="0x10de", ATTR{class}=="0x0c0330", ATTR{power/control}="auto", ATTR{remove}="1"
 +
  # Remove NVIDIA USB Type-C UCSI devices, if present
 +
  ACTION=="add", SUBSYSTEM=="pci", ATTR{vendor}=="0x10de", ATTR{class}=="0x0c8000", ATTR{power/control}="auto", ATTR{remove}="1"
 +
  # Remove NVIDIA Audio devices, if present
 +
  ACTION=="add", SUBSYSTEM=="pci", ATTR{vendor}=="0x10de", ATTR{class}=="0x040300", ATTR{power/control}="auto", ATTR{remove}="1"
 +
  # Remove NVIDIA VGA/3D controller devices
 +
  ACTION=="add", SUBSYSTEM=="pci", ATTR{vendor}=="0x10de", ATTR{class}=="0x03[0-9]*", ATTR{power/control}="auto", ATTR{remove}="1"
 +
'';
 +
boot.blacklistedKernelModules = [ "nouveau" "nvidia" "nvidia_drm" "nvidia_modeset" ];
 +
</nowiki>}}
  
 
[[Category:Video]]
 
[[Category:Video]]

Latest revision as of 10:57, 6 April 2024

NixOS Manual

First check out the method in the NixOS Manual.

Installing Nvidia Drivers on NixOS

NixOS uses a functional package management approach, which necessitates specific procedures for driver installation. When considering NVIDIA GPU drivers in a Linux environment, the installation process can be more complex compared to AMD and Intel. This complexity arises primarily because NVIDIA's official drivers are closed source and not typically bundled with many distributions. This document outlines the technical steps required to install NVIDIA GPU drivers on NixOS, factoring in both the unique nature of NixOS and the proprietary status of NVIDIA's drivers.

Enable Unfree Software Repositories

Make sure to allow Unfree Software. The unfree NVIDIA packages include nvidia-x11, nvidia-settings, and nvidia-persistenced.

Determining the Correct Driver Version

You will next need to determine the appropriate driver version for your card. The following options are available:

hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.stable;
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.beta;
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.production;  # (installs 550)
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.vulkan_beta;
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.legacy_470;
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.legacy_390;
hardware.nvidia.package = config.boot.kernelPackages.nvidiaPackages.legacy_340;

Out of the above, `stable` and `beta` will work for the latest RTX cards and some lower cards so long as they're not considered "legacy" by Nvidia. For "legacy" cards, you can consult the Nvidia official legacy driver list and check whether your device is supported by the 470, 390 or 340 branches. If so, you can use the corresponding legacy_470, legacy_390 or legacy_340 driver. For a full list of options, consult the nvidia-x11 module repository.

Note: As of early March 2024 the production driver has been updated from 535 to 550. This is a breaking change for some people, especially those on Wayland. To resolve this follow the steps under Running the new RTX SUPER on nixos stable

Once you've determined the correct driver version, note it down; you'll need it in the next step.

Modifying NixOS Configuration

Ensure that the following is in your NixOS configuration file (customizing as you prefer):

Warning: If you are using a laptop, the below configuration update is not sufficient to get your Nvidia card running! Once you've entered it, please continue reading, as there are important adjustments that must then be made to the configuration before your laptop graphics will work properly.
Breeze-text-x-plain.png
/etc/nixos/configuration.nix
{ config, lib, pkgs, ... }:
{

  # Enable OpenGL
  hardware.opengl = {
    enable = true;
    driSupport = true;
    driSupport32Bit = true;
  };

  # Load nvidia driver for Xorg and Wayland
  services.xserver.videoDrivers = ["nvidia"];

  hardware.nvidia = {

    # Modesetting is required.
    modesetting.enable = true;

    # Nvidia power management. Experimental, and can cause sleep/suspend to fail.
    # Enable this if you have graphical corruption issues or application crashes after waking
    # up from sleep. This fixes it by saving the entire VRAM memory to /tmp/ instead 
    # of just the bare essentials.
    powerManagement.enable = false;

    # Fine-grained power management. Turns off GPU when not in use.
    # Experimental and only works on modern Nvidia GPUs (Turing or newer).
    powerManagement.finegrained = false;

    # Use the NVidia open source kernel module (not to be confused with the
    # independent third-party "nouveau" open source driver).
    # Support is limited to the Turing and later architectures. Full list of 
    # supported GPUs is at: 
    # https://github.com/NVIDIA/open-gpu-kernel-modules#compatible-gpus 
    # Only available from driver 515.43.04+
    # Currently alpha-quality/buggy, so false is currently the recommended setting.
    open = false;

    # Enable the Nvidia settings menu,
	# accessible via `nvidia-settings`.
    nvidiaSettings = true;

    # Optionally, you may need to select the appropriate driver version for your specific GPU.
    package = config.boot.kernelPackages.nvidiaPackages.stable;
  };
  ...
}


Laptop Configuration: Hybrid Graphics (Nvidia Optimus PRIME)

In order to correctly finish configuring your Nvidia graphics driver, you must follow the below steps, which differ depending on whether or not you are using a hybrid graphics setup or not. A laptop with hybrid graphics possesses both an integrated GPU (often from the central processor) and a discrete, more powerful Nvidia GPU, typically for performance-intensive tasks. This dual-GPU setup allows for power-saving during basic tasks and higher graphics performance when needed.

Nvidia Optimus PRIME is a technology developed by Nvidia to optimize the power consumption and performance of laptops equipped with their GPUs. It seamlessly switches between the integrated graphics, usually from Intel, for lightweight tasks to save power, and the discrete Nvidia GPU for performance-intensive tasks like gaming or video editing. By dynamically balancing graphics power and battery life, Optimus provides the best of both worlds, ensuring that users get longer battery life without sacrificing graphical performance.

Configuring Optimus PRIME: Bus ID Values (Mandatory)

Before we can continue, we must mandatorily first determine the Bus ID values for both your Nvidia and Intel GPUs. This step will be essential regardless of which configuration you later adopt.

First, install the lshw package in order to be able to use the lshw command, then run:

sudo lshw -c display

You will likely get something like this:

*-display                 
description: i915drmfb
physical id: 0
bus info: pci@0000:0e:00.0
logical name: /dev/fb0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress bus_master cap_list rom fb
configuration: depth=32 driver=nvidia latency=0 mode=2560x1600 visual=truecolor xres=2560 yres=1600
resources: iomemory:600-5ff iomemory:620-61f irq:220 memory:85000000-85ffffff memory:6000000000-61ffffffff memory:6200000000-6201ffffff ioport:5000(size=128) memory:86000000-8607ffff

*-display
product: i915drmfb
physical id: 2
bus info: pci@0000:00:02.0
logical name: /dev/fb0
version: 04
width: 64 bits
clock: 33MHz
capabilities: pciexpress msi pm bus_master cap_list rom fb
configuration: depth=32 driver=i915 latency=0 resolution=2560,1600
resources: iomemory:620-61f iomemory:400-3ff irq:221 memory:622e000000-622effffff memory:4000000000-400fffffff ioport:6000(size=64) memory:c0000-dffff memory:4010000000-4016ffffff memory:4020000000-40ffffffff

Note the two values under "bus info" above, which may differ from laptop to laptop. Our Nvidia Bus ID is 0e:00.0 and our Intel Bus ID is 00:02.0. Watch out for the formatting; convert them from hexadecimal to decimal, remove the padding (leading zeroes), replace the dot with a colon, then add them like this:

Breeze-text-x-plain.png
/etc/nixos/configuration.nix
{
	hardware.nvidia.prime = {
		# Make sure to use the correct Bus ID values for your system!
		intelBusId = "PCI:0:2:0";
		nvidiaBusId = "PCI:14:0:0";
	};
}


Optimus PRIME Option A: Offload Mode

Offload Mode is available in NixOS 20.09 and higher, and requires an Nvidia card of the Turing generation or newer, and an Intel chipset from the Coffee Lake generation or newer.

Offload mode puts your Nvidia GPU to sleep and lets the Intel GPU handle all tasks, except if you call the Nvidia GPU specifically by "offloading" an application to it. For example, you can run your laptop normally and it will use the energy-efficient Intel GPU all day, and then you can offload a game from Steam onto the Nvidia GPU to make the Nvidia GPU run that game only. For many, this is the most desirable option.

Offload mode is enabled by running your programs with specific environment variables. Here's a sample script called nvidia-offload that you can run wrapped around your executable, for example nvidia-offload glxgears:

Breeze-text-x-plain.png
nvidia-offload
export __NV_PRIME_RENDER_OFFLOAD=1
export __NV_PRIME_RENDER_OFFLOAD_PROVIDER=NVIDIA-G0
export __GLX_VENDOR_LIBRARY_NAME=nvidia
export __VK_LAYER_NV_optimus=NVIDIA_only
exec "$@"


To enable offload mode, finish configuring your Nvidia driver by adding the following to your NixOS configuration file:

Breeze-text-x-plain.png
/etc/nixos/configuration.nix
{
	hardware.nvidia.prime = {
		offload = {
			enable = true;
			enableOffloadCmd = true;
		};
		# Make sure to use the correct Bus ID values for your system!
		intelBusId = "PCI:0:2:0";
		nvidiaBusId = "PCI:14:0:0";
	};
}


Optimus PRIME Option B: Sync Mode

Enabling PRIME sync introduces better performance and greatly reduces screen tearing, at the expense of higher power consumption since the Nvidia GPU will not go to sleep completely unless called for, as is the case in Offload Mode. It may also cause its own issues in rare cases. PRIME Sync and Offload Mode cannot be enabled at the same time.

PRIME sync may also solve some issues with connecting a display in clamshell mode directly to the GPU.

Breeze-text-x-plain.png
/etc/nixos/configuration.nix
{
  hardware.nvidia.prime = {
    sync.enable = true;

    # Make sure to use the correct Bus ID values for your system!
    nvidiaBusId = "PCI:14:0:0";
    intelBusId = "PCI:0:2:0";
  };
}


Optimus Option C: Reverse Sync Mode (Experimental)

This feature is relatively new and may not work properly on all systems (see discussion). It is also only available on driver 460.39 or newer. Reverse sync also only works with services.xserver.displayManager.setupCommands compatible Display Managers (LightDM, GDM and SDDM).

Breeze-text-x-plain.png
/etc/nixos/configuration.nix
{
  hardware.nvidia.prime = {
    reverseSync.enable = true;
    # Enable if using an external GPU
    allowExternalGpu = false;

    # Make sure to use the correct Bus ID values for your system!
    intelBusId = "PCI:0:2:0";
    nvidiaBusId = "PCI:14:0:0";
  };
}


Useful Tips

Check NixosHardware

You should check the nixoshardware GitHub repository. It is possible that someone already wrote a hardware configuration for your device and that usually takes care of drivers. If so, follow the upstream documentation to enable the required modules.

Multiple Boot Configurations

Imagine you have a laptop that you mostly use in clamshell mode (docked, connected to an external display and plugged into a charger) but that you sometimes use on the go.

In clamshell mode, using PRIME Sync is likely to lead to better performance, external display support, etc., at the cost of potentially (but not always) lower battery life. However, when using the laptop on the go, you may prefer to use offload mode.

NixOS supports "specialisations", which allow you to automatically generate different boot profiles when rebuilding your system. We can, for example, enable PRIME sync by default, but also create a "on-the-go" specialization that disables PRIME sync and instead enables offload mode:

Breeze-text-x-plain.png
/etc/nixos/configuration.nix
{

specialisation = {
  on-the-go.configuration = {
    system.nixos.tags = [ "on-the-go" ];
    hardware.nvidia = {
      prime.offload.enable = lib.mkForce true;
      prime.offload.enableOffloadCmd = lib.mkForce true;
      prime.sync.enable = lib.mkForce false;
    };
  };
};

}


(You can also add other settings here totally unrelated to Nvidia, such as power profiles, etc.)

After rebuilding and rebooting, you'll see in your boot menu under each Generation an "on-the-go" option, which will let you boot into the on-to-go specialisation for that generation.

Using GPUs on non-NixOS

If you're using Nix-packaged software on a non-NixOS system, you'll need a workaround to get everything up-and-running. The nixGL project provides wrapper to use GL drivers on non-NixOS systems. You need to have GPU drivers installed on your distro (for kernel modules). With nixGL installed, you'll run nixGL foobar instead of foobar.

Note that nixGL is not specific to Nvidia GPUs, and should work with just about any GPU.

CUDA and using your GPU for compute

See the CUDA wiki page.

Using Steam in Offload Mode

In order to automatically launch Steam in offload mode, you need to add the following to your ~/.bashrc:

export XDG_DATA_HOME="$HOME/.local/share"

Then, if you are using NixOS Steam, run:

mkdir -p ~/.local/share/applications
sed 's/^Exec=/&nvidia-offload /' /run/current-system/sw/share/applications/steam.desktop > ~/.local/share/applications/steam.desktop

For Flatpak Steam, run:

mkdir -p ~/.local/share/applications
sed 's/^Exec=/&nvidia-offload /' /var/lib/flatpak/exports/share/applications/com.valvesoftware.Steam.desktop > ~/.local/share/applications/com.valvesoftware.steam.desktop

Then restart your graphical environment session (or simply reboot).

Running the new RTX SUPER on nixos stable

The new RTX Super are not supported by the 545 driver. On Nixos stable, you want to use the 535 driver that come from unstable branch or the 550 (beta). To do that you need to manually call the driver you want in your config. Check on this link to choose the driver you want and change your config accordingly : https://github.com/NixOS/nixpkgs/blob/979a311fbd179b86200e412a3ed266b64808df4e/pkgs/os-specific/linux/nvidia-x11/default.nix#L36

Breeze-text-x-plain.png
/etc/nixos/nvidia.nix
{ pkgs, config, libs, ... }:

{

# Enable OpenGL
  hardware.opengl = {
    enable = true;
    driSupport = true;
    driSupport32Bit = true;
  };

# Load nvidia driver for Xorg and Wayland
  services.xserver.videoDrivers = ["nvidia"];
  
  hardware.nvidia.modesetting.enable = true;
  hardware.nvidia.powerManagement.enable = false;
  hardware.nvidia.powerManagement.finegrained = false;
  hardware.nvidia.open = false;
  hardware.nvidia.nvidiaSettings = true;
# Special config to load the latest (535 or 550) driver for the support of the 4070 SUPER
  hardware.nvidia.package = let 
  rcu_patch = pkgs.fetchpatch {
    url = "https://github.com/gentoo/gentoo/raw/c64caf53/x11-drivers/nvidia-drivers/files/nvidia-drivers-470.223.02-gpl-pfn_valid.patch";
    hash = "sha256-eZiQQp2S/asE7MfGvfe6dA/kdCvek9SYa/FFGp24dVg=";
  };
in config.boot.kernelPackages.nvidiaPackages.mkDriver {
    version = "535.154.05";
    sha256_64bit = "sha256-fpUGXKprgt6SYRDxSCemGXLrEsIA6GOinp+0eGbqqJg=";
    sha256_aarch64 = "sha256-G0/GiObf/BZMkzzET8HQjdIcvCSqB1uhsinro2HLK9k=";
    openSha256 = "sha256-wvRdHguGLxS0mR06P5Qi++pDJBCF8pJ8hr4T8O6TJIo=";
    settingsSha256 = "sha256-9wqoDEWY4I7weWW05F4igj1Gj9wjHsREFMztfEmqm10=";
    persistencedSha256 = "sha256-d0Q3Lk80JqkS1B54Mahu2yY/WocOqFFbZVBh+ToGhaE=";

    #version = "550.40.07";
    #sha256_64bit = "sha256-KYk2xye37v7ZW7h+uNJM/u8fNf7KyGTZjiaU03dJpK0=";
    #sha256_aarch64 = "sha256-AV7KgRXYaQGBFl7zuRcfnTGr8rS5n13nGUIe3mJTXb4=";
    #openSha256 = "sha256-mRUTEWVsbjq+psVe+kAT6MjyZuLkG2yRDxCMvDJRL1I=";
    #settingsSha256 = "sha256-c30AQa4g4a1EHmaEu1yc05oqY01y+IusbBuq+P6rMCs=";
    #persistencedSha256 = "sha256-11tLSY8uUIl4X/roNnxf5yS2PQvHvoNjnd2CB67e870=";

    patches = [ rcu_patch ];
 };
}


Troubleshooting

Booting to Text Mode

If you encounter the problem of booting to text mode you might try adding the Nvidia kernel module manually with:

boot.initrd.kernelModules = [ "nvidia" ];
boot.extraModulePackages = [ config.boot.kernelPackages.nvidia_x11 ];

Screen Tearing Issues

First, try to switch to PRIME Sync Mode, as described above. If that doesn't work, try forcing a composition pipeline.

Note: Forcing a full composition pipeline has been reported to reduce the performance of some OpenGL applications and may produce issues in WebGL. It also drastically increases the time the driver needs to clock down after load.
Breeze-text-x-plain.png
/etc/nixos/configuration.nix
hardware.nvidia.forceFullCompositionPipeline = true;


Flickering Issues with Picom

Breeze-text-x-plain.png
~/.config/picom/picom.conf
unredir-if-possible = false;
backend = "xrender"; # try "glx" if xrender doesn't help
vsync = true;


Graphical Corruption and System Crashes on Suspend/Resume

powerManagement.enable = true can sometimes fix this, but is itself unstable and is known to cause suspend issues.

If you have a modern Nvidia GPU (Turing [1] or later), you may also want to investigate the hardware.nvidia.powerManagement.finegrained option: [2]

Black Screen or Nothing Works on Laptops

The kernel module i915 for intel or amdgpu for AMD may interfere with the Nvidia driver. This may result in a black screen when switching to the virtual terminal, or when exiting the X session. A possible workaround is to disable the integrated GPU by blacklisting the module, using the following configuration option (see also [3]):

# intel
boot.kernelParams = [ "module_blacklist=i915" ];
# AMD
boot.kernelParams = [ "module_blacklist=amdgpu" ];

Disable Nvidia dGPU completely

completely disable dGPU, saving battery. Probably not all configurations and module blacklists are required but this worked successfully

Breeze-text-x-plain.png
/etc/nixos/configuration.nix
boot.extraModprobeConfig = ''
  blacklist nouveau
  options nouveau modeset=0
'';
  
services.udev.extraRules = ''
  # Remove NVIDIA USB xHCI Host Controller devices, if present
  ACTION=="add", SUBSYSTEM=="pci", ATTR{vendor}=="0x10de", ATTR{class}=="0x0c0330", ATTR{power/control}="auto", ATTR{remove}="1"
  # Remove NVIDIA USB Type-C UCSI devices, if present
  ACTION=="add", SUBSYSTEM=="pci", ATTR{vendor}=="0x10de", ATTR{class}=="0x0c8000", ATTR{power/control}="auto", ATTR{remove}="1"
  # Remove NVIDIA Audio devices, if present
  ACTION=="add", SUBSYSTEM=="pci", ATTR{vendor}=="0x10de", ATTR{class}=="0x040300", ATTR{power/control}="auto", ATTR{remove}="1"
  # Remove NVIDIA VGA/3D controller devices
  ACTION=="add", SUBSYSTEM=="pci", ATTR{vendor}=="0x10de", ATTR{class}=="0x03[0-9]*", ATTR{power/control}="auto", ATTR{remove}="1"
'';
boot.blacklistedKernelModules = [ "nouveau" "nvidia" "nvidia_drm" "nvidia_modeset" ];