This error typically occurs when there is a compatibility issue between the Vulkan application and Heavydb.
Here are some general steps to troubleshoot this type of error.
Important Notes
- When upgrading exising drivers, it's very important to use the exact SAME installation method used when installing drivers previously. Failure to do so can result in an unstable system.
- Note that this article assumes use of Ubuntu operating system. If you're using CentOS/RHEL, instructions may vary.
- If the hardware you're upgrading is an on-premise DGX machine, please do not use this article. Instead, refer to NVIDIA DGX documentation specific to the operating system you have installed.
- HEAVY.AI Support provides this information for your reference, however, responsibility for installing and upgrading NVIDIA drivers remains that of your system administrator.
- NVIDIA Drivers require use of Linux kernel mods. Use of Secure Boot may prevent successful upgrade/installation of drivers.
Update/Reinstall Graphics Drivers:
Choose the appropriate option based on your existing NVIDIA driver installation type.
Option 1:
Install NVIDIA Drivers with CUDA Toolkit from the NVIDIA Website.
-
For "Operating System" select Linux
-
For Architecture" select x86_64
-
For "Distribution" select Ubuntu
-
For "Version" select the version of your operating system (20.04)
-
For "Installer Type" choose deb (network) **
wget
and paste the URL you copied in the previous step on your server. Press enter to download.sudo apt install build-essential
Change the permissions of the downloaded .run file to allow execution, and run the installation.
chmod +x NVIDIA-Linux-x86_64-*.run
sudo ./NVIDIA-Linux--x86_64-*.run
Option 3:
Install NVIDIA drivers using APT.
In this case
Install a specific version of the driver for your GPU by installing the NVIDIA repository and using the apt
package manager.
You can run the command to get the updated list of the available driver's version.
apt list nvidia-driver-*
Install the driver version needed with apt
sudo apt install nvidia-driver-<version>
Run nvidia-smi
to verify that your drivers are installed correctly and recognize the GPUs in your environment. Depending on your environment, you should see something like this to confirm that your NVIDIA GPUs and drivers are present.
Roll back the Nvidia driver:
If you still getting the same error message after updating the driver you can proceed to Roll back the Driver instead.
First, confirm your Nvidia driver version by running the nvidia-smi
Once you confirm your current driver version, run this command to remove your driver using apt
sudo apt-get remove --purge '^nvidia-.*'
sudo apt-get remove --purge '^cuda-.*'
This command will remove all NVIDIA-related packages, including the driver, associated libraries, and other NVIDIA tools.
After removing the packages, it's a good idea to update the system to ensure everything is in order:
sudo apt-get update
sudo apt-get upgrade
Once you have successfully removed the driver, let's follow the documentation guide and install the Vulkan library first using this command
sudo apt install libvulkan1
Then followed by installing the Nvidia driver desired version using this command
sudo apt install nvidia-driver-<version>
.
You can verify again the installation/driver version using nvidia-smi
Comments
0 comments
Article is closed for comments.