NVIDIA's Open-Source Linux Kernel Driver Performing At Parity To Proprietary Driver Review - Phoronix

filister@lemmy.world · 7 months ago

NVIDIA's Open-Source Linux Kernel Driver Performing At Parity To Proprietary Driver Review - Phoronix

GenderNeutralBro@lemmy.sdf.org · 7 months ago

As a reminder, the same (closed-source) user-space components for OpenGL / OpenCL / Vulkan / CUDA are used regardless of the NVIDIA kernel driver option with their official driver stack.

CUDA hell remains. :(

Possibly linux@lemmy.zip · 7 months ago

AMD needs to get their ducks in a row. They already have the advantage of not being Nvidia

john89@lemmy.ca · 7 months ago

They already have the advantage of not being Nvidia

That’s just because they release worse products.

If AMD had Nvidia’s marketshare, they would be just as scummy as the business climate allows.

In fact, AMD piggybacks off of Nvidia’s scumbaggery to charge more for their GPUs rather than engage in an actual price war.

Cornelius@lemmy.ml · 7 months ago

Who would’ve thunk that big, for profit, tech companies don’t care about us :T

john89@lemmy.ca · 7 months ago

It’s all by design.

istanbullu@lemmy.ml · 7 months ago

it’s breaking down. Pytorch supports ROCm now.

ProdigalFrog@slrpnk.net · 7 months ago

ROCm is it’s own hell (unless they finally put some resources into it in the past couple years)

Cornelius@lemmy.ml · 7 months ago

They put in the absolute minimum amount of resources for it.

It’s also littered with bugs as the ZLUDA project has noted

filister@lemmy.world · 7 months ago

Yes, the CUDA is the only reason why I consider NVIDIA. I really hate this company but the AMD tech stack is really inferior.

laurelraven@lemmy.blahaj.zone · 7 months ago

I’ve heard this but don’t really understand it… At a high level, what makes cuda so much better?

Phoenixz@lemmy.ca · 7 months ago

So is CUDA good or bad?

I keep reading it’s hell, but the best. Apparently it’s the single one reason why Nvidia is so big with AI, but it sucks.

What is it?

GenderNeutralBro@lemmy.sdf.org · edit-2 7 months ago

Both.

The good: CUDA is required for maximum performance and compatibility with machine learning (ML) frameworks and applications. It is a legitimate reason to choose Nvidia, and if you have an Nvidia card you will want to make sure you have CUDA acceleration working for any compatible ML workloads.

The bad: Getting CUDA to actually install and run correctly is a giant pain in the ass for anything but the absolute most basic use case. You will likely need to maintain multiple framework versions, because new ones are not backwards-compatible. You’ll need to source custom versions of Python modules compiled against specific versions of CUDA, which opens a whole new circle of Dependency Hell. And you know how everyone and their dog publishes shit with Docker now? Yeah, have fun with that.

That said, AMD’s equivalent (ROCm) is just as bad, and AMD is lagging about a full generation behind Nvidia in terms of ML performance.

The easy way is to just use OpenCL. But that’s not going to give you the best performance, and it’s not going to be compatible with everything out there.

Swedneck@discuss.tchncs.de · 7 months ago

almost sounds like god doesn’t want us doing machine learning

magikmw@lemm.ee · 7 months ago

The fact that cuda means ‘wonders’ in polish is living in my mind rent free several days after I read about nvidia news.

Supermariofan67@programming.dev · 7 months ago

I think this will change. Nvidia hired devs on Nouveau, NVK is coming along, etc

leopold@lemmy.kde.social · 7 months ago

Last I checked, there is no evidence Nvidia has hired anyone to work on Nouveau.

Supermariofan67@programming.dev · 7 months ago

https://www.phoronix.com/news/Ben-Skeggs-Joins-NVIDIA

leopold@lemmy.kde.social · edit-2 7 months ago

Right, I’m well aware that that article is the reason why a bunch of people are making the thoroughly unsubstantiated claim that Nvidia has hired people to work on Nouveau.

Nvidia hired the former lead Nouveau maintainer and he contributed a bunch of patches a couple of months ago after they hired him. That was his first contribution since stepping down and I’m fairly certain it was his last because there’s no way Phoronix would miss the opportunity to milk this some more if they could. He had said when stepping down that he was open to contributing every once in a while, so this wasn’t very surprising either way. To be clear, it is not evidence that he or anyone else was hired by Nvidia to work on Nouveau. Otherwise, I’d like to ask what he’s been doing since, because that was over three months ago.

boredsquirrel@slrpnk.net · edit-2 7 months ago

Well… it is an out-of-tree kernel driver that is made by the same company, and the userspace drivers are still proprietary.

This says NOTHING other than “wow NVIDIA can write good code (open source) that doesnt suck”?

sunzu@kbin.run · 7 months ago

How is it different. Wouldn’t just be the same software with source code available?

SMillerNL@lemmy.world · 7 months ago

It’s not, they’re not open sourcing their driver. They’ve made an open source driver.

sunzu@kbin.run · 7 months ago

Is there a reason to reinvent the wheel?

seaQueue@lemmy.world · 7 months ago

Usually this is done for licensing reasons. They probably don’t want the old code caught up in the open license they’re shipping the new driver under.

My understanding is that the new open driver separates proprietary code into a black box binary blob that isn’t distributed under an open source license. I’m guessing that they’ve been very careful not to include anything they want to keep closed into the new open driver, whereas the old driver wasn’t written with this separation in mind.

sunzu@kbin.run · 7 months ago

I was wondering about what they were doing with their “secret sauce”, thanks for explaining.

CMDR_Horn@lemmy.world · 7 months ago

Control, precedent, bean counter analysis etc. Pick your poison.

Supermariofan67@programming.dev · 7 months ago

Some of it probably comes from other companies that are unable or unwilling to relicense it even if Nvidia wanted to

krolden@lemmy.ml · 7 months ago

chirospasm@lemmy.ml · 7 months ago

Woohoo!

Karna@lemmy.ml · edit-2 7 months ago

Anyone tried this beta version yet? Any idea how stable it is?

Molecular0079@lemmy.world · 7 months ago

Performance parity? Heck no, not until this bug with the GSP firmware is solved: https://github.com/NVIDIA/open-gpu-kernel-modules/issues/538

Unyieldingly@lemmy.world · 7 months ago

I been using the open kernel driver with my Debian Workstation, it has worked better then the default driver by far with the Debian backport Kernel, I installed it using the Nvidia Cuda Repo.