Hacker News
steinvakt2 | 3 months ago | on: Writing Speed-of-Light Flash Attention for 5090 in...
I had a 5090 a few months ago but couldn't get flash attention to work. Does it work natively now? What about the 5080?
sigmoid10 | 3 months ago
PyTorch now has native support for the Blackwell architecture:
https://pytorch.org/blog/pytorch-2-7/
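A quick way to check whether a given PyTorch build can actually see a Blackwell card is to print the installed version, the CUDA toolkit it was built against, and the detected compute capability. This is a sketch, not from the thread; it assumes a PyTorch 2.7+ wheel built with CUDA 12.8 and a visible NVIDIA GPU:

```shell
# Sketch: requires a CUDA-enabled PyTorch install and an NVIDIA GPU.
# A 2.7+ version with CUDA 12.8 and a capability of (12, 0) would
# indicate a build that targets consumer Blackwell (e.g. the 5090).
python -c "import torch; \
  print(torch.__version__, torch.version.cuda); \
  print(torch.cuda.get_device_capability())"
```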
SynasterBeiter | 3 months ago
It does, but the performance is pretty bad, worse than Hopper.
zackangelo | 3 months ago
Curious what issues you were having. The kernel should compile natively if you pass nvcc the correct arch flags, although it probably won't take advantage of any new hardware features.
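As a sketch of what "the correct arch flags" might look like: the RTX 5090 is a consumer Blackwell part, commonly reported as compute capability 12.0 (sm_120); the filename here is hypothetical, and the exact architecture value should be checked against your toolkit's `nvcc --list-gpu-arch` output:

```shell
# Hypothetical build invocation; sm_120 is an assumption for the
# 5090's compute capability -- verify with `nvcc --list-gpu-arch`
# on a CUDA 12.8+ toolkit before relying on it.
nvcc -O3 -arch=sm_120 flash_attention_kernel.cu -o flash_attention_kernel
```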
saagarjha | 3 months ago
High-performance GPU code typically uses nonportable features that are not supported across generations.