
[CMake] Fix SM70/72 support range upper bound for CUDA 13.0#3166

Open
Flink-ddd wants to merge 1 commit into NVIDIA:main from Flink-ddd:fix/sm70-sm72-cuda13-upper-bound


@Flink-ddd Flink-ddd commented Apr 13, 2026

Purpose

Fixes #3164.

The CUDA 13.0 compiler no longer accepts SM70/72 as target architectures, causing a fatal error when CUTLASS_NVCC_ARCHS_SUPPORTED includes them. The existing condition only checked VERSION_GREATER_EQUAL 11.4 with no upper bound.

Fix: add VERSION_LESS 13.0 upper bound so SM70/72 are only included when the compiler supports them. Volta remains fully supported on CUDA < 13.0.
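The bounded check described above can be sketched as a standalone CMake script. This is a minimal illustration, not the actual CUTLASS `CMakeLists.txt`: the variable `ARCHS_SUPPORTED`, the file name `sketch.cmake`, and the simplified gate for the newer architectures are assumptions made for the example.

```cmake
# Hypothetical standalone sketch; not the real CUTLASS build logic.
# Run in script mode: cmake -DCUDA_VERSION=13.0 -P sketch.cmake
if(NOT DEFINED CUDA_VERSION)
  set(CUDA_VERSION 13.0)  # default chosen for the example only
endif()

set(ARCHS_SUPPORTED "")

# SM70/72: include only for toolkits in the half-open range [11.4, 13.0).
if(CUDA_VERSION VERSION_GREATER_EQUAL 11.4 AND CUDA_VERSION VERSION_LESS 13.0)
  list(APPEND ARCHS_SUPPORTED 70 72)
endif()

# Newer targets keep a lower bound only (gate simplified for illustration).
if(CUDA_VERSION VERSION_GREATER_EQUAL 11.4)
  list(APPEND ARCHS_SUPPORTED 75 80 86 87)
endif()

message(STATUS "ARCHS_SUPPORTED = ${ARCHS_SUPPORTED}")
```

Passing `-DCUDA_VERSION=12.8` instead should keep `70;72` at the front of the list, which matches the intent that Volta stays supported on CUDA < 13.0.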

Test Plan

Verified CUTLASS_NVCC_ARCHS_SUPPORTED output with CUDA_VERSION=13.0 before and after the fix.

Test Result

Before fix:

-- CUTLASS_NVCC_ARCHS_SUPPORTED = 70;72;75;80;86;87;89;90;90a;100;...

After fix:

-- CUTLASS_NVCC_ARCHS_SUPPORTED = 75;80;86;87;89;90;90a;100;...

@Flink-ddd
Author

Hi @hwu36 , Could you please take a look when you have time? Thanks!

@hwu36
Collaborator

hwu36 commented Apr 14, 2026

We haven't decided to drop Volta, since some users still rely on it. Could we drop it only with the 13.x compiler?

@Flink-ddd
Author

Sorry for the confusion in the description. SM70/72 are excluded only when CUDA_VERSION >= 13.0; they remain supported for all earlier versions.

Development

Successfully merging this pull request may close these issues.

[BUG] CUTLASS_NVCC_ARCHS_SUPPORTED has wrong values leading to compilation failure
