[NVVM] Expose nvvm version detection in cuda.bindings.utils. #1837

Open

abhilash1910 wants to merge 3 commits into NVIDIA:main from abhilash1910:nvvm_fix

Contributor

abhilash1910 commented Mar 31, 2026

Description

Fixes issue #1457. Adds an API call to cuda.bindings.utils to check the version of NVVM.
@leofang @rwgk pinging for review.


          refactor cuda bindings utils for nvvm

65fb587

Contributor

copy-pr-bot bot commented Mar 31, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.


Contributor Author

abhilash1910 commented Mar 31, 2026

pre-commit.ci autofix


          [pre-commit.ci] auto code formatting

10943c8

abhilash1910 changed the title ~~[NVVM]Expose nvvm version detection in cuda.bindings.utils.~~ [NVVM][Fix] Expose nvvm version detection in cuda.bindings.utils.


          refresh

a7ef552

abhilash1910 changed the title ~~[NVVM][Fix] Expose nvvm version detection in cuda.bindings.utils.~~ [NVVM] Expose nvvm version detection in cuda.bindings.utils.

rwgk assigned abhilash1910

rwgk added the cuda.bindings label

rwgk added this to the cuda.bindings backlog milestone

rwgk added enhancement P1 labels

rwgk reviewed


cuda_bindings/cuda/bindings/utils/_nvvm_utils.py

		"""


		def check_nvvm_options(options: Sequence[bytes]) -> bool:

Collaborator

rwgk Apr 1, 2026

For a public Python API, bytes seems unusual here.

Most pythonic would be Sequence[str], but Sequence[str | bytes] would seem fine, too.

The implementation below actually converts bytes to str:

        options_list = [opt.decode("utf-8") if isinstance(opt, bytes) else opt for opt in options]

So requiring bytes in the API is especially surprising. My recommendation is to simply use Sequence[str]; then the options_list line can be removed completely.

I also recommend using check_nvvm_compiler_options as the function name, for clarity, especially in the call sites.

Contributor Author

abhilash1910 Apr 1, 2026

Yes, I will update this. I initially thought this should be Sequence[bytes | str].

cuda_bindings/cuda/bindings/utils/_nvvm_utils.py

		@@ -0,0 +1,93 @@
		# SPDX-FileCopyrightText: Copyright (c) 2026-2027 NVIDIA CORPORATION & AFFILIATES. All rights reserved.

Collaborator

rwgk Apr 1, 2026

Remove -2027

cuda_bindings/cuda/bindings/utils/_nvvm_utils.py

+                      if _inspect_function_pointer("__nvvmCreateProgram") == 0:
+                          return False
+                  except Exception:

Collaborator

rwgk Apr 1, 2026

This is far too broad and is very likely to mask bugs. The same applies to the other except Exception further below. Could you please try this in Cursor:

Could you please narrow down the exception catching as much as possible? Please use ModuleNotFoundError for the nvvm import; we also want to check exc.name == "nvvm". If that import works, we don't want to guard the _inspect_function_pointer import, or the _inspect_function_pointer() call. If those don't work, that's a bug we want to surface.

cuda_bindings/cuda/bindings/utils/_nvvm_utils.py

+                      options_list = [opt.decode("utf-8") if isinstance(opt, bytes) else opt for opt in options]
+                      nvvm.verify_program(program, len(options_list), options_list)
+                      nvvm.compile_program(program, len(options_list), options_list)
+                  except Exception:

Collaborator

rwgk Apr 1, 2026

If you're compiling anyway, do you actually need the verify_program() call?

I'm thinking the try should just be around this one line:

        try:
            nvvm.compile_program(prog, len(options), options)
        except nvvm.nvvmError as e:
            # can we add something here to ensure we're not masking errors other than invalid options?

I believe it's really important to take great care that we're not masking actual errors; e.g. the hard-wired _PRECHECK_NVVM_IR might need tweaks for future GPU generations. If we're simply reporting any error as "invalid compiler option", it'll potentially take someone downstream a long time to drill down all the way back here.
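
One possible way to avoid reporting every failure as "invalid compiler option" is to compile the probe IR once without the options under test, and only guard the second compile. The sketch below illustrates that idea with stand-ins: `CompileError` and `compile_fn` are hypothetical placeholders for the libNVVM error type and a wrapper around the compile call, and the baseline step is an assumption about what would work, not something this PR implements.

```python
from collections.abc import Callable, Sequence


class CompileError(Exception):
    """Stand-in for a compiler-specific error type."""


def options_are_supported(
    compile_fn: Callable[[Sequence[str]], None],
    options: Sequence[str],
) -> bool:
    """Return False only if adding `options` makes an otherwise good compile fail."""
    # Baseline: compile the probe input with no extra options. A failure here
    # cannot be blamed on the options under test, so it is left to propagate.
    compile_fn([])
    # Only the compile with the options under test is guarded.
    try:
        compile_fn(list(options))
    except CompileError:
        return False
    return True
```

This still cannot tell an unrecognized option apart from a valid option that fails for another reason, so inspecting the error code or program log would remain useful where available.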

cuda_bindings/cuda/bindings/utils/_nvvm_utils.py

Comment on lines +53 to +54

		>>> check_nvvm_options([b"-arch=compute_90", b"-numba-debug"])
		True # if -numba-debug is supported by the installed libNVVM

Collaborator

rwgk Apr 1, 2026

Could we use a non-numba example here?

cuda_core/tests/test_program.py

+                      from cuda.bindings.utils import check_nvvm_options
+                      return check_nvvm_options([f"-arch={arch}".encode()])
                   except Exception:

Collaborator

rwgk Apr 1, 2026 (edited)

It's only a test, therefore not nearly as critical as in the production code, but we may miss regressions if this isn't handled with similar care.

rparolin removed this from the cuda.bindings backlog milestone

