Skip to content

The source code should be updated for compute capability 120; Please let us know #270

@cihe13375

Description

@cihe13375

cc120 is RTX50xx series (https://developer.nvidia.com/cuda-gpus).

The two numbers seem to be same as cc90:

a.max_blocks_per_multiprocessor = 32;
a.cores_per_multiprocessor = 128;

https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#compute-capability-12-x
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#features-and-technical-specifications-technical-specifications-per-compute-capability

After the edit to src/cudart/gpucard.cpp the code compiled and all tests passed (tested on RTX5090, cuda 12.9.41. Compile: FC=gfortran CXX=g++ CUDACXX=nvcc compute_capability=120 gpu_lang=cuda cmake -D CMAKE_INSTALL_PREFIX=$HOME/.local ..).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions