diff --git a/README.md b/README.md index 25b3b87af0..2c0b18f077 100644 --- a/README.md +++ b/README.md @@ -4,11 +4,10 @@ CUDA Python is the home for accessing NVIDIA’s CUDA platform from Python. It c * [cuda.core](https://nvidia.github.io/cuda-python/cuda-core/latest): Pythonic access to CUDA Runtime and other core functionalities * [cuda.bindings](https://nvidia.github.io/cuda-python/cuda-bindings/latest): Low-level Python bindings to CUDA C APIs -* [cuda.cooperative](https://nvidia.github.io/cccl/cuda_cooperative/): A Python package providing CCCL's reusable block-wide and warp-wide *device* primitives for use within Numba CUDA kernels -* [cuda.parallel](https://nvidia.github.io/cccl/cuda_parallel/): A Python package for easy access to CCCL's highly efficient and customizable parallel algorithms, like `sort`, `scan`, `reduce`, `transform`, etc, that are callable on the *host* +* [cuda.cccl.cooperative](https://nvidia.github.io/cccl/cuda_cooperative/): A Python module providing CCCL's reusable block-wide and warp-wide *device* primitives for use within Numba CUDA kernels +* [cuda.cccl.parallel](https://nvidia.github.io/cccl/cuda_parallel/): A Python module for easy access to CCCL's highly efficient and customizable parallel algorithms, like `sort`, `scan`, `reduce`, `transform`, etc, that are callable on the *host* * [numba.cuda](https://nvidia.github.io/numba-cuda/): Numba's target for CUDA GPU programming by directly compiling a restricted subset of Python code into CUDA kernels and device functions following the CUDA execution model. - -For access to NVIDIA CPU & GPU Math Libraries, please refer to [nvmath-python](https://docs.nvidia.com/cuda/nvmath-python/latest). +* [nvmath-python](https://docs.nvidia.com/cuda/nvmath-python/latest): Pythonic access to NVIDIA CPU & GPU Math Libraries, with both [*host*](https://docs.nvidia.com/cuda/nvmath-python/latest/overview.html#host-apis) and [*device* (nvmath.device)](https://docs.nvidia.com/cuda/nvmath-python/latest/overview.html#device-apis) APIs. It also provides low-level Python bindings to host C APIs ([nvmath.bindings](https://docs.nvidia.com/cuda/nvmath-python/latest/bindings/index.html)). CUDA Python is currently undergoing an overhaul to improve existing and bring up new components. All of the previously available functionalities from the `cuda-python` package will continue to be available, please refer to the [cuda.bindings](https://nvidia.github.io/cuda-python/cuda-bindings/latest) documentation for installation guide and further detail. diff --git a/cuda_python/docs/source/conf.py b/cuda_python/docs/source/conf.py index 78791993a0..4dabf8a899 100644 --- a/cuda_python/docs/source/conf.py +++ b/cuda_python/docs/source/conf.py @@ -96,7 +96,7 @@ rst_epilog = f""" .. _cuda.core: {CUDA_PYTHON_DOMAIN}/cuda-core/latest .. _cuda.bindings: {CUDA_PYTHON_DOMAIN}/cuda-bindings/latest -.. _cuda.cooperative: https://nvidia.github.io/cccl/cuda_cooperative/ -.. _cuda.parallel: https://nvidia.github.io/cccl/cuda_parallel/ +.. _cuda.cccl.cooperative: https://nvidia.github.io/cccl/cuda_cooperative/ +.. _cuda.cccl.parallel: https://nvidia.github.io/cccl/cuda_parallel/ .. _numba.cuda: https://nvidia.github.io/numba-cuda/ """ diff --git a/cuda_python/docs/source/index.rst b/cuda_python/docs/source/index.rst index 58a70aed0e..ab96bb63d1 100644 --- a/cuda_python/docs/source/index.rst +++ b/cuda_python/docs/source/index.rst @@ -8,13 +8,14 @@ multiple components: - `cuda.core`_: Pythonic access to CUDA runtime and other core functionalities - `cuda.bindings`_: Low-level Python bindings to CUDA C APIs -- `cuda.cooperative`_: A Python package providing CCCL's reusable block-wide and warp-wide *device* primitives for use within Numba CUDA kernels -- `cuda.parallel`_: A Python package for easy access to CCCL's highly efficient and customizable parallel algorithms, like ``sort``, ``scan``, ``reduce``, ``transform``, etc, that are callable on the *host* +- `cuda.cccl.cooperative`_: A Python module providing CCCL's reusable block-wide and warp-wide *device* primitives for use within Numba CUDA kernels +- `cuda.cccl.parallel`_: A Python module for easy access to CCCL's highly efficient and customizable parallel algorithms, like ``sort``, ``scan``, ``reduce``, ``transform``, etc, that are callable on the *host* - `numba.cuda`_: Numba's target for CUDA GPU programming by directly compiling a restricted subset of Python code into CUDA kernels and device functions following the CUDA execution model. - -For access to NVIDIA CPU & GPU Math Libraries, please refer to `nvmath-python`_. +* `nvmath-python`_: Pythonic access to NVIDIA CPU & GPU Math Libraries, with both *host* and *device* (through `nvmath.device`_) APIs. It also provides low-level Python bindings to host C APIs (through `nvmath.bindings`_). .. _nvmath-python: https://docs.nvidia.com/cuda/nvmath-python/latest +.. _nvmath.device: https://docs.nvidia.com/cuda/nvmath-python/latest/overview.html#device-apis +.. _nvmath.bindings: https://docs.nvidia.com/cuda/nvmath-python/latest/bindings/index.html CUDA Python is currently undergoing an overhaul to improve existing and bring up new components. All of the previously available functionalities from the ``cuda-python`` package will continue to @@ -31,6 +32,7 @@ be available, please refer to the `cuda.bindings`_ documentation for installatio release.md cuda.core cuda.bindings - cuda.cooperative - cuda.parallel + cuda.cccl.cooperative + cuda.cccl.parallel numba.cuda + nvmath-python