Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEA]: Introduce Python module with CCCL headers #3201

Open
wants to merge 23 commits into
base: main
Choose a base branch
from

Conversation

rwgk
Copy link
Contributor

@rwgk rwgk commented Dec 19, 2024

Description

closes #2281

Copy link

copy-pr-bot bot commented Dec 19, 2024

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

project_path = os.path.abspath(os.path.dirname(__file__))
cccl_path = os.path.abspath(os.path.join(project_path, "..", ".."))
cccl_headers = [["cub", "cub"], ["libcudacxx", "include"], ["thrust", "thrust"]]
ver = "0.1.2.8.0"
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we need to use the CCCL version here, not CCCL Python modules' version. We should also not hard-code it, but instead read from CMakeLists which is the source of truth AFAIK, and for that setuptools might not be doing the job. @vyasr might have a simple example for how this can be done with scikit-build-core.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ack. I added this is a bullet to the PR description.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Check out the dynamic metadata section, specifically the Regex tab.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You would need to rewrite everything here to use CMake instead of setuptools. Depending on what this module is trying to do that may or may not be beneficial. Do you need to run compilation of cuda_cccl/cooperative/parallel against CCCL headers? In that case it is almost certainly worthwhile, I wouldn't want to orchestrate that compilation using setuptools.

Copy link
Member

@leofang leofang Dec 19, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do you need to run compilation of cuda_cccl/cooperative/parallel against CCCL headers?

  • cuda_cccl would just be nvidia-cuda-cccl-cuXX containing the headers but owned/maintained by the CCCL team for faster release cycles (think of it as cccl vs cuda-cccl on conda-forge)
  • cuda_cooperative JIT compiles CCCL headers at run time, so for all purposes the headers can be thought as shared libraries; no AOT compilation is needed
  • cuda_parallel is the most interesting case, because it does need to build the CCCL C shared library and include it in the wheel, but I dunno if building it requires NVCC + CCCL headers, or GCC/MSVC alone is enough

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

but I dunno if building it requires NVCC + CCCL headers, or GCC/MSVC alone is enough

Based on

  • adding -DCMAKE_VERBOSE_MAKEFILE=ON and looking at the output of
  • pip install --verbose ./cuda_parallel[test]

nvcc is required for compiling cccl/c/parallel/src/for.cu and reduce.cu:

  cd /home/coder/cccl/python/cuda_parallel/build/temp.linux-x86_64-cpython-312/c/parallel && /usr/bin/sccache /usr/local/cuda/bin/nvcc -forward-unknown-to-host-compiler -ccbin=/usr/bin/g++ -DCCCL_C_EXPERIMENTAL=1 -DNVRTC_GET_TYPE_NAME=1 -D_CCCL_NO_SYSTEM_HEADER -Dcccl_c_parallel_EXPORTS --options-file CMakeFiles/cccl.c.parallel.dir/includes_CUDA.rsp -O3 -DNDEBUG -std=c++20 "--generate-code=arch=compute_52,code=[compute_52,sm_52]" -Xcompiler=-fPIC -Xcudafe=--display_error_number -Wno-deprecated-gpu-targets -Xcudafe=--promote_warnings -Wreorder -Xcompiler=-Werror -Xcompiler=-Wall -Xcompiler=-Wextra -Xcompiler=-Wreorder -Xcompiler=-Winit-self -Xcompiler=-Woverloaded-virtual -Xcompiler=-Wcast-qual -Xcompiler=-Wpointer-arith -Xcompiler=-Wvla -Xcompiler=-Wno-gnu-line-marker -Xcompiler=-Wno-gnu-zero-variadic-macro-arguments -Xcompiler=-Wno-unused-function -Xcompiler=-Wno-noexcept-type -MD -MT c/parallel/CMakeFiles/cccl.c.parallel.dir/src/for.cu.o -MF CMakeFiles/cccl.c.parallel.dir/src/for.cu.o.d -x cu -c /home/coder/cccl/c/parallel/src/for.cu -o CMakeFiles/cccl.c.parallel.dir/src/for.cu.o
  cd /home/coder/cccl/python/cuda_parallel/build/temp.linux-x86_64-cpython-312/c/parallel && /usr/bin/sccache /usr/local/cuda/bin/nvcc -forward-unknown-to-host-compiler -ccbin=/usr/bin/g++ -DCCCL_C_EXPERIMENTAL=1 -DNVRTC_GET_TYPE_NAME=1 -D_CCCL_NO_SYSTEM_HEADER -Dcccl_c_parallel_EXPORTS --options-file CMakeFiles/cccl.c.parallel.dir/includes_CUDA.rsp -O3 -DNDEBUG -std=c++20 "--generate-code=arch=compute_52,code=[compute_52,sm_52]" -Xcompiler=-fPIC -Xcudafe=--display_error_number -Wno-deprecated-gpu-targets -Xcudafe=--promote_warnings -Wreorder -Xcompiler=-Werror -Xcompiler=-Wall -Xcompiler=-Wextra -Xcompiler=-Wreorder -Xcompiler=-Winit-self -Xcompiler=-Woverloaded-virtual -Xcompiler=-Wcast-qual -Xcompiler=-Wpointer-arith -Xcompiler=-Wvla -Xcompiler=-Wno-gnu-line-marker -Xcompiler=-Wno-gnu-zero-variadic-macro-arguments -Xcompiler=-Wno-unused-function -Xcompiler=-Wno-noexcept-type -MD -MT c/parallel/CMakeFiles/cccl.c.parallel.dir/src/reduce.cu.o -MF CMakeFiles/cccl.c.parallel.dir/src/reduce.cu.o.d -x cu -c /home/coder/cccl/c/parallel/src/reduce.cu -o CMakeFiles/cccl.c.parallel.dir/src/reduce.cu.o

Copy link
Member

@leofang leofang Dec 19, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I skimmed over the code and I am actually confused, because my impression is that the kernel compilation is still done at run time (JIT), and that the host logic can just be handled by a host compiler. @gevtushenko IIRC you built the prototype, any reason we have to use .cu files here and use NVCC to compile?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Commit 2913ae0 adopts the established _version.py handling.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tl;dr I would suggest that if you have to do any compilation whatsoever beyond pure Cython you switch away from setuptools, but if you don't have any compiled modules at build time then stick to setuptools or use another backend that isn't designed for compilation (hatchling would be a great choice).

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@gevtushenko IIRC you built the prototype, any reason we have to use .cu files here and use NVCC to compile?

In the offline call Georgii reminded me that there are some CUB structs that we need to pre-compile to pass around. Since generally CUB headers are not host compilable, NVCC has to be used, but we don't generate any GPU-specific code.

@leofang
Copy link
Member

leofang commented Dec 19, 2024

  • pip install --editable does not work (anymore).

Q: In what way is it not working?

@rwgk
Copy link
Contributor Author

rwgk commented Dec 19, 2024

  • pip install --editable does not work (anymore).

Q: In what way is it not working?

It is getting a non-existing path here:

cub_include_path = str(f.parent / "_include")

At HEAD, cuda_paralleld/cuda/_include exists in the source directory (it is .gitignored), but with this PR, that directory no longer exists and the trick/hack I'm using in the code above fails.

@rwgk
Copy link
Contributor Author

rwgk commented Dec 19, 2024

On August 30, 2014 @leofang wrote:

#2281 (comment)

Leo: Do you still recommend that we replace shutil.copytree() (in setup.py) with the cmake-based alternative?

I'm asking because that'll take this PR in a very different direction (I think).

@rwgk
Copy link
Contributor Author

rwgk commented Dec 19, 2024

Logging an observation (JIC it's useful to reference this later):

With CCCL HEAD (I have @ d6253b5)

TL;DR: pip install for cuda_cooperative or cuda_parallel does not work correctly in the first pass.

@gevtushenko could this explain your "only works 50% of the time" experience?

Current working directory is cccl/python/

deactivate
rm -rf devenv
git clean -fdx cuda_cooperative/ >& /dev/null
python -m venv devenv
. devenv/bin/activate
pip install --verbose ./cuda_cooperative[test] >& ~/pip_install_cuda_cooperative_log1.txt
ls -lR devenv/lib/python3.12/site-packages/cuda/_include | wc -l
pip install --verbose ./cuda_cooperative[test] >& ~/pip_install_cuda_cooperative_log2.txt
ls -lR devenv/lib/python3.12/site-packages/cuda/_include | wc -l

The output is:

ls: cannot access 'devenv/lib/python3.12/site-packages/cuda/_include': No such file or directory
0
2076

Similarly for cuda_parallel:

deactivate
rm -rf devenv
git clean -fdx cuda_parallel/ >& /dev/null
python -m venv devenv
. devenv/bin/activate
pip install --verbose ./cuda_parallel[test] >& ~/pip_install_cuda_parallel_log1.txt
ls -lR devenv/lib/python3.12/site-packages/cuda/_include | wc -l
pip install --verbose ./cuda_parallel[test] >& ~/pip_install_cuda_parallel_log2.txt
ls -lR devenv/lib/python3.12/site-packages/cuda/_include | wc -l

Same output as above.

@rwgk
Copy link
Contributor Author

rwgk commented Dec 19, 2024

Now with this PR (@ daab580)

TL;DR: Same problem (this had me really confused TBH).

deactivate
rm -rf devenv
git clean -fdx cuda_cccl/ >& /dev/null
python -m venv devenv
. devenv/bin/activate
pip install --verbose ./cuda_cccl[test] >& ~/pip_install_cuda_cccl_log1.txt
ls -lR devenv/lib/python3.12/site-packages/cuda/_include | wc -l
pip install --verbose ./cuda_cccl[test] >& ~/pip_install_cuda_cccl_log2.txt
ls -lR devenv/lib/python3.12/site-packages/cuda/_include | wc -l

Output:

ls: cannot access 'devenv/lib/python3.12/site-packages/cuda/_include': No such file or directory
0
2076

@rwgk
Copy link
Contributor Author

rwgk commented Dec 19, 2024

Small summary:

  • pip install cuda_cooperative or cuda_parallel does not work reliably at HEAD.

  • Simply moving out cuda-cccl does not fix that.

  • But we have the bigger question: Should I even try to fix that? Or should I instead try to use CMake instead of setuptools?

@rwgk
Copy link
Contributor Author

rwgk commented Dec 20, 2024

Commit ef9d5f4 makes the pip install ./cuda_cccl command work reliably.

I wouldn't be surprised if this isn't the right way of doing it, but it does work in one pass.

@rwgk
Copy link
Contributor Author

rwgk commented Dec 20, 2024

Commit bc116dc fixes the pip install --editable issue.

@rwgk
Copy link
Contributor Author

rwgk commented Dec 20, 2024

It turns out what I discovered the hard way was actually a known issue:

cccl/ci/test_python.sh

Lines 23 to 27 in d6253b5

# Temporarily install the package twice to populate include directory as part of the first installation
# and to let manifest discover these includes during the second installation. Do not forget to remove the
# second installation after https://github.com/NVIDIA/cccl/issues/2281 is addressed.
run_command "⚙️ Pip install cuda_parallel once" pip install --force-reinstall --upgrade --target "${prefix}" .[test]
run_command "⚙️ Pip install cuda_parallel twice" pip install --force-reinstall --upgrade --target "${prefix}" .[test]

@rwgk
Copy link
Contributor Author

rwgk commented Dec 20, 2024

/ok to test

Copy link
Contributor

🟩 CI finished in 58m 34s: Pass: 100%/176 | Total: 1d 00h | Avg: 8m 22s | Max: 44m 12s | Hits: 99%/22510
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 7h 29m | Avg: 9m 22s | Max: 36m 14s | Hits: 98%/9814

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total:  7h 23m | Avg:  9m 37s | Max: 36m 14s | Hits:  98%/9814  
      🟩 arm64              Pass: 100%/2   | Total:  6m 50s | Avg:  3m 25s | Max:  3m 31s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 54m 23s | Avg:  7m 46s | Max: 19m 13s | Hits:  98%/2239  
      🟩 12.5               Pass: 100%/2   | Total: 17m 29s | Avg:  8m 44s | Max:  8m 49s
      🟩 12.6               Pass: 100%/39  | Total:  6h 18m | Avg:  9m 41s | Max: 36m 14s | Hits:  98%/7575  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 06m | Avg: 16m 44s | Max: 20m 25s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 54m 23s | Avg:  7m 46s | Max: 19m 13s | Hits:  98%/2239  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 29s | Avg:  8m 44s | Max:  8m 49s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  5h 11m | Avg:  8m 53s | Max: 36m 14s | Hits:  98%/7575  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 06m | Avg: 16m 44s | Max: 20m 25s
      🟩 nvcc               Pass: 100%/44  | Total:  6h 22m | Avg:  8m 42s | Max: 36m 14s | Hits:  98%/9814  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 15m 17s | Avg:  3m 49s | Max:  4m 28s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 45s | Avg:  4m 45s | Max:  4m 45s
      🟩 Clang11            Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s
      🟩 Clang12            Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s
      🟩 Clang13            Pass: 100%/1   | Total:  4m 24s | Avg:  4m 24s | Max:  4m 24s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 02s | Avg:  4m 02s | Max:  4m 02s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 38s | Avg:  4m 38s | Max:  4m 38s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 33s | Avg:  4m 33s | Max:  4m 33s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 49m | Avg: 13m 43s | Max: 30m 21s
      🟩 GCC6               Pass: 100%/2   | Total: 10m 24s | Avg:  5m 12s | Max:  7m 46s
      🟩 GCC7               Pass: 100%/2   | Total:  6m 32s | Avg:  3m 16s | Max:  3m 28s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 48s | Avg:  3m 48s | Max:  3m 48s
      🟩 GCC9               Pass: 100%/3   | Total: 21m 47s | Avg:  7m 15s | Max: 15m 38s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 47s | Avg:  3m 47s | Max:  3m 47s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 42s | Avg:  3m 42s | Max:  3m 42s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s
      🟩 GCC13              Pass: 100%/10  | Total:  2h 32m | Avg: 15m 15s | Max: 36m 14s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  5m 37s | Avg:  5m 37s | Max:  5m 37s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 13s | Avg: 19m 13s | Max: 19m 13s | Hits:  98%/2239  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 13m 12s | Avg: 13m 12s | Max: 13m 12s | Hits:  99%/2476  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 27m 54s | Avg: 13m 57s | Max: 15m 08s | Hits:  98%/5099  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 29s | Avg:  8m 44s | Max:  8m 49s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/20  | Total:  2h 40m | Avg:  8m 00s | Max: 30m 21s
      🟩 GCC                Pass: 100%/21  | Total:  3h 26m | Avg:  9m 49s | Max: 36m 14s
      🟩 Intel              Pass: 100%/1   | Total:  5m 37s | Avg:  5m 37s | Max:  5m 37s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 00m | Avg: 15m 04s | Max: 19m 13s | Hits:  98%/9814  
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 29s | Avg:  8m 44s | Max:  8m 49s
    🟩 gpu
      🟩 v100               Pass: 100%/48  | Total:  7h 29m | Avg:  9m 22s | Max: 36m 14s | Hits:  98%/9814  
    🟩 jobs
      🟩 Build              Pass: 100%/41  | Total:  4h 41m | Avg:  6m 52s | Max: 20m 25s | Hits:  98%/9814  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 52m | Avg: 28m 11s | Max: 36m 14s
      🟩 Test               Pass: 100%/2   | Total: 53m 12s | Avg: 26m 36s | Max: 30m 21s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 04s | Avg:  2m 04s | Max:  2m 04s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 14m 37s | Avg: 14m 37s | Max: 14m 37s
      🟩 90a                Pass: 100%/2   | Total: 16m 02s | Avg:  8m 01s | Max: 12m 11s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total: 46m 47s | Avg:  7m 47s | Max: 31m 13s
      🟩 14                 Pass: 100%/5   | Total: 59m 13s | Avg: 11m 50s | Max: 24m 18s | Hits:  98%/2239  
      🟩 17                 Pass: 100%/13  | Total:  2h 00m | Avg:  9m 18s | Max: 21m 02s | Hits:  99%/4952  
      🟩 20                 Pass: 100%/23  | Total:  3h 40m | Avg:  9m 36s | Max: 36m 14s | Hits:  98%/2623  
    
  • 🟩 cub: Pass: 100%/47 | Total: 7h 03m | Avg: 9m 00s | Max: 31m 24s | Hits: 99%/3124

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  6h 53m | Avg:  9m 11s | Max: 31m 24s | Hits:  99%/3124  
      🟩 arm64              Pass: 100%/2   | Total:  9m 47s | Avg:  4m 53s | Max:  4m 59s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 41m 41s | Avg:  5m 57s | Max: 15m 29s | Hits:  99%/781   
      🟩 12.5               Pass: 100%/2   | Total: 18m 36s | Avg:  9m 18s | Max:  9m 23s
      🟩 12.6               Pass: 100%/38  | Total:  6h 02m | Avg:  9m 32s | Max: 31m 24s | Hits:  99%/2343  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 55s | Avg:  4m 27s | Max:  4m 36s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 41m 41s | Avg:  5m 57s | Max: 15m 29s | Hits:  99%/781   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 36s | Avg:  9m 18s | Max:  9m 23s
      🟩 nvcc12.6           Pass: 100%/36  | Total:  5h 53m | Avg:  9m 49s | Max: 31m 24s | Hits:  99%/2343  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 55s | Avg:  4m 27s | Max:  4m 36s
      🟩 nvcc               Pass: 100%/45  | Total:  6h 54m | Avg:  9m 12s | Max: 31m 24s | Hits:  99%/3124  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 22m 04s | Avg:  5m 31s | Max:  6m 25s
      🟩 Clang10            Pass: 100%/1   | Total:  7m 01s | Avg:  7m 01s | Max:  7m 01s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 38s | Avg:  5m 38s | Max:  5m 38s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 51s | Avg:  5m 51s | Max:  5m 51s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 43s | Avg:  5m 43s | Max:  5m 43s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 34s | Avg:  5m 34s | Max:  5m 34s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 18m | Avg: 11m 10s | Max: 31m 24s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 23s | Avg:  4m 11s | Max:  4m 24s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 24s | Avg:  5m 12s | Max:  5m 13s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 11s | Avg:  4m 43s | Max:  5m 37s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 48s | Avg:  5m 48s | Max:  5m 48s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 40s | Avg:  5m 40s | Max:  5m 40s
      🟩 GCC12              Pass: 100%/3   | Total: 25m 38s | Avg:  8m 32s | Max: 15m 50s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 01m | Avg: 15m 09s | Max: 31m 13s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 49s | Avg:  6m 49s | Max:  6m 49s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 29s | Avg: 15m 29s | Max: 15m 29s | Hits:  99%/781   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 12m 05s | Avg: 12m 05s | Max: 12m 05s | Hits:  99%/781   
      🟩 MSVC14.39          Pass: 100%/2   | Total: 26m 30s | Avg: 13m 15s | Max: 13m 43s | Hits:  99%/1562  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 36s | Avg:  9m 18s | Max:  9m 23s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 26m | Avg:  7m 43s | Max: 31m 24s
      🟩 GCC                Pass: 100%/21  | Total:  3h 16m | Avg:  9m 22s | Max: 31m 13s
      🟩 Intel              Pass: 100%/1   | Total:  6m 49s | Avg:  6m 49s | Max:  6m 49s
      🟩 MSVC               Pass: 100%/4   | Total: 54m 04s | Avg: 13m 31s | Max: 15m 29s | Hits:  99%/3124  
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 36s | Avg:  9m 18s | Max:  9m 23s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 20m 01s | Avg: 10m 00s | Max: 15m 50s
      🟩 v100               Pass: 100%/45  | Total:  6h 43m | Avg:  8m 57s | Max: 31m 24s | Hits:  99%/3124  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 13m | Avg:  6m 20s | Max: 15m 29s | Hits:  99%/3124  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 41s | Avg: 24m 41s | Max: 24m 41s
      🟩 GraphCapture       Pass: 100%/1   | Total: 27m 30s | Avg: 27m 30s | Max: 27m 30s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 03m | Avg: 21m 13s | Max: 31m 24s
      🟩 TestGPU            Pass: 100%/2   | Total: 53m 48s | Avg: 26m 54s | Max: 31m 13s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 20m 01s | Avg: 10m 00s | Max: 15m 50s
      🟩 90a                Pass: 100%/1   | Total:  4m 30s | Avg:  4m 30s | Max:  4m 30s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 24m 23s | Avg:  4m 52s | Max:  6m 25s
      🟩 14                 Pass: 100%/4   | Total: 31m 28s | Avg:  7m 52s | Max: 15m 29s | Hits:  99%/781   
      🟩 17                 Pass: 100%/12  | Total:  1h 24m | Avg:  7m 03s | Max: 13m 43s | Hits:  99%/1562  
      🟩 20                 Pass: 100%/26  | Total:  4h 42m | Avg: 10m 51s | Max: 31m 24s | Hits:  99%/781   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 6h 20m | Avg: 8m 16s | Max: 33m 45s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 18m 32s | Avg:  9m 16s | Max: 12m 27s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  6h 11m | Avg:  8m 26s | Max: 33m 45s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 38s | Avg:  4m 49s | Max:  5m 08s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 44m 55s | Avg:  6m 25s | Max: 18m 45s | Hits:  99%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 27m 55s | Avg: 13m 57s | Max: 14m 05s
      🟩 12.6               Pass: 100%/37  | Total:  5h 07m | Avg:  8m 19s | Max: 33m 45s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  5m 07s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 44m 55s | Avg:  6m 25s | Max: 18m 45s | Hits:  99%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 27m 55s | Avg: 13m 57s | Max: 14m 05s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  4h 57m | Avg:  8m 30s | Max: 33m 45s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  5m 07s
      🟩 nvcc               Pass: 100%/44  | Total:  6h 10m | Avg:  8m 25s | Max: 33m 45s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 21m 19s | Avg:  5m 19s | Max:  6m 25s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 42s | Avg:  6m 42s | Max:  6m 42s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 15s | Avg:  5m 15s | Max:  5m 15s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 45s | Avg:  5m 45s | Max:  5m 45s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 10s | Avg:  5m 10s | Max:  5m 10s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 46s | Avg:  5m 46s | Max:  5m 46s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 20s | Avg:  5m 20s | Max:  5m 20s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 07m | Avg:  9m 36s | Max: 33m 45s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 15s | Avg:  4m 07s | Max:  4m 09s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 25s | Avg:  4m 42s | Max:  5m 01s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 00s | Avg:  5m 00s | Max:  5m 00s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 45s | Avg:  4m 55s | Max:  5m 54s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 40s | Avg:  5m 40s | Max:  5m 40s
      🟩 GCC12              Pass: 100%/1   | Total:  6m 17s | Avg:  6m 17s | Max:  6m 17s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 05m | Avg:  8m 09s | Max: 17m 03s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 58s | Avg:  6m 58s | Max:  6m 58s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 18m 45s | Avg: 18m 45s | Max: 18m 45s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 15m 05s | Avg: 15m 05s | Max: 15m 05s | Hits:  99%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 58m 19s | Avg: 19m 26s | Max: 23m 53s | Hits:  99%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 27m 55s | Avg: 13m 57s | Max: 14m 05s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 13m | Avg:  7m 01s | Max: 33m 45s
      🟩 GCC                Pass: 100%/19  | Total:  2h 00m | Avg:  6m 19s | Max: 17m 03s
      🟩 Intel              Pass: 100%/1   | Total:  6m 58s | Avg:  6m 58s | Max:  6m 58s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 32m | Avg: 18m 25s | Max: 23m 53s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 27m 55s | Avg: 13m 57s | Max: 14m 05s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  6h 20m | Avg:  8m 16s | Max: 33m 45s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 37m | Avg:  6m 56s | Max: 18m 45s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 39m 40s | Avg: 13m 13s | Max: 23m 53s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 03m | Avg: 21m 05s | Max: 33m 45s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 28s | Avg:  4m 28s | Max:  4m 28s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 22m 32s | Avg:  4m 30s | Max:  5m 50s
      🟩 14                 Pass: 100%/4   | Total: 34m 17s | Avg:  8m 34s | Max: 18m 45s | Hits:  99%/1852  
      🟩 17                 Pass: 100%/12  | Total:  1h 36m | Avg:  8m 03s | Max: 17m 03s | Hits:  99%/3704  
      🟩 20                 Pass: 100%/23  | Total:  3h 28m | Avg:  9m 04s | Max: 33m 45s | Hits:  99%/3704  
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 18m | Avg: 5m 19s | Max: 31m 25s | Hits: 92%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 07m | Avg:  5m 48s | Max: 31m 25s | Hits:  92%/312   
      🟩 arm64              Pass: 100%/4   | Total: 10m 21s | Avg:  2m 35s | Max:  2m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 14m 27s | Avg:  4m 49s | Max:  8m 33s | Hits:  92%/156   
      🟩 12.5               Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  5m 40s
      🟩 12.6               Pass: 100%/21  | Total:  1h 52m | Avg:  5m 22s | Max: 31m 25s | Hits:  92%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 14m 27s | Avg:  4m 49s | Max:  8m 33s | Hits:  92%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  5m 40s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  1h 52m | Avg:  5m 22s | Max: 31m 25s | Hits:  92%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 18m | Avg:  5m 19s | Max: 31m 25s | Hits:  92%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 03s | Avg:  3m 03s | Max:  3m 03s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 03s | Avg:  4m 03s | Max:  4m 03s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 13s | Avg:  3m 13s | Max:  3m 13s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 07s | Avg:  3m 07s | Max:  3m 07s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 17s | Avg:  3m 17s | Max:  3m 17s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 33s | Avg:  3m 33s | Max:  3m 33s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 14s | Avg:  3m 14s | Max:  3m 14s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 16s | Avg:  3m 16s | Max:  3m 16s
      🟩 Clang18            Pass: 100%/4   | Total: 39m 45s | Avg:  9m 56s | Max: 31m 25s
      🟩 GCC9               Pass: 100%/1   | Total:  2m 51s | Avg:  2m 51s | Max:  2m 51s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 23s | Avg:  3m 23s | Max:  3m 23s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 03s | Avg:  3m 03s | Max:  3m 03s
      🟩 GCC12              Pass: 100%/2   | Total: 20m 10s | Avg: 10m 05s | Max: 16m 46s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 56s | Avg:  2m 44s | Max:  2m 58s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  8m 33s | Avg:  8m 33s | Max:  8m 33s | Hits:  92%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  8m 39s | Avg:  8m 39s | Max:  8m 39s | Hits:  92%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  5m 40s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total:  1h 09m | Avg:  5m 21s | Max: 31m 25s
      🟩 GCC                Pass: 100%/9   | Total: 40m 23s | Avg:  4m 29s | Max: 16m 46s
      🟩 MSVC               Pass: 100%/2   | Total: 17m 12s | Avg:  8m 36s | Max:  8m 39s | Hits:  92%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  5m 40s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 18m | Avg:  5m 19s | Max: 31m 25s | Hits:  92%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 30m | Avg:  3m 45s | Max:  8m 39s | Hits:  92%/312   
      🟩 Test               Pass: 100%/2   | Total: 48m 11s | Avg: 24m 05s | Max: 31m 25s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 58s | Avg:  2m 58s | Max:  2m 58s
      🟩 90a                Pass: 100%/1   | Total:  2m 45s | Avg:  2m 45s | Max:  2m 45s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 19m 42s | Avg:  3m 17s | Max:  5m 40s
      🟩 20                 Pass: 100%/20  | Total:  1h 58m | Avg:  5m 55s | Max: 31m 25s | Hits:  92%/312   
    
  • 🟩 cccl: Pass: 100%/6 | Total: 27m 15s | Avg: 4m 32s | Max: 4m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/6   | Total: 27m 15s | Avg:  4m 32s | Max:  4m 51s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  8m 47s | Avg:  4m 23s | Max:  4m 51s
      🟩 12.0               Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  4m 45s
      🟩 12.6               Pass: 100%/2   | Total:  9m 02s | Avg:  4m 31s | Max:  4m 49s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  8m 47s | Avg:  4m 23s | Max:  4m 51s
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  4m 45s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 02s | Avg:  4m 31s | Max:  4m 49s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/6   | Total: 27m 15s | Avg:  4m 32s | Max:  4m 51s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 51s | Avg:  4m 51s | Max:  4m 51s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 41s | Avg:  4m 41s | Max:  4m 41s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 49s | Avg:  4m 49s | Max:  4m 49s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 56s | Avg:  3m 56s | Max:  3m 56s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 45s | Avg:  4m 45s | Max:  4m 45s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 13s | Avg:  4m 13s | Max:  4m 13s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/3   | Total: 14m 21s | Avg:  4m 47s | Max:  4m 51s
      🟩 GCC                Pass: 100%/3   | Total: 12m 54s | Avg:  4m 18s | Max:  4m 45s
    🟩 gpu
      🟩 v100               Pass: 100%/6   | Total: 27m 15s | Avg:  4m 32s | Max:  4m 51s
    🟩 jobs
      🟩 Infra              Pass: 100%/6   | Total: 27m 15s | Avg:  4m 32s | Max:  4m 51s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 04s | Avg: 5m 02s | Max: 8m 04s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 04s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 04s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 04s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 04s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 04s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 04s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 04s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 00s | Avg:  2m 00s | Max:  2m 00s
      🟩 Test               Pass: 100%/1   | Total:  8m 04s | Avg:  8m 04s | Max:  8m 04s
    
  • 🟩 python: Pass: 100%/1 | Total: 44m 12s | Avg: 44m 12s | Max: 44m 12s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 44m 12s | Avg: 44m 12s | Max: 44m 12s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 44m 12s | Avg: 44m 12s | Max: 44m 12s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 44m 12s | Avg: 44m 12s | Max: 44m 12s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 44m 12s | Avg: 44m 12s | Max: 44m 12s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 44m 12s | Avg: 44m 12s | Max: 44m 12s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 44m 12s | Avg: 44m 12s | Max: 44m 12s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 44m 12s | Avg: 44m 12s | Max: 44m 12s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 44m 12s | Avg: 44m 12s | Max: 44m 12s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
+/- python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 176)

# Runner
125 linux-amd64-cpu16
25 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@rwgk rwgk marked this pull request as ready for review December 20, 2024 03:13
@rwgk rwgk requested review from a team as code owners December 20, 2024 03:13
@rwgk rwgk requested review from jrhemstad and miscco and removed request for miscco and jrhemstad December 20, 2024 03:13
@leofang
Copy link
Member

leofang commented Dec 20, 2024

Leo: Do you still recommend that we replace shutil.copytree() (in setup.py) with the cmake-based alternative?

I'm asking because that'll take this PR in a very different direction (I think).

Don't sweat to make major changes. There are technical and cultural reasons that I suggested scikit-build-core when we planned this task.

  • Technically, a custom copy routine is a bit nerve wrecking, especially after we encountered the incident (CUDAINST-3178) that for many months the nvidia-cuda-cccl-cuXX wheels were completely not usable.
  • Technically, as influenced by our dear friends from RAPIDS and the wider C++ community, CCCL has geared heavily toward CMake, and it makes sense to use a build system that can integrate with the existing CMake infra
  • Culturally, our RAPIDS friends try to stay away from setuptools, and using scikit-build-core could help us ask for helps from RAPIDS easier because they're familiar with it

However, it does take some efforts to rewrite the build system. Right now our custom shutil- based copy works just fine, so apart from the potential benefit of becoming more robust I don't see other immediate needs for switching, so it makes sense to not sweat for now. Let's do minimal work to unblock ourselves so that we can focus on more important things.

@@ -110,6 +111,7 @@ update_file "$CUDAX_CMAKE_VERSION_FILE" "set(cudax_VERSION_MAJOR \([0-9]\+\))" "
update_file "$CUDAX_CMAKE_VERSION_FILE" "set(cudax_VERSION_MINOR \([0-9]\+\))" "set(cudax_VERSION_MINOR $minor)"
update_file "$CUDAX_CMAKE_VERSION_FILE" "set(cudax_VERSION_PATCH \([0-9]\+\))" "set(cudax_VERSION_PATCH $patch)"

update_file "$CUDA_CCCL_VERSION_FILE" "^__version__ = \"\([0-9.]\+\)\"" "__version__ = \"$pymajor.$pyminor.$major.$minor.$patch\""
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is wrong, we need the headers to track C++ versions

Suggested change
update_file "$CUDA_CCCL_VERSION_FILE" "^__version__ = \"\([0-9.]\+\)\"" "__version__ = \"$pymajor.$pyminor.$major.$minor.$patch\""
update_file "$CUDA_CCCL_VERSION_FILE" "^__version__ = \"\([0-9.]\+\)\"" "__version__ = \"$major.$minor.$patch\""

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done: commit df943c0

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Q: Is it possible that we consolidate .gitignore files at the root directory and not have independent ones per sub dir...?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I created #3212 to look into this later.

Comment on lines 14 to 19
__version__ = None
with open(os.path.join(project_path, "cuda", "cccl", "_version.py")) as f:
exec(f.read())
assert __version__ is not None
ver = __version__
del __version__
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done: commit 40c8389

Comment on lines 22 to 23
with open("README.md") as f:
long_description = f.read()
Copy link
Member

@leofang leofang Dec 20, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: this can be moved to pyproject.toml too, ex:
https://github.com/NVIDIA/cuda-python/blob/33b7366e308201f3bca8206ae331e399ac1b3379/cuda_core/pyproject.toml#L65
(in pyproject.toml, readme is the new preferred name over long_description)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done: commit 40c8389



def copy_cccl_headers_to_cuda_include():
inc_path = os.path.join(project_path, "cuda", "_include")
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Important: Can we please establish the right layout before merging? There is a new wheel layout being communicated internally. This should be something like

Suggested change
inc_path = os.path.join(project_path, "cuda", "_include")
inc_path = os.path.join(project_path, "cccl", "include")

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done: commit e3c7867

I'm not sure about two aspects:

  • Did you mean to suggest site-packages/cccl/include/ as the install directory? — I decided to make this site-packages/cuda/cccl/include/. It seemed odd to me to have it outside the cuda subdir.

  • By accident I discovered that all cuda.cooperative unit tests pass without the CCCL headers. @gevtushenko for comment.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

By accident I discovered that all cuda.cooperative unit tests pass without the CCCL headers. @gevtushenko for comment.

@rwgk we add path to CUDA Toolkit (CTK) headers here:

cuda_include_path = os.path.join(get_cuda_path(), "include")

CTK provides CUB headers as well. Tests likely pass because they use CTK version of CUB. We should add a static assert somewhere in cuda.cooperative to check that we use version of CUB that cuda.cooperative was "build" with.

@rwgk rwgk marked this pull request as draft December 20, 2024 21:33
rwgk added 3 commits December 20, 2024 16:00
Trigger for this change:

* NVIDIA#3201 (comment)

Unexpected accidental discovery: cuda.cooperative unit tests pass without CCCL headers entirely.
@rwgk rwgk marked this pull request as ready for review December 21, 2024 01:08
Copy link
Contributor

🟩 CI finished in 2h 05m: Pass: 100%/176 | Total: 3d 07h | Avg: 27m 15s | Max: 1h 15m | Hits: 57%/22530
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 13h 04m | Avg: 16m 20s | Max: 1h 15m | Hits: 47%/9814

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total: 12h 39m | Avg: 16m 31s | Max:  1h 15m | Hits:  47%/9814  
      🟩 arm64              Pass: 100%/2   | Total: 24m 42s | Avg: 12m 21s | Max: 20m 41s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 58m 35s | Avg:  8m 22s | Max: 17m 51s | Hits:  98%/2239  
      🟩 12.5               Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 00s
      🟩 12.6               Pass: 100%/39  | Total: 11h 48m | Avg: 18m 09s | Max:  1h 15m | Hits:  32%/7575  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 05m | Avg: 16m 23s | Max: 20m 34s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 58m 35s | Avg:  8m 22s | Max: 17m 51s | Hits:  98%/2239  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 00s
      🟩 nvcc12.6           Pass: 100%/35  | Total: 10h 42m | Avg: 18m 21s | Max:  1h 15m | Hits:  32%/7575  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 05m | Avg: 16m 23s | Max: 20m 34s
      🟩 nvcc               Pass: 100%/44  | Total: 11h 59m | Avg: 16m 20s | Max:  1h 15m | Hits:  47%/9814  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 28m 30s | Avg:  7m 07s | Max: 17m 49s
      🟩 Clang10            Pass: 100%/1   | Total:  5m 28s | Avg:  5m 28s | Max:  5m 28s
      🟩 Clang11            Pass: 100%/1   | Total:  4m 29s | Avg:  4m 29s | Max:  4m 29s
      🟩 Clang12            Pass: 100%/1   | Total: 20m 19s | Avg: 20m 19s | Max: 20m 19s
      🟩 Clang13            Pass: 100%/1   | Total: 20m 13s | Avg: 20m 13s | Max: 20m 13s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 31s | Avg:  4m 31s | Max:  4m 31s
      🟩 Clang15            Pass: 100%/1   | Total: 21m 49s | Avg: 21m 49s | Max: 21m 49s
      🟩 Clang16            Pass: 100%/1   | Total: 21m 05s | Avg: 21m 05s | Max: 21m 05s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
      🟩 Clang18            Pass: 100%/8   | Total:  3h 07m | Avg: 23m 27s | Max:  1h 15m
      🟩 GCC6               Pass: 100%/2   | Total: 16m 18s | Avg:  8m 09s | Max: 13m 32s
      🟩 GCC7               Pass: 100%/2   | Total: 19m 12s | Avg:  9m 36s | Max: 16m 00s
      🟩 GCC8               Pass: 100%/1   | Total: 19m 56s | Avg: 19m 56s | Max: 19m 56s
      🟩 GCC9               Pass: 100%/3   | Total: 22m 11s | Avg:  7m 23s | Max: 15m 20s
      🟩 GCC10              Pass: 100%/1   | Total: 23m 55s | Avg: 23m 55s | Max: 23m 55s
      🟩 GCC11              Pass: 100%/1   | Total: 20m 16s | Avg: 20m 16s | Max: 20m 16s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 25s | Avg:  4m 25s | Max:  4m 25s
      🟩 GCC13              Pass: 100%/10  | Total:  2h 47m | Avg: 16m 46s | Max: 26m 11s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 24m 24s | Avg: 24m 24s | Max: 24m 24s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 17m 51s | Avg: 17m 51s | Max: 17m 51s | Hits:  98%/2239  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 36m 47s | Avg: 36m 47s | Max: 36m 47s | Hits:  32%/2476  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 14m | Avg: 37m 26s | Max: 42m 00s | Hits:  31%/5099  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 00s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/20  | Total:  5h 18m | Avg: 15m 56s | Max:  1h 15m
      🟩 GCC                Pass: 100%/21  | Total:  4h 54m | Avg: 14m 00s | Max: 26m 11s
      🟩 Intel              Pass: 100%/1   | Total: 24m 24s | Avg: 24m 24s | Max: 24m 24s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 09m | Avg: 32m 22s | Max: 42m 00s | Hits:  47%/9814  
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 00s
    🟩 gpu
      🟩 v100               Pass: 100%/48  | Total: 13h 04m | Avg: 16m 20s | Max:  1h 15m | Hits:  47%/9814  
    🟩 jobs
      🟩 Build              Pass: 100%/41  | Total:  9h 53m | Avg: 14m 27s | Max: 42m 00s | Hits:  47%/9814  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 36m | Avg: 24m 11s | Max: 26m 11s
      🟩 Test               Pass: 100%/2   | Total:  1h 32m | Avg: 46m 23s | Max:  1h 15m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 01s | Avg:  2m 01s | Max:  2m 01s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 14m 01s | Avg: 14m 01s | Max: 14m 01s
      🟩 90a                Pass: 100%/2   | Total: 17m 27s | Avg:  8m 43s | Max: 13m 00s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total: 40m 12s | Avg:  6m 42s | Max: 25m 08s
      🟩 14                 Pass: 100%/5   | Total:  1h 31m | Avg: 18m 16s | Max: 26m 11s | Hits:  98%/2239  
      🟩 17                 Pass: 100%/13  | Total:  3h 55m | Avg: 18m 07s | Max: 36m 47s | Hits:  32%/4952  
      🟩 20                 Pass: 100%/23  | Total:  6h 55m | Avg: 18m 03s | Max:  1h 15m | Hits:  31%/2623  
    
  • 🟩 cub: Pass: 100%/47 | Total: 1d 11h | Avg: 45m 25s | Max: 1h 12m | Hits: 53%/3144

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 09h | Avg: 44m 49s | Max:  1h 12m | Hits:  53%/3144  
      🟩 arm64              Pass: 100%/2   | Total:  1h 57m | Avg: 58m 59s | Max:  1h 00m
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  4h 07m | Avg: 35m 20s | Max: 55m 16s | Hits:  67%/786   
      🟩 12.5               Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 05m
      🟩 12.6               Pass: 100%/38  | Total:  1d 05h | Avg: 46m 17s | Max:  1h 12m | Hits:  48%/2358  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 54m | Avg: 57m 15s | Max: 59m 25s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  4h 07m | Avg: 35m 20s | Max: 55m 16s | Hits:  67%/786   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 05m
      🟩 nvcc12.6           Pass: 100%/36  | Total:  1d 03h | Avg: 45m 40s | Max:  1h 12m | Hits:  48%/2358  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 54m | Avg: 57m 15s | Max: 59m 25s
      🟩 nvcc               Pass: 100%/45  | Total:  1d 09h | Avg: 44m 54s | Max:  1h 12m | Hits:  53%/3144  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  1h 58m | Avg: 29m 39s | Max: 58m 02s
      🟩 Clang10            Pass: 100%/1   | Total: 58m 43s | Avg: 58m 43s | Max: 58m 43s
      🟩 Clang11            Pass: 100%/1   | Total: 54m 07s | Avg: 54m 07s | Max: 54m 07s
      🟩 Clang12            Pass: 100%/1   | Total: 52m 35s | Avg: 52m 35s | Max: 52m 35s
      🟩 Clang13            Pass: 100%/1   | Total: 59m 17s | Avg: 59m 17s | Max: 59m 17s
      🟩 Clang14            Pass: 100%/1   | Total: 57m 28s | Avg: 57m 28s | Max: 57m 28s
      🟩 Clang15            Pass: 100%/1   | Total: 59m 10s | Avg: 59m 10s | Max: 59m 10s
      🟩 Clang16            Pass: 100%/1   | Total: 59m 42s | Avg: 59m 42s | Max: 59m 42s
      🟩 Clang17            Pass: 100%/1   | Total: 52m 17s | Avg: 52m 17s | Max: 52m 17s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 29m | Avg: 47m 05s | Max:  1h 00m
      🟩 GCC6               Pass: 100%/2   | Total: 49m 50s | Avg: 24m 55s | Max: 46m 02s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 09s | Max: 54m 09s
      🟩 GCC8               Pass: 100%/1   | Total: 58m 08s | Avg: 58m 08s | Max: 58m 08s
      🟩 GCC9               Pass: 100%/3   | Total:  2h 25m | Avg: 48m 36s | Max: 55m 16s
      🟩 GCC10              Pass: 100%/1   | Total: 56m 15s | Avg: 56m 15s | Max: 56m 15s
      🟩 GCC11              Pass: 100%/1   | Total: 55m 37s | Avg: 55m 37s | Max: 55m 37s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 41m | Avg: 33m 58s | Max: 59m 08s
      🟩 GCC13              Pass: 100%/8   | Total:  4h 24m | Avg: 33m 02s | Max: 57m 46s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m
      🟩 MSVC14.16          Pass: 100%/1   | Total: 55m 16s | Avg: 55m 16s | Max: 55m 16s | Hits:  67%/786   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 59m 05s | Avg: 59m 05s | Max: 59m 05s | Hits:  67%/786   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 12m | Hits:  39%/1572  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 05m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 15h 01m | Avg: 47m 26s | Max:  1h 00m
      🟩 GCC                Pass: 100%/21  | Total: 13h 14m | Avg: 37m 49s | Max: 59m 08s
      🟩 Intel              Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 05m | Avg:  1h 01m | Max:  1h 12m | Hits:  53%/3144  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 05m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 42m 47s | Avg: 21m 23s | Max: 26m 38s
      🟩 v100               Pass: 100%/45  | Total:  1d 10h | Avg: 46m 29s | Max:  1h 12m | Hits:  53%/3144  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 09h | Avg: 50m 05s | Max:  1h 12m | Hits:  53%/3144  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 29s | Avg: 18m 29s | Max: 18m 29s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 07s | Avg: 16m 07s | Max: 16m 07s
      🟩 HostLaunch         Pass: 100%/3   | Total: 50m 52s | Avg: 16m 57s | Max: 17m 32s
      🟩 TestGPU            Pass: 100%/2   | Total: 46m 08s | Avg: 23m 04s | Max: 23m 06s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 42m 47s | Avg: 21m 23s | Max: 26m 38s
      🟩 90a                Pass: 100%/1   | Total: 23m 57s | Avg: 23m 57s | Max: 23m 57s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  1h 08m | Avg: 13m 38s | Max: 42m 58s
      🟩 14                 Pass: 100%/4   | Total:  3h 33m | Avg: 53m 22s | Max: 58m 02s | Hits:  67%/786   
      🟩 17                 Pass: 100%/12  | Total: 11h 16m | Avg: 56m 20s | Max:  1h 04m | Hits:  67%/1572  
      🟩 20                 Pass: 100%/26  | Total: 19h 37m | Avg: 45m 16s | Max:  1h 12m | Hits:  11%/786   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 23h 59m | Avg: 31m 17s | Max: 1h 03m | Hits: 69%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 45m 14s | Avg: 22m 37s | Max: 28m 40s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total: 22h 49m | Avg: 31m 08s | Max:  1h 03m | Hits:  69%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  1h 09m | Avg: 34m 49s | Max: 38m 11s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  2h 40m | Avg: 22m 51s | Max: 59m 12s | Hits:  63%/1852  
      🟩 12.5               Pass: 100%/2   | Total:  1h 41m | Avg: 50m 50s | Max: 50m 59s
      🟩 12.6               Pass: 100%/37  | Total: 19h 37m | Avg: 31m 49s | Max:  1h 03m | Hits:  70%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 01m | Avg: 30m 36s | Max: 32m 12s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  2h 40m | Avg: 22m 51s | Max: 59m 12s | Hits:  63%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 41m | Avg: 50m 50s | Max: 50m 59s
      🟩 nvcc12.6           Pass: 100%/35  | Total: 18h 36m | Avg: 31m 54s | Max:  1h 03m | Hits:  70%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 01m | Avg: 30m 36s | Max: 32m 12s
      🟩 nvcc               Pass: 100%/44  | Total: 22h 58m | Avg: 31m 19s | Max:  1h 03m | Hits:  69%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  1h 10m | Avg: 17m 32s | Max: 33m 07s
      🟩 Clang10            Pass: 100%/1   | Total: 35m 39s | Avg: 35m 39s | Max: 35m 39s
      🟩 Clang11            Pass: 100%/1   | Total: 33m 22s | Avg: 33m 22s | Max: 33m 22s
      🟩 Clang12            Pass: 100%/1   | Total: 33m 33s | Avg: 33m 33s | Max: 33m 33s
      🟩 Clang13            Pass: 100%/1   | Total: 32m 29s | Avg: 32m 29s | Max: 32m 29s
      🟩 Clang14            Pass: 100%/1   | Total: 35m 15s | Avg: 35m 15s | Max: 35m 15s
      🟩 Clang15            Pass: 100%/1   | Total: 32m 13s | Avg: 32m 13s | Max: 32m 13s
      🟩 Clang16            Pass: 100%/1   | Total: 34m 56s | Avg: 34m 56s | Max: 34m 56s
      🟩 Clang17            Pass: 100%/1   | Total: 31m 35s | Avg: 31m 35s | Max: 31m 35s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 02m | Avg: 26m 07s | Max: 35m 23s
      🟩 GCC6               Pass: 100%/2   | Total: 30m 57s | Avg: 15m 28s | Max: 27m 03s
      🟩 GCC7               Pass: 100%/2   | Total: 38m 18s | Avg: 19m 09s | Max: 33m 43s
      🟩 GCC8               Pass: 100%/1   | Total: 33m 01s | Avg: 33m 01s | Max: 33m 01s
      🟩 GCC9               Pass: 100%/3   | Total:  1h 11m | Avg: 23m 48s | Max: 34m 08s
      🟩 GCC10              Pass: 100%/1   | Total: 33m 16s | Avg: 33m 16s | Max: 33m 16s
      🟩 GCC11              Pass: 100%/1   | Total: 38m 00s | Avg: 38m 00s | Max: 38m 00s
      🟩 GCC12              Pass: 100%/1   | Total: 38m 30s | Avg: 38m 30s | Max: 38m 30s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 49m | Avg: 28m 40s | Max: 38m 11s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 40m 31s | Avg: 40m 31s | Max: 40m 31s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 59m 12s | Avg: 59m 12s | Max: 59m 12s | Hits:  63%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 55m 42s | Avg: 55m 42s | Max: 55m 42s | Hits:  63%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 27m | Avg: 49m 09s | Max:  1h 03m | Hits:  73%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 41m | Avg: 50m 50s | Max: 50m 59s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  8h 42m | Avg: 27m 28s | Max: 35m 39s
      🟩 GCC                Pass: 100%/19  | Total:  8h 32m | Avg: 26m 59s | Max: 38m 30s
      🟩 Intel              Pass: 100%/1   | Total: 40m 31s | Avg: 40m 31s | Max: 40m 31s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 22m | Avg: 52m 28s | Max:  1h 03m | Hits:  69%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 41m | Avg: 50m 50s | Max: 50m 59s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total: 23h 59m | Avg: 31m 17s | Max:  1h 03m | Hits:  69%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total: 22h 04m | Avg: 33m 07s | Max:  1h 03m | Hits:  61%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total:  1h 08m | Avg: 22m 45s | Max: 37m 13s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 46m 16s | Avg: 15m 25s | Max: 17m 40s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 20m 30s | Avg: 20m 30s | Max: 20m 30s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 22m 27s | Avg:  4m 29s | Max:  5m 26s
      🟩 14                 Pass: 100%/4   | Total:  2h 33m | Avg: 38m 16s | Max: 59m 12s | Hits:  63%/1852  
      🟩 17                 Pass: 100%/12  | Total:  7h 55m | Avg: 39m 39s | Max:  1h 03m | Hits:  63%/3704  
      🟩 20                 Pass: 100%/23  | Total: 12h 22m | Avg: 32m 18s | Max:  1h 00m | Hits:  78%/3704  
    
  • 🟩 cudax: Pass: 100%/26 | Total: 5h 56m | Avg: 13m 43s | Max: 22m 44s | Hits: 54%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  5h 03m | Avg: 13m 48s | Max: 22m 44s | Hits:  54%/312   
      🟩 arm64              Pass: 100%/4   | Total: 53m 11s | Avg: 13m 17s | Max: 14m 22s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 37m 05s | Avg: 12m 21s | Max: 13m 51s | Hits:  54%/156   
      🟩 12.5               Pass: 100%/2   | Total: 16m 36s | Avg:  8m 18s | Max:  8m 38s
      🟩 12.6               Pass: 100%/21  | Total:  5h 03m | Avg: 14m 26s | Max: 22m 44s | Hits:  54%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 37m 05s | Avg: 12m 21s | Max: 13m 51s | Hits:  54%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 16m 36s | Avg:  8m 18s | Max:  8m 38s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  5h 03m | Avg: 14m 26s | Max: 22m 44s | Hits:  54%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  5h 56m | Avg: 13m 43s | Max: 22m 44s | Hits:  54%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total: 12m 56s | Avg: 12m 56s | Max: 12m 56s
      🟩 Clang10            Pass: 100%/1   | Total: 14m 40s | Avg: 14m 40s | Max: 14m 40s
      🟩 Clang11            Pass: 100%/1   | Total: 13m 58s | Avg: 13m 58s | Max: 13m 58s
      🟩 Clang12            Pass: 100%/1   | Total: 15m 11s | Avg: 15m 11s | Max: 15m 11s
      🟩 Clang13            Pass: 100%/1   | Total: 13m 07s | Avg: 13m 07s | Max: 13m 07s
      🟩 Clang14            Pass: 100%/1   | Total: 13m 10s | Avg: 13m 10s | Max: 13m 10s
      🟩 Clang15            Pass: 100%/1   | Total: 14m 48s | Avg: 14m 48s | Max: 14m 48s
      🟩 Clang16            Pass: 100%/1   | Total: 16m 31s | Avg: 16m 31s | Max: 16m 31s
      🟩 Clang17            Pass: 100%/1   | Total: 15m 32s | Avg: 15m 32s | Max: 15m 32s
      🟩 Clang18            Pass: 100%/4   | Total: 57m 08s | Avg: 14m 17s | Max: 16m 49s
      🟩 GCC9               Pass: 100%/1   | Total: 13m 51s | Avg: 13m 51s | Max: 13m 51s
      🟩 GCC10              Pass: 100%/1   | Total: 16m 12s | Avg: 16m 12s | Max: 16m 12s
      🟩 GCC11              Pass: 100%/1   | Total: 15m 38s | Avg: 15m 38s | Max: 15m 38s
      🟩 GCC12              Pass: 100%/2   | Total: 38m 48s | Avg: 19m 24s | Max: 22m 44s
      🟩 GCC13              Pass: 100%/4   | Total: 48m 48s | Avg: 12m 12s | Max: 14m 22s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 18s | Avg: 10m 18s | Max: 10m 18s | Hits:  54%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 44s | Avg:  9m 44s | Max:  9m 44s | Hits:  54%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 16m 36s | Avg:  8m 18s | Max:  8m 38s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total:  3h 07m | Avg: 14m 23s | Max: 16m 49s
      🟩 GCC                Pass: 100%/9   | Total:  2h 13m | Avg: 14m 48s | Max: 22m 44s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 10m 18s | Hits:  54%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 16m 36s | Avg:  8m 18s | Max:  8m 38s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  5h 56m | Avg: 13m 43s | Max: 22m 44s | Hits:  54%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  5h 17m | Avg: 13m 13s | Max: 16m 31s | Hits:  54%/312   
      🟩 Test               Pass: 100%/2   | Total: 39m 33s | Avg: 19m 46s | Max: 22m 44s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 10m 29s | Avg: 10m 29s | Max: 10m 29s
      🟩 90a                Pass: 100%/1   | Total: 11m 07s | Avg: 11m 07s | Max: 11m 07s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total:  1h 10m | Avg: 11m 46s | Max: 13m 51s
      🟩 20                 Pass: 100%/20  | Total:  4h 46m | Avg: 14m 19s | Max: 22m 44s | Hits:  54%/312   
    
  • 🟩 cccl: Pass: 100%/6 | Total: 32m 45s | Avg: 5m 27s | Max: 6m 19s

    🟩 cpu
      🟩 amd64              Pass: 100%/6   | Total: 32m 45s | Avg:  5m 27s | Max:  6m 19s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total: 10m 10s | Avg:  5m 05s | Max:  5m 11s
      🟩 12.0               Pass: 100%/2   | Total: 11m 33s | Avg:  5m 46s | Max:  6m 19s
      🟩 12.6               Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  6m 16s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total: 10m 10s | Avg:  5m 05s | Max:  5m 11s
      🟩 nvcc12.0           Pass: 100%/2   | Total: 11m 33s | Avg:  5m 46s | Max:  6m 19s
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  6m 16s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/6   | Total: 32m 45s | Avg:  5m 27s | Max:  6m 19s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  5m 11s | Avg:  5m 11s | Max:  5m 11s
      🟩 Clang14            Pass: 100%/1   | Total:  6m 19s | Avg:  6m 19s | Max:  6m 19s
      🟩 Clang18            Pass: 100%/1   | Total:  6m 16s | Avg:  6m 16s | Max:  6m 16s
      🟩 GCC6               Pass: 100%/1   | Total:  4m 59s | Avg:  4m 59s | Max:  4m 59s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 46s | Avg:  4m 46s | Max:  4m 46s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/3   | Total: 17m 46s | Avg:  5m 55s | Max:  6m 19s
      🟩 GCC                Pass: 100%/3   | Total: 14m 59s | Avg:  4m 59s | Max:  5m 14s
    🟩 gpu
      🟩 v100               Pass: 100%/6   | Total: 32m 45s | Avg:  5m 27s | Max:  6m 19s
    🟩 jobs
      🟩 Infra              Pass: 100%/6   | Total: 32m 45s | Avg:  5m 27s | Max:  6m 19s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 12m 38s | Avg: 6m 19s | Max: 10m 28s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 12m 38s | Avg:  6m 19s | Max: 10m 28s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 12m 38s | Avg:  6m 19s | Max: 10m 28s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 12m 38s | Avg:  6m 19s | Max: 10m 28s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 12m 38s | Avg:  6m 19s | Max: 10m 28s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 12m 38s | Avg:  6m 19s | Max: 10m 28s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 12m 38s | Avg:  6m 19s | Max: 10m 28s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 12m 38s | Avg:  6m 19s | Max: 10m 28s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 10s | Avg:  2m 10s | Max:  2m 10s
      🟩 Test               Pass: 100%/1   | Total: 10m 28s | Avg: 10m 28s | Max: 10m 28s
    
  • 🟩 python: Pass: 100%/1 | Total: 37m 03s | Avg: 37m 03s | Max: 37m 03s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 37m 03s | Avg: 37m 03s | Max: 37m 03s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 37m 03s | Avg: 37m 03s | Max: 37m 03s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 37m 03s | Avg: 37m 03s | Max: 37m 03s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 37m 03s | Avg: 37m 03s | Max: 37m 03s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 37m 03s | Avg: 37m 03s | Max: 37m 03s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 37m 03s | Avg: 37m 03s | Max: 37m 03s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 37m 03s | Avg: 37m 03s | Max: 37m 03s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 37m 03s | Avg: 37m 03s | Max: 37m 03s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
+/- python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 176)

# Runner
125 linux-amd64-cpu16
25 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Copy link
Contributor

🟩 CI finished in 1h 41m: Pass: 100%/176 | Total: 3d 17h | Avg: 30m 27s | Max: 1h 17m | Hits: 31%/22530
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 15h 39m | Avg: 19m 34s | Max: 47m 37s | Hits: 41%/9814

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total: 14h 56m | Avg: 19m 29s | Max: 47m 37s | Hits:  41%/9814  
      🟩 arm64              Pass: 100%/2   | Total: 43m 06s | Avg: 21m 33s | Max: 22m 01s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 51m | Avg: 15m 51s | Max: 28m 31s | Hits:  49%/2239  
      🟩 12.5               Pass: 100%/2   | Total:  1h 00m | Avg: 30m 24s | Max: 31m 23s
      🟩 12.6               Pass: 100%/39  | Total: 12h 47m | Avg: 19m 41s | Max: 47m 37s | Hits:  39%/7575  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 03m | Avg: 15m 51s | Max: 20m 05s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 51m | Avg: 15m 51s | Max: 28m 31s | Hits:  49%/2239  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 00m | Avg: 30m 24s | Max: 31m 23s
      🟩 nvcc12.6           Pass: 100%/35  | Total: 11h 44m | Avg: 20m 07s | Max: 47m 37s | Hits:  39%/7575  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 03m | Avg: 15m 51s | Max: 20m 05s
      🟩 nvcc               Pass: 100%/44  | Total: 14h 36m | Avg: 19m 54s | Max: 47m 37s | Hits:  41%/9814  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  1h 02m | Avg: 15m 34s | Max: 19m 17s
      🟩 Clang10            Pass: 100%/1   | Total: 22m 46s | Avg: 22m 46s | Max: 22m 46s
      🟩 Clang11            Pass: 100%/1   | Total: 15m 55s | Avg: 15m 55s | Max: 15m 55s
      🟩 Clang12            Pass: 100%/1   | Total: 23m 29s | Avg: 23m 29s | Max: 23m 29s
      🟩 Clang13            Pass: 100%/1   | Total: 16m 24s | Avg: 16m 24s | Max: 16m 24s
      🟩 Clang14            Pass: 100%/1   | Total: 21m 00s | Avg: 21m 00s | Max: 21m 00s
      🟩 Clang15            Pass: 100%/1   | Total: 21m 42s | Avg: 21m 42s | Max: 21m 42s
      🟩 Clang16            Pass: 100%/1   | Total: 16m 57s | Avg: 16m 57s | Max: 16m 57s
      🟩 Clang17            Pass: 100%/1   | Total: 18m 38s | Avg: 18m 38s | Max: 18m 38s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 52m | Avg: 21m 31s | Max: 47m 37s
      🟩 GCC6               Pass: 100%/2   | Total: 14m 45s | Avg:  7m 22s | Max: 12m 17s
      🟩 GCC7               Pass: 100%/2   | Total: 23m 24s | Avg: 11m 42s | Max: 11m 52s
      🟩 GCC8               Pass: 100%/1   | Total: 21m 07s | Avg: 21m 07s | Max: 21m 07s
      🟩 GCC9               Pass: 100%/3   | Total: 56m 36s | Avg: 18m 52s | Max: 21m 55s
      🟩 GCC10              Pass: 100%/1   | Total: 16m 14s | Avg: 16m 14s | Max: 16m 14s
      🟩 GCC11              Pass: 100%/1   | Total: 18m 21s | Avg: 18m 21s | Max: 18m 21s
      🟩 GCC12              Pass: 100%/1   | Total: 17m 02s | Avg: 17m 02s | Max: 17m 02s
      🟩 GCC13              Pass: 100%/10  | Total:  2h 56m | Avg: 17m 38s | Max: 26m 36s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 23m 40s | Avg: 23m 40s | Max: 23m 40s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 28m 31s | Avg: 28m 31s | Max: 28m 31s | Hits:  49%/2239  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 27m 23s | Avg: 27m 23s | Max: 27m 23s | Hits:  46%/2476  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 04m | Avg: 32m 06s | Max: 34m 12s | Hits:  35%/5099  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 24s | Max: 31m 23s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/20  | Total:  6h 31m | Avg: 19m 34s | Max: 47m 37s
      🟩 GCC                Pass: 100%/21  | Total:  5h 43m | Avg: 16m 22s | Max: 26m 36s
      🟩 Intel              Pass: 100%/1   | Total: 23m 40s | Avg: 23m 40s | Max: 23m 40s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 00m | Avg: 30m 01s | Max: 34m 12s | Hits:  41%/9814  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 24s | Max: 31m 23s
    🟩 gpu
      🟩 v100               Pass: 100%/48  | Total: 15h 39m | Avg: 19m 34s | Max: 47m 37s | Hits:  41%/9814  
    🟩 jobs
      🟩 Build              Pass: 100%/41  | Total: 13h 03m | Avg: 19m 06s | Max: 34m 12s | Hits:  41%/9814  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 30m | Avg: 22m 42s | Max: 26m 36s
      🟩 Test               Pass: 100%/2   | Total:  1h 03m | Avg: 31m 53s | Max: 47m 37s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 51s | Avg:  1m 51s | Max:  1m 51s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 42s | Avg: 13m 42s | Max: 13m 42s
      🟩 90a                Pass: 100%/2   | Total: 24m 34s | Avg: 12m 17s | Max: 12m 23s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total:  1h 27m | Avg: 14m 34s | Max: 23m 41s
      🟩 14                 Pass: 100%/5   | Total:  1h 30m | Avg: 18m 04s | Max: 28m 31s | Hits:  49%/2239  
      🟩 17                 Pass: 100%/13  | Total:  4h 47m | Avg: 22m 07s | Max: 34m 12s | Hits:  36%/4952  
      🟩 20                 Pass: 100%/23  | Total:  7h 52m | Avg: 20m 32s | Max: 47m 37s | Hits:  44%/2623  
    
  • 🟩 cub: Pass: 100%/47 | Total: 1d 15h | Avg: 50m 43s | Max: 1h 12m | Hits: 2%/3144

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 13h | Avg: 50m 24s | Max:  1h 12m | Hits:   2%/3144  
      🟩 arm64              Pass: 100%/2   | Total:  1h 55m | Avg: 57m 55s | Max: 58m 38s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  5h 42m | Avg: 48m 53s | Max:  1h 05m | Hits:   2%/786   
      🟩 12.5               Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 09m
      🟩 12.6               Pass: 100%/38  | Total:  1d 07h | Avg: 50m 11s | Max:  1h 12m | Hits:   2%/2358  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m
      🟩 nvcc11.1           Pass: 100%/7   | Total:  5h 42m | Avg: 48m 53s | Max:  1h 05m | Hits:   2%/786   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 09m
      🟩 nvcc12.6           Pass: 100%/36  | Total:  1d 05h | Avg: 49m 25s | Max:  1h 12m | Hits:   2%/2358  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m
      🟩 nvcc               Pass: 100%/45  | Total:  1d 13h | Avg: 50m 07s | Max:  1h 12m | Hits:   2%/3144  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  3h 22m | Avg: 50m 41s | Max: 57m 00s
      🟩 Clang10            Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m
      🟩 Clang11            Pass: 100%/1   | Total: 56m 14s | Avg: 56m 14s | Max: 56m 14s
      🟩 Clang12            Pass: 100%/1   | Total: 54m 46s | Avg: 54m 46s | Max: 54m 46s
      🟩 Clang13            Pass: 100%/1   | Total: 55m 21s | Avg: 55m 21s | Max: 55m 21s
      🟩 Clang14            Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 Clang15            Pass: 100%/1   | Total: 58m 53s | Avg: 58m 53s | Max: 58m 53s
      🟩 Clang16            Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m
      🟩 Clang17            Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
      🟩 Clang18            Pass: 100%/7   | Total:  5h 42m | Avg: 48m 59s | Max:  1h 04m
      🟩 GCC6               Pass: 100%/2   | Total:  1h 31m | Avg: 45m 36s | Max: 46m 18s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 46m | Avg: 53m 15s | Max: 53m 37s
      🟩 GCC8               Pass: 100%/1   | Total: 58m 58s | Avg: 58m 58s | Max: 58m 58s
      🟩 GCC9               Pass: 100%/3   | Total:  2h 26m | Avg: 48m 59s | Max: 55m 38s
      🟩 GCC10              Pass: 100%/1   | Total: 58m 36s | Avg: 58m 36s | Max: 58m 36s
      🟩 GCC11              Pass: 100%/1   | Total: 57m 39s | Avg: 57m 39s | Max: 57m 39s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 43m | Avg: 34m 24s | Max: 55m 13s
      🟩 GCC13              Pass: 100%/8   | Total:  4h 31m | Avg: 33m 56s | Max: 58m 38s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m | Hits:   2%/786   
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m | Hits:   2%/786   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 12m | Hits:   2%/1572  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 09m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 16h 57m | Avg: 53m 33s | Max:  1h 04m
      🟩 GCC                Pass: 100%/21  | Total: 14h 54m | Avg: 42m 36s | Max: 58m 58s
      🟩 Intel              Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 33m | Avg:  1h 08m | Max:  1h 12m | Hits:   2%/3144  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 09m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 48m 01s | Avg: 24m 00s | Max: 27m 51s
      🟩 v100               Pass: 100%/45  | Total:  1d 14h | Avg: 51m 54s | Max:  1h 12m | Hits:   2%/3144  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 13h | Avg: 56m 15s | Max:  1h 12m | Hits:   2%/3144  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 16m 33s | Avg: 16m 33s | Max: 16m 33s
      🟩 GraphCapture       Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
      🟩 HostLaunch         Pass: 100%/3   | Total: 55m 27s | Avg: 18m 29s | Max: 20m 10s
      🟩 TestGPU            Pass: 100%/2   | Total: 46m 50s | Avg: 23m 25s | Max: 24m 45s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 48m 01s | Avg: 24m 00s | Max: 27m 51s
      🟩 90a                Pass: 100%/1   | Total: 25m 30s | Avg: 25m 30s | Max: 25m 30s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  3h 59m | Avg: 47m 59s | Max: 53m 37s
      🟩 14                 Pass: 100%/4   | Total:  3h 41m | Avg: 55m 25s | Max:  1h 05m | Hits:   2%/786   
      🟩 17                 Pass: 100%/12  | Total: 12h 05m | Avg:  1h 00m | Max:  1h 12m | Hits:   2%/1572  
      🟩 20                 Pass: 100%/26  | Total: 19h 57m | Avg: 46m 02s | Max:  1h 09m | Hits:   2%/786   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 1d 02h | Avg: 34m 48s | Max: 1h 17m | Hits: 30%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 41m 28s | Avg: 20m 44s | Max: 29m 44s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  1d 01h | Avg: 34m 49s | Max:  1h 17m | Hits:  30%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  1h 09m | Avg: 34m 43s | Max: 38m 42s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  3h 54m | Avg: 33m 32s | Max:  1h 05m | Hits:  12%/1852  
      🟩 12.5               Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 17m
      🟩 12.6               Pass: 100%/37  | Total: 20h 19m | Avg: 32m 56s | Max:  1h 15m | Hits:  34%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 00m | Avg: 30m 03s | Max: 32m 00s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  3h 54m | Avg: 33m 32s | Max:  1h 05m | Hits:  12%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 17m
      🟩 nvcc12.6           Pass: 100%/35  | Total: 19h 19m | Avg: 33m 06s | Max:  1h 15m | Hits:  34%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 03s | Max: 32m 00s
      🟩 nvcc               Pass: 100%/44  | Total:  1d 01h | Avg: 35m 01s | Max:  1h 17m | Hits:  30%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  1h 52m | Avg: 28m 04s | Max: 36m 04s
      🟩 Clang10            Pass: 100%/1   | Total: 35m 32s | Avg: 35m 32s | Max: 35m 32s
      🟩 Clang11            Pass: 100%/1   | Total: 31m 03s | Avg: 31m 03s | Max: 31m 03s
      🟩 Clang12            Pass: 100%/1   | Total: 31m 55s | Avg: 31m 55s | Max: 31m 55s
      🟩 Clang13            Pass: 100%/1   | Total: 36m 48s | Avg: 36m 48s | Max: 36m 48s
      🟩 Clang14            Pass: 100%/1   | Total: 31m 22s | Avg: 31m 22s | Max: 31m 22s
      🟩 Clang15            Pass: 100%/1   | Total: 32m 32s | Avg: 32m 32s | Max: 32m 32s
      🟩 Clang16            Pass: 100%/1   | Total: 34m 07s | Avg: 34m 07s | Max: 34m 07s
      🟩 Clang17            Pass: 100%/1   | Total: 33m 38s | Avg: 33m 38s | Max: 33m 38s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 56m | Avg: 25m 09s | Max: 33m 57s
      🟩 GCC6               Pass: 100%/2   | Total: 53m 03s | Avg: 26m 31s | Max: 31m 26s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 29s | Max: 35m 27s
      🟩 GCC8               Pass: 100%/1   | Total: 37m 56s | Avg: 37m 56s | Max: 37m 56s
      🟩 GCC9               Pass: 100%/3   | Total:  1h 43m | Avg: 34m 24s | Max: 40m 13s
      🟩 GCC10              Pass: 100%/1   | Total: 38m 31s | Avg: 38m 31s | Max: 38m 31s
      🟩 GCC11              Pass: 100%/1   | Total: 34m 33s | Avg: 34m 33s | Max: 34m 33s
      🟩 GCC12              Pass: 100%/1   | Total: 39m 50s | Avg: 39m 50s | Max: 39m 50s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 19m | Avg: 24m 57s | Max: 42m 16s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 40m 22s | Avg: 40m 22s | Max: 40m 22s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m | Hits:  12%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m | Hits:  12%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 40m | Avg: 53m 24s | Max:  1h 15m | Hits:  41%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 17m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  9h 15m | Avg: 29m 13s | Max: 36m 48s
      🟩 GCC                Pass: 100%/19  | Total:  9h 29m | Avg: 29m 59s | Max: 42m 16s
      🟩 Intel              Pass: 100%/1   | Total: 40m 22s | Avg: 40m 22s | Max: 40m 22s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 48m | Avg: 57m 42s | Max:  1h 15m | Hits:  30%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 17m
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  1d 02h | Avg: 34m 48s | Max:  1h 17m | Hits:  30%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 01h | Avg: 38m 12s | Max:  1h 17m | Hits:  12%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 37m 04s | Avg: 12m 21s | Max: 21m 20s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 36m 22s | Avg: 12m 07s | Max: 12m 30s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 22m 33s | Avg: 22m 33s | Max: 22m 33s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  2h 01m | Avg: 24m 12s | Max: 27m 31s
      🟩 14                 Pass: 100%/4   | Total:  2h 48m | Avg: 42m 02s | Max:  1h 05m | Hits:  12%/1852  
      🟩 17                 Pass: 100%/12  | Total:  8h 32m | Avg: 42m 41s | Max:  1h 10m | Hits:  12%/3704  
      🟩 20                 Pass: 100%/23  | Total: 12h 38m | Avg: 32m 58s | Max:  1h 17m | Hits:  56%/3704  
    
  • 🟩 cudax: Pass: 100%/26 | Total: 5h 54m | Avg: 13m 37s | Max: 18m 21s | Hits: 34%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  5h 00m | Avg: 13m 39s | Max: 18m 21s | Hits:  34%/312   
      🟩 arm64              Pass: 100%/4   | Total: 53m 56s | Avg: 13m 29s | Max: 14m 37s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 37m 43s | Avg: 12m 34s | Max: 14m 08s | Hits:  34%/156   
      🟩 12.5               Pass: 100%/2   | Total: 18m 04s | Avg:  9m 02s | Max:  9m 22s
      🟩 12.6               Pass: 100%/21  | Total:  4h 58m | Avg: 14m 12s | Max: 18m 21s | Hits:  34%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 37m 43s | Avg: 12m 34s | Max: 14m 08s | Hits:  34%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 04s | Avg:  9m 02s | Max:  9m 22s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  4h 58m | Avg: 14m 12s | Max: 18m 21s | Hits:  34%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  5h 54m | Avg: 13m 37s | Max: 18m 21s | Hits:  34%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total: 13m 19s | Avg: 13m 19s | Max: 13m 19s
      🟩 Clang10            Pass: 100%/1   | Total: 16m 06s | Avg: 16m 06s | Max: 16m 06s
      🟩 Clang11            Pass: 100%/1   | Total: 13m 31s | Avg: 13m 31s | Max: 13m 31s
      🟩 Clang12            Pass: 100%/1   | Total: 13m 18s | Avg: 13m 18s | Max: 13m 18s
      🟩 Clang13            Pass: 100%/1   | Total: 14m 14s | Avg: 14m 14s | Max: 14m 14s
      🟩 Clang14            Pass: 100%/1   | Total: 14m 05s | Avg: 14m 05s | Max: 14m 05s
      🟩 Clang15            Pass: 100%/1   | Total: 14m 33s | Avg: 14m 33s | Max: 14m 33s
      🟩 Clang16            Pass: 100%/1   | Total: 14m 35s | Avg: 14m 35s | Max: 14m 35s
      🟩 Clang17            Pass: 100%/1   | Total: 14m 37s | Avg: 14m 37s | Max: 14m 37s
      🟩 Clang18            Pass: 100%/4   | Total: 55m 56s | Avg: 13m 59s | Max: 15m 18s
      🟩 GCC9               Pass: 100%/1   | Total: 14m 08s | Avg: 14m 08s | Max: 14m 08s
      🟩 GCC10              Pass: 100%/1   | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s
      🟩 GCC11              Pass: 100%/1   | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s
      🟩 GCC12              Pass: 100%/2   | Total: 34m 12s | Avg: 17m 06s | Max: 18m 21s
      🟩 GCC13              Pass: 100%/4   | Total: 52m 28s | Avg: 13m 07s | Max: 14m 37s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 16s | Avg: 10m 16s | Max: 10m 16s | Hits:  34%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 11s | Avg: 10m 11s | Max: 10m 11s | Hits:  34%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 04s | Avg:  9m 02s | Max:  9m 22s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total:  3h 04m | Avg: 14m 10s | Max: 16m 06s
      🟩 GCC                Pass: 100%/9   | Total:  2h 11m | Avg: 14m 36s | Max: 18m 21s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 27s | Avg: 10m 13s | Max: 10m 16s | Hits:  34%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 04s | Avg:  9m 02s | Max:  9m 22s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  5h 54m | Avg: 13m 37s | Max: 18m 21s | Hits:  34%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  5h 23m | Avg: 13m 27s | Max: 18m 21s | Hits:  34%/312   
      🟩 Test               Pass: 100%/2   | Total: 31m 09s | Avg: 15m 34s | Max: 15m 51s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 11m 38s | Avg: 11m 38s | Max: 11m 38s
      🟩 90a                Pass: 100%/1   | Total: 12m 36s | Avg: 12m 36s | Max: 12m 36s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total:  1h 14m | Avg: 12m 22s | Max: 14m 08s
      🟩 20                 Pass: 100%/20  | Total:  4h 40m | Avg: 14m 00s | Max: 18m 21s | Hits:  34%/312   
    
  • 🟩 cccl: Pass: 100%/6 | Total: 27m 59s | Avg: 4m 39s | Max: 5m 14s

    🟩 cpu
      🟩 amd64              Pass: 100%/6   | Total: 27m 59s | Avg:  4m 39s | Max:  5m 14s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  8m 03s | Avg:  4m 01s | Max:  4m 21s
      🟩 12.0               Pass: 100%/2   | Total:  9m 33s | Avg:  4m 46s | Max:  4m 58s
      🟩 12.6               Pass: 100%/2   | Total: 10m 23s | Avg:  5m 11s | Max:  5m 14s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  8m 03s | Avg:  4m 01s | Max:  4m 21s
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 33s | Avg:  4m 46s | Max:  4m 58s
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 23s | Avg:  5m 11s | Max:  5m 14s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/6   | Total: 27m 59s | Avg:  4m 39s | Max:  5m 14s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 21s | Avg:  4m 21s | Max:  4m 21s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 58s | Avg:  4m 58s | Max:  4m 58s
      🟩 Clang18            Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 42s | Avg:  3m 42s | Max:  3m 42s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 35s | Avg:  4m 35s | Max:  4m 35s
      🟩 GCC13              Pass: 100%/1   | Total:  5m 09s | Avg:  5m 09s | Max:  5m 09s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/3   | Total: 14m 33s | Avg:  4m 51s | Max:  5m 14s
      🟩 GCC                Pass: 100%/3   | Total: 13m 26s | Avg:  4m 28s | Max:  5m 09s
    🟩 gpu
      🟩 v100               Pass: 100%/6   | Total: 27m 59s | Avg:  4m 39s | Max:  5m 14s
    🟩 jobs
      🟩 Infra              Pass: 100%/6   | Total: 27m 59s | Avg:  4m 39s | Max:  5m 14s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 8m 47s | Avg: 4m 23s | Max: 6m 41s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 47s | Avg:  4m 23s | Max:  6m 41s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  8m 47s | Avg:  4m 23s | Max:  6m 41s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 47s | Avg:  4m 23s | Max:  6m 41s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  8m 47s | Avg:  4m 23s | Max:  6m 41s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  8m 47s | Avg:  4m 23s | Max:  6m 41s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  8m 47s | Avg:  4m 23s | Max:  6m 41s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  8m 47s | Avg:  4m 23s | Max:  6m 41s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 06s | Avg:  2m 06s | Max:  2m 06s
      🟩 Test               Pass: 100%/1   | Total:  6m 41s | Avg:  6m 41s | Max:  6m 41s
    
  • 🟩 python: Pass: 100%/1 | Total: 44m 32s | Avg: 44m 32s | Max: 44m 32s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 44m 32s | Avg: 44m 32s | Max: 44m 32s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 44m 32s | Avg: 44m 32s | Max: 44m 32s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 44m 32s | Avg: 44m 32s | Max: 44m 32s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 44m 32s | Avg: 44m 32s | Max: 44m 32s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 44m 32s | Avg: 44m 32s | Max: 44m 32s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 44m 32s | Avg: 44m 32s | Max: 44m 32s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 44m 32s | Avg: 44m 32s | Max: 44m 32s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 44m 32s | Avg: 44m 32s | Max: 44m 32s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
+/- python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 176)

# Runner
125 linux-amd64-cpu16
25 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Copy link
Contributor

🟨 CI finished in 51m 02s: Pass: 99%/176 | Total: 1d 05h | Avg: 9m 59s | Max: 41m 53s | Hits: 82%/22530
  • 🟨 cub: Pass: 97%/47 | Total: 6h 51m | Avg: 8m 44s | Max: 38m 22s | Hits: 99%/3144

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/45  | Total:  6h 41m | Avg:  8m 55s | Max: 38m 22s | Hits:  99%/3144  
      🟩 arm64              Pass: 100%/2   | Total:  9m 39s | Avg:  4m 49s | Max:  5m 03s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/7   | Total: 40m 12s | Avg:  5m 44s | Max: 14m 59s | Hits:  99%/786   
      🟩 12.5               Pass: 100%/2   | Total: 19m 55s | Avg:  9m 57s | Max: 10m 01s
      🔍 12.6               Pass:  97%/38  | Total:  5h 51m | Avg:  9m 14s | Max: 38m 22s | Hits:  99%/2358  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 10s | Avg:  4m 05s | Max:  4m 05s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 40m 12s | Avg:  5m 44s | Max: 14m 59s | Hits:  99%/786   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 19m 55s | Avg:  9m 57s | Max: 10m 01s
      🔍 nvcc12.6           Pass:  97%/36  | Total:  5h 42m | Avg:  9m 31s | Max: 38m 22s | Hits:  99%/2358  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 10s | Avg:  4m 05s | Max:  4m 05s
      🔍 nvcc               Pass:  97%/45  | Total:  6h 43m | Avg:  8m 57s | Max: 38m 22s | Hits:  99%/3144  
    🔍 cxx: GCC13 🔍
      🟩 Clang9             Pass: 100%/4   | Total: 20m 15s | Avg:  5m 03s | Max:  5m 55s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 35s | Avg:  6m 35s | Max:  6m 35s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 08s | Avg:  5m 08s | Max:  5m 08s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 34s | Avg:  5m 34s | Max:  5m 34s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 16s | Avg:  5m 16s | Max:  5m 16s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 12s | Avg:  5m 12s | Max:  5m 12s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 23m | Avg: 11m 53s | Max: 38m 22s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 11s | Avg:  4m 05s | Max:  4m 19s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 58s | Avg:  4m 59s | Max:  5m 09s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 16s | Avg:  4m 45s | Max:  5m 55s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 22s | Avg:  5m 22s | Max:  5m 22s
      🟩 GCC12              Pass: 100%/3   | Total: 25m 40s | Avg:  8m 33s | Max: 16m 00s
      🔍 GCC13              Pass:  87%/8   | Total:  1h 47m | Avg: 13m 25s | Max: 30m 18s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  7m 03s | Avg:  7m 03s | Max:  7m 03s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s | Hits:  99%/786   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 13m 30s | Avg: 13m 30s | Max: 13m 30s | Hits:  99%/786   
      🟩 MSVC14.39          Pass: 100%/2   | Total: 26m 20s | Avg: 13m 10s | Max: 13m 30s | Hits:  99%/1572  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 19m 55s | Avg:  9m 57s | Max: 10m 01s
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/19  | Total:  2h 27m | Avg:  7m 46s | Max: 38m 22s
      🔍 GCC                Pass:  95%/21  | Total:  3h 01m | Avg:  8m 39s | Max: 30m 18s
      🟩 Intel              Pass: 100%/1   | Total:  7m 03s | Avg:  7m 03s | Max:  7m 03s
      🟩 MSVC               Pass: 100%/4   | Total: 54m 49s | Avg: 13m 42s | Max: 14m 59s | Hits:  99%/3144  
      🟩 NVHPC              Pass: 100%/2   | Total: 19m 55s | Avg:  9m 57s | Max: 10m 01s
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 20m 01s | Avg: 10m 00s | Max: 16m 00s
      🔍 v100               Pass:  97%/45  | Total:  6h 31m | Avg:  8m 41s | Max: 38m 22s | Hits:  99%/3144  
    🔍 jobs: HostLaunch 🔍
      🟩 Build              Pass: 100%/40  | Total:  4h 09m | Avg:  6m 13s | Max: 14m 59s | Hits:  99%/3144  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 23m 12s | Avg: 23m 12s | Max: 23m 12s
      🟩 GraphCapture       Pass: 100%/1   | Total: 29m 22s | Avg: 29m 22s | Max: 29m 22s
      🔍 HostLaunch         Pass:  66%/3   | Total: 40m 48s | Avg: 13m 36s | Max: 21m 29s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 08m | Avg: 34m 20s | Max: 38m 22s
    🔍 std: 20 🔍
      🟩 11                 Pass: 100%/5   | Total: 22m 39s | Avg:  4m 31s | Max:  5m 39s
      🟩 14                 Pass: 100%/4   | Total: 30m 22s | Avg:  7m 35s | Max: 14m 59s | Hits:  99%/786   
      🟩 17                 Pass: 100%/12  | Total:  1h 25m | Avg:  7m 06s | Max: 13m 30s | Hits:  99%/1572  
      🔍 20                 Pass:  96%/26  | Total:  4h 32m | Avg: 10m 29s | Max: 38m 22s | Hits:  99%/786   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 20m 01s | Avg: 10m 00s | Max: 16m 00s
      🟩 90a                Pass: 100%/1   | Total:  4m 30s | Avg:  4m 30s | Max:  4m 30s
    
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 13h 07m | Avg: 16m 24s | Max: 32m 36s | Hits: 61%/9814

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total: 12h 31m | Avg: 16m 19s | Max: 32m 36s | Hits:  61%/9814  
      🟩 arm64              Pass: 100%/2   | Total: 36m 41s | Avg: 18m 20s | Max: 20m 56s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 33m | Avg: 13m 18s | Max: 27m 20s | Hits:  57%/2239  
      🟩 12.5               Pass: 100%/2   | Total: 32m 47s | Avg: 16m 23s | Max: 24m 50s
      🟩 12.6               Pass: 100%/39  | Total: 11h 02m | Avg: 16m 58s | Max: 32m 36s | Hits:  62%/7575  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 05m | Avg: 16m 16s | Max: 21m 15s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 33m | Avg: 13m 18s | Max: 27m 20s | Hits:  57%/2239  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 32m 47s | Avg: 16m 23s | Max: 24m 50s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  9h 56m | Avg: 17m 03s | Max: 32m 36s | Hits:  62%/7575  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 05m | Avg: 16m 16s | Max: 21m 15s
      🟩 nvcc               Pass: 100%/44  | Total: 12h 02m | Avg: 16m 25s | Max: 32m 36s | Hits:  61%/9814  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 59m 32s | Avg: 14m 53s | Max: 19m 22s
      🟩 Clang10            Pass: 100%/1   | Total: 16m 28s | Avg: 16m 28s | Max: 16m 28s
      🟩 Clang11            Pass: 100%/1   | Total: 21m 12s | Avg: 21m 12s | Max: 21m 12s
      🟩 Clang12            Pass: 100%/1   | Total:  4m 25s | Avg:  4m 25s | Max:  4m 25s
      🟩 Clang13            Pass: 100%/1   | Total: 14m 47s | Avg: 14m 47s | Max: 14m 47s
      🟩 Clang14            Pass: 100%/1   | Total: 16m 16s | Avg: 16m 16s | Max: 16m 16s
      🟩 Clang15            Pass: 100%/1   | Total: 24m 47s | Avg: 24m 47s | Max: 24m 47s
      🟩 Clang16            Pass: 100%/1   | Total: 16m 00s | Avg: 16m 00s | Max: 16m 00s
      🟩 Clang17            Pass: 100%/1   | Total: 12m 01s | Avg: 12m 01s | Max: 12m 01s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 12m | Avg: 16m 32s | Max: 21m 15s
      🟩 GCC6               Pass: 100%/2   | Total: 10m 27s | Avg:  5m 13s | Max:  7m 42s
      🟩 GCC7               Pass: 100%/2   | Total: 27m 05s | Avg: 13m 32s | Max: 15m 30s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s
      🟩 GCC9               Pass: 100%/3   | Total: 34m 33s | Avg: 11m 31s | Max: 17m 14s
      🟩 GCC10              Pass: 100%/1   | Total: 15m 32s | Avg: 15m 32s | Max: 15m 32s
      🟩 GCC11              Pass: 100%/1   | Total: 10m 00s | Avg: 10m 00s | Max: 10m 00s
      🟩 GCC12              Pass: 100%/1   | Total: 22m 03s | Avg: 22m 03s | Max: 22m 03s
      🟩 GCC13              Pass: 100%/10  | Total:  3h 11m | Avg: 19m 10s | Max: 32m 34s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 18m 03s | Avg: 18m 03s | Max: 18m 03s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 27m 20s | Avg: 27m 20s | Max: 27m 20s | Hits:  57%/2239  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 32m 36s | Avg: 32m 36s | Max: 32m 36s | Hits:  32%/2476  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 44m 34s | Avg: 22m 17s | Max: 31m 30s | Hits:  76%/5099  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 32m 47s | Avg: 16m 23s | Max: 24m 50s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/20  | Total:  5h 17m | Avg: 15m 53s | Max: 24m 47s
      🟩 GCC                Pass: 100%/21  | Total:  5h 14m | Avg: 14m 59s | Max: 32m 34s
      🟩 Intel              Pass: 100%/1   | Total: 18m 03s | Avg: 18m 03s | Max: 18m 03s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 44m | Avg: 26m 07s | Max: 32m 36s | Hits:  61%/9814  
      🟩 NVHPC              Pass: 100%/2   | Total: 32m 47s | Avg: 16m 23s | Max: 24m 50s
    🟩 gpu
      🟩 v100               Pass: 100%/48  | Total: 13h 07m | Avg: 16m 24s | Max: 32m 36s | Hits:  61%/9814  
    🟩 jobs
      🟩 Build              Pass: 100%/41  | Total: 10h 36m | Avg: 15m 32s | Max: 32m 36s | Hits:  61%/9814  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 39m | Avg: 24m 56s | Max: 28m 12s
      🟩 Test               Pass: 100%/2   | Total: 49m 12s | Avg: 24m 36s | Max: 32m 34s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 00s | Avg:  2m 00s | Max:  2m 00s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 19s | Avg: 13m 19s | Max: 13m 19s
      🟩 90a                Pass: 100%/2   | Total: 15m 52s | Avg:  7m 56s | Max: 12m 02s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total:  1h 14m | Avg: 12m 21s | Max: 26m 45s
      🟩 14                 Pass: 100%/5   | Total:  1h 31m | Avg: 18m 16s | Max: 28m 12s | Hits:  57%/2239  
      🟩 17                 Pass: 100%/13  | Total:  3h 54m | Avg: 18m 01s | Max: 32m 36s | Hits:  42%/4952  
      🟩 20                 Pass: 100%/23  | Total:  6h 26m | Avg: 16m 47s | Max: 32m 34s | Hits:  98%/2623  
    
  • 🟩 thrust: Pass: 100%/46 | Total: 5h 59m | Avg: 7m 49s | Max: 27m 23s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 19m 05s | Avg:  9m 32s | Max: 13m 03s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  5h 50m | Avg:  7m 57s | Max: 27m 23s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 40s | Avg:  4m 50s | Max:  5m 03s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 42m 39s | Avg:  6m 05s | Max: 17m 34s | Hits:  99%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 28m 37s | Avg: 14m 18s | Max: 15m 20s
      🟩 12.6               Pass: 100%/37  | Total:  4h 48m | Avg:  7m 47s | Max: 27m 23s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 15s | Avg:  5m 07s | Max:  5m 25s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 42m 39s | Avg:  6m 05s | Max: 17m 34s | Hits:  99%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 28m 37s | Avg: 14m 18s | Max: 15m 20s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  4h 38m | Avg:  7m 57s | Max: 27m 23s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 15s | Avg:  5m 07s | Max:  5m 25s
      🟩 nvcc               Pass: 100%/44  | Total:  5h 49m | Avg:  7m 56s | Max: 27m 23s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 20m 41s | Avg:  5m 10s | Max:  6m 37s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 22s | Avg:  6m 22s | Max:  6m 22s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 25s | Avg:  5m 25s | Max:  5m 25s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 01s | Avg:  5m 01s | Max:  5m 01s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 01s | Avg:  5m 01s | Max:  5m 01s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 52s | Avg:  5m 52s | Max:  5m 52s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 11s | Avg:  5m 11s | Max:  5m 11s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 38s | Avg:  5m 38s | Max:  5m 38s
      🟩 Clang18            Pass: 100%/7   | Total: 43m 57s | Avg:  6m 16s | Max: 11m 20s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 56s | Avg:  3m 58s | Max:  4m 09s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 45s | Avg:  4m 52s | Max:  4m 57s
      🟩 GCC8               Pass: 100%/1   | Total:  4m 58s | Avg:  4m 58s | Max:  4m 58s
      🟩 GCC9               Pass: 100%/3   | Total: 13m 37s | Avg:  4m 32s | Max:  5m 14s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 39s | Avg:  5m 39s | Max:  5m 39s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 15m | Avg:  9m 28s | Max: 27m 23s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  8m 00s | Avg:  8m 00s | Max:  8m 00s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 17m 34s | Avg: 17m 34s | Max: 17m 34s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 15m 44s | Avg: 15m 44s | Max: 15m 44s | Hits:  99%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 53m 02s | Avg: 17m 40s | Max: 21m 48s | Hits:  99%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 28m 37s | Avg: 14m 18s | Max: 15m 20s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 48m | Avg:  5m 42s | Max: 11m 20s
      🟩 GCC                Pass: 100%/19  | Total:  2h 08m | Avg:  6m 45s | Max: 27m 23s
      🟩 Intel              Pass: 100%/1   | Total:  8m 00s | Avg:  8m 00s | Max:  8m 00s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 26m | Avg: 17m 16s | Max: 21m 48s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 28m 37s | Avg: 14m 18s | Max: 15m 20s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  5h 59m | Avg:  7m 49s | Max: 27m 23s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 31m | Avg:  6m 47s | Max: 17m 34s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 36m 32s | Avg: 12m 10s | Max: 21m 48s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 51m 46s | Avg: 17m 15s | Max: 27m 23s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 35s | Avg:  4m 35s | Max:  4m 35s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 21m 41s | Avg:  4m 20s | Max:  5m 18s
      🟩 14                 Pass: 100%/4   | Total: 33m 17s | Avg:  8m 19s | Max: 17m 34s | Hits:  99%/1852  
      🟩 17                 Pass: 100%/12  | Total:  1h 34m | Avg:  7m 53s | Max: 15m 44s | Hits:  99%/3704  
      🟩 20                 Pass: 100%/23  | Total:  3h 10m | Avg:  8m 18s | Max: 27m 23s | Hits:  99%/3704  
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 02m | Avg: 4m 42s | Max: 19m 14s | Hits: 92%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  1h 52m | Avg:  5m 06s | Max: 19m 14s | Hits:  92%/312   
      🟩 arm64              Pass: 100%/4   | Total: 10m 21s | Avg:  2m 35s | Max:  2m 37s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 14m 02s | Avg:  4m 40s | Max:  8m 15s | Hits:  92%/156   
      🟩 12.5               Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  5m 31s
      🟩 12.6               Pass: 100%/21  | Total:  1h 37m | Avg:  4m 39s | Max: 19m 14s | Hits:  92%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 14m 02s | Avg:  4m 40s | Max:  8m 15s | Hits:  92%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  5m 31s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  1h 37m | Avg:  4m 39s | Max: 19m 14s | Hits:  92%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 02m | Avg:  4m 42s | Max: 19m 14s | Hits:  92%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 03s | Avg:  3m 03s | Max:  3m 03s
      🟩 Clang10            Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 Clang11            Pass: 100%/1   | Total:  2m 58s | Avg:  2m 58s | Max:  2m 58s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 13s | Avg:  3m 13s | Max:  3m 13s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 14s | Avg:  3m 14s | Max:  3m 14s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 09s | Avg:  3m 09s | Max:  3m 09s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 08s | Avg:  3m 08s | Max:  3m 08s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 08s | Avg:  3m 08s | Max:  3m 08s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 17s | Avg:  3m 17s | Max:  3m 17s
      🟩 Clang18            Pass: 100%/4   | Total: 27m 28s | Avg:  6m 52s | Max: 19m 14s
      🟩 GCC9               Pass: 100%/1   | Total:  2m 44s | Avg:  2m 44s | Max:  2m 44s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 03s | Avg:  3m 03s | Max:  3m 03s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 05s | Avg:  3m 05s | Max:  3m 05s
      🟩 GCC12              Pass: 100%/2   | Total: 18m 43s | Avg:  9m 21s | Max: 15m 31s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 41s | Avg:  2m 40s | Max:  2m 49s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  8m 15s | Avg:  8m 15s | Max:  8m 15s | Hits:  92%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  8m 52s | Avg:  8m 52s | Max:  8m 52s | Hits:  92%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  5m 31s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total: 56m 31s | Avg:  4m 20s | Max: 19m 14s
      🟩 GCC                Pass: 100%/9   | Total: 38m 16s | Avg:  4m 15s | Max: 15m 31s
      🟩 MSVC               Pass: 100%/2   | Total: 17m 07s | Avg:  8m 33s | Max:  8m 52s | Hits:  92%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  5m 31s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 02m | Avg:  4m 42s | Max: 19m 14s | Hits:  92%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 27m | Avg:  3m 39s | Max:  8m 52s | Hits:  92%/312   
      🟩 Test               Pass: 100%/2   | Total: 34m 45s | Avg: 17m 22s | Max: 19m 14s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 49s | Avg:  2m 49s | Max:  2m 49s
      🟩 90a                Pass: 100%/1   | Total:  2m 43s | Avg:  2m 43s | Max:  2m 43s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 18m 57s | Avg:  3m 09s | Max:  5m 11s
      🟩 20                 Pass: 100%/20  | Total:  1h 43m | Avg:  5m 10s | Max: 19m 14s | Hits:  92%/312   
    
  • 🟩 cccl: Pass: 100%/6 | Total: 24m 48s | Avg: 4m 08s | Max: 4m 33s

    🟩 cpu
      🟩 amd64              Pass: 100%/6   | Total: 24m 48s | Avg:  4m 08s | Max:  4m 33s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  7m 04s | Avg:  3m 32s | Max:  3m 52s
      🟩 12.0               Pass: 100%/2   | Total:  9m 00s | Avg:  4m 30s | Max:  4m 33s
      🟩 12.6               Pass: 100%/2   | Total:  8m 44s | Avg:  4m 22s | Max:  4m 28s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  7m 04s | Avg:  3m 32s | Max:  3m 52s
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 00s | Avg:  4m 30s | Max:  4m 33s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 44s | Avg:  4m 22s | Max:  4m 28s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/6   | Total: 24m 48s | Avg:  4m 08s | Max:  4m 33s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 33s | Avg:  4m 33s | Max:  4m 33s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 28s | Avg:  4m 28s | Max:  4m 28s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 27s | Avg:  4m 27s | Max:  4m 27s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 16s | Avg:  4m 16s | Max:  4m 16s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/3   | Total: 12m 53s | Avg:  4m 17s | Max:  4m 33s
      🟩 GCC                Pass: 100%/3   | Total: 11m 55s | Avg:  3m 58s | Max:  4m 27s
    🟩 gpu
      🟩 v100               Pass: 100%/6   | Total: 24m 48s | Avg:  4m 08s | Max:  4m 33s
    🟩 jobs
      🟩 Infra              Pass: 100%/6   | Total: 24m 48s | Avg:  4m 08s | Max:  4m 33s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 44s | Avg: 4m 52s | Max: 7m 37s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  7m 37s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  7m 37s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  7m 37s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  7m 37s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  7m 37s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  7m 37s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  7m 37s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 07s | Avg:  2m 07s | Max:  2m 07s
      🟩 Test               Pass: 100%/1   | Total:  7m 37s | Avg:  7m 37s | Max:  7m 37s
    
  • 🟩 python: Pass: 100%/1 | Total: 41m 53s | Avg: 41m 53s | Max: 41m 53s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 41m 53s | Avg: 41m 53s | Max: 41m 53s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 41m 53s | Avg: 41m 53s | Max: 41m 53s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 41m 53s | Avg: 41m 53s | Max: 41m 53s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 41m 53s | Avg: 41m 53s | Max: 41m 53s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 41m 53s | Avg: 41m 53s | Max: 41m 53s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 41m 53s | Avg: 41m 53s | Max: 41m 53s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 41m 53s | Avg: 41m 53s | Max: 41m 53s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 41m 53s | Avg: 41m 53s | Max: 41m 53s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
+/- python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 176)

# Runner
125 linux-amd64-cpu16
25 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Copy link
Contributor

🟩 CI finished in 1h 08m: Pass: 100%/176 | Total: 1d 05h | Avg: 10m 09s | Max: 47m 06s | Hits: 83%/22530
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 13h 01m | Avg: 16m 16s | Max: 47m 06s | Hits: 64%/9814

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total: 12h 36m | Avg: 16m 27s | Max: 47m 06s | Hits:  64%/9814  
      🟩 arm64              Pass: 100%/2   | Total: 24m 25s | Avg: 12m 12s | Max: 20m 49s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 31m | Avg: 13m 02s | Max: 21m 44s | Hits:  98%/2239  
      🟩 12.5               Pass: 100%/2   | Total: 52m 36s | Avg: 26m 18s | Max: 30m 09s
      🟩 12.6               Pass: 100%/39  | Total: 10h 37m | Avg: 16m 20s | Max: 47m 06s | Hits:  53%/7575  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 08m | Avg: 17m 12s | Max: 22m 18s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 31m | Avg: 13m 02s | Max: 21m 44s | Hits:  98%/2239  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 52m 36s | Avg: 26m 18s | Max: 30m 09s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  9h 28m | Avg: 16m 14s | Max: 47m 06s | Hits:  53%/7575  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 08m | Avg: 17m 12s | Max: 22m 18s
      🟩 nvcc               Pass: 100%/44  | Total: 11h 52m | Avg: 16m 11s | Max: 47m 06s | Hits:  64%/9814  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 44m 58s | Avg: 11m 14s | Max: 21m 44s
      🟩 Clang10            Pass: 100%/1   | Total: 22m 27s | Avg: 22m 27s | Max: 22m 27s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 Clang12            Pass: 100%/1   | Total: 16m 20s | Avg: 16m 20s | Max: 16m 20s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 28s | Avg:  4m 28s | Max:  4m 28s
      🟩 Clang16            Pass: 100%/1   | Total: 15m 59s | Avg: 15m 59s | Max: 15m 59s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 23s | Avg:  4m 23s | Max:  4m 23s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 43m | Avg: 12m 58s | Max: 22m 18s
      🟩 GCC6               Pass: 100%/2   | Total: 22m 05s | Avg: 11m 02s | Max: 19m 12s
      🟩 GCC7               Pass: 100%/2   | Total:  6m 21s | Avg:  3m 10s | Max:  3m 15s
      🟩 GCC8               Pass: 100%/1   | Total: 14m 57s | Avg: 14m 57s | Max: 14m 57s
      🟩 GCC9               Pass: 100%/3   | Total: 28m 32s | Avg:  9m 30s | Max: 14m 26s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 44s | Avg:  3m 44s | Max:  3m 44s
      🟩 GCC11              Pass: 100%/1   | Total: 22m 29s | Avg: 22m 29s | Max: 22m 29s
      🟩 GCC12              Pass: 100%/1   | Total: 17m 54s | Avg: 17m 54s | Max: 17m 54s
      🟩 GCC13              Pass: 100%/10  | Total:  4h 34m | Avg: 27m 28s | Max: 47m 06s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 17m 22s | Avg: 17m 22s | Max: 17m 22s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 18m 19s | Avg: 18m 19s | Max: 18m 19s | Hits:  98%/2239  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 12m 45s | Avg: 12m 45s | Max: 12m 45s | Hits:  99%/2476  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 04m | Avg: 32m 26s | Max: 34m 29s | Hits:  32%/5099  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 52m 36s | Avg: 26m 18s | Max: 30m 09s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/20  | Total:  3h 44m | Avg: 11m 13s | Max: 22m 27s
      🟩 GCC                Pass: 100%/21  | Total:  6h 30m | Avg: 18m 36s | Max: 47m 06s
      🟩 Intel              Pass: 100%/1   | Total: 17m 22s | Avg: 17m 22s | Max: 17m 22s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 35m | Avg: 23m 59s | Max: 34m 29s | Hits:  64%/9814  
      🟩 NVHPC              Pass: 100%/2   | Total: 52m 36s | Avg: 26m 18s | Max: 30m 09s
    🟩 gpu
      🟩 v100               Pass: 100%/48  | Total: 13h 01m | Avg: 16m 16s | Max: 47m 06s | Hits:  64%/9814  
    🟩 jobs
      🟩 Build              Pass: 100%/41  | Total:  9h 00m | Avg: 13m 11s | Max: 34m 29s | Hits:  64%/9814  
      🟩 NVRTC              Pass: 100%/4   | Total:  2h 57m | Avg: 44m 23s | Max: 47m 06s
      🟩 Test               Pass: 100%/2   | Total:  1h 00m | Avg: 30m 23s | Max: 44m 38s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 57s | Avg:  1m 57s | Max:  1m 57s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 18s | Avg: 13m 18s | Max: 13m 18s
      🟩 90a                Pass: 100%/2   | Total: 16m 45s | Avg:  8m 22s | Max: 12m 48s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total:  1h 28m | Avg: 14m 48s | Max: 38m 24s
      🟩 14                 Pass: 100%/5   | Total:  1h 13m | Avg: 14m 46s | Max: 45m 02s | Hits:  98%/2239  
      🟩 17                 Pass: 100%/13  | Total:  3h 56m | Avg: 18m 12s | Max: 47m 00s | Hits:  65%/4952  
      🟩 20                 Pass: 100%/23  | Total:  6h 19m | Avg: 16m 30s | Max: 47m 06s | Hits:  31%/2623  
    
  • 🟩 cub: Pass: 100%/47 | Total: 6h 48m | Avg: 8m 40s | Max: 32m 50s | Hits: 99%/3144

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  6h 38m | Avg:  8m 51s | Max: 32m 50s | Hits:  99%/3144  
      🟩 arm64              Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  5m 04s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 40m 44s | Avg:  5m 49s | Max: 14m 43s | Hits:  99%/786   
      🟩 12.5               Pass: 100%/2   | Total: 19m 07s | Avg:  9m 33s | Max: 10m 12s
      🟩 12.6               Pass: 100%/38  | Total:  5h 48m | Avg:  9m 09s | Max: 32m 50s | Hits:  99%/2358  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 37s | Avg:  4m 18s | Max:  4m 23s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 40m 44s | Avg:  5m 49s | Max: 14m 43s | Hits:  99%/786   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 19m 07s | Avg:  9m 33s | Max: 10m 12s
      🟩 nvcc12.6           Pass: 100%/36  | Total:  5h 39m | Avg:  9m 25s | Max: 32m 50s | Hits:  99%/2358  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 37s | Avg:  4m 18s | Max:  4m 23s
      🟩 nvcc               Pass: 100%/45  | Total:  6h 39m | Avg:  8m 52s | Max: 32m 50s | Hits:  99%/3144  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 21m 25s | Avg:  5m 21s | Max:  6m 23s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 39s | Avg:  6m 39s | Max:  6m 39s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 08s | Avg:  5m 08s | Max:  5m 08s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 34s | Avg:  5m 34s | Max:  5m 34s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 41s | Avg:  5m 41s | Max:  5m 41s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 46s | Avg:  5m 46s | Max:  5m 46s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 22m | Avg: 11m 44s | Max: 32m 50s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 27s | Avg:  4m 13s | Max:  4m 33s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 09s | Avg:  5m 04s | Max:  5m 19s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 20s | Avg:  5m 20s | Max:  5m 20s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 10s | Avg:  4m 43s | Max:  5m 21s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 GCC12              Pass: 100%/3   | Total: 32m 29s | Avg: 10m 49s | Max: 22m 34s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 37m | Avg: 12m 14s | Max: 22m 48s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 57s | Avg:  6m 57s | Max:  6m 57s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 43s | Avg: 14m 43s | Max: 14m 43s | Hits:  99%/786   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 12m 17s | Avg: 12m 17s | Max: 12m 17s | Hits:  99%/786   
      🟩 MSVC14.39          Pass: 100%/2   | Total: 27m 20s | Avg: 13m 40s | Max: 14m 37s | Hits:  99%/1572  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 19m 07s | Avg:  9m 33s | Max: 10m 12s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 28m | Avg:  7m 48s | Max: 32m 50s
      🟩 GCC                Pass: 100%/21  | Total:  2h 59m | Avg:  8m 32s | Max: 22m 48s
      🟩 Intel              Pass: 100%/1   | Total:  6m 57s | Avg:  6m 57s | Max:  6m 57s
      🟩 MSVC               Pass: 100%/4   | Total: 54m 20s | Avg: 13m 35s | Max: 14m 43s | Hits:  99%/3144  
      🟩 NVHPC              Pass: 100%/2   | Total: 19m 07s | Avg:  9m 33s | Max: 10m 12s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 26m 45s | Avg: 13m 22s | Max: 22m 34s
      🟩 v100               Pass: 100%/45  | Total:  6h 21m | Avg:  8m 28s | Max: 32m 50s | Hits:  99%/3144  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 11m | Avg:  6m 16s | Max: 14m 43s | Hits:  99%/3144  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 16m 01s | Avg: 16m 01s | Max: 16m 01s
      🟩 GraphCapture       Pass: 100%/1   | Total: 22m 48s | Avg: 22m 48s | Max: 22m 48s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 04m | Avg: 21m 22s | Max: 25m 28s
      🟩 TestGPU            Pass: 100%/2   | Total: 53m 58s | Avg: 26m 59s | Max: 32m 50s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 26m 45s | Avg: 13m 22s | Max: 22m 34s
      🟩 90a                Pass: 100%/1   | Total:  4m 30s | Avg:  4m 30s | Max:  4m 30s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 23m 47s | Avg:  4m 45s | Max:  6m 23s
      🟩 14                 Pass: 100%/4   | Total: 30m 52s | Avg:  7m 43s | Max: 14m 43s | Hits:  99%/786   
      🟩 17                 Pass: 100%/12  | Total:  1h 24m | Avg:  7m 01s | Max: 12m 43s | Hits:  99%/1572  
      🟩 20                 Pass: 100%/26  | Total:  4h 29m | Avg: 10m 20s | Max: 32m 50s | Hits:  99%/786   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 6h 39m | Avg: 8m 41s | Max: 45m 50s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 25m 15s | Avg: 12m 37s | Max: 19m 38s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  6h 30m | Avg:  8m 52s | Max: 45m 50s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 42s | Avg:  4m 51s | Max:  5m 07s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 42m 41s | Avg:  6m 05s | Max: 17m 43s | Hits:  99%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 27m 04s | Avg: 13m 32s | Max: 13m 33s
      🟩 12.6               Pass: 100%/37  | Total:  5h 30m | Avg:  8m 55s | Max: 45m 50s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  4m 56s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 42m 41s | Avg:  6m 05s | Max: 17m 43s | Hits:  99%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 27m 04s | Avg: 13m 32s | Max: 13m 33s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  5h 20m | Avg:  9m 09s | Max: 45m 50s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  4m 56s
      🟩 nvcc               Pass: 100%/44  | Total:  6h 30m | Avg:  8m 52s | Max: 45m 50s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 20m 00s | Avg:  5m 00s | Max:  6m 11s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 33s | Avg:  6m 33s | Max:  6m 33s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 35s | Avg:  5m 35s | Max:  5m 35s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 48s | Avg:  5m 48s | Max:  5m 48s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 12s | Avg:  5m 12s | Max:  5m 12s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 18m | Avg: 11m 09s | Max: 45m 50s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 58s | Avg:  3m 59s | Max:  4m 15s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 36s | Avg:  4m 48s | Max:  4m 48s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 05s | Avg:  5m 05s | Max:  5m 05s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 07s | Avg:  4m 42s | Max:  5m 38s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 15m | Avg:  9m 29s | Max: 20m 32s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  7m 01s | Avg:  7m 01s | Max:  7m 01s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 17m 43s | Avg: 17m 43s | Max: 17m 43s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 16m 22s | Avg: 16m 22s | Max: 16m 22s | Hits:  99%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 00m | Avg: 20m 00s | Max: 23m 44s | Hits:  99%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 27m 04s | Avg: 13m 32s | Max: 13m 33s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 22m | Avg:  7m 30s | Max: 45m 50s
      🟩 GCC                Pass: 100%/19  | Total:  2h 09m | Avg:  6m 47s | Max: 20m 32s
      🟩 Intel              Pass: 100%/1   | Total:  7m 01s | Avg:  7m 01s | Max:  7m 01s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 34m | Avg: 18m 49s | Max: 23m 44s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 27m 04s | Avg: 13m 32s | Max: 13m 33s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  6h 39m | Avg:  8m 41s | Max: 45m 50s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 34m | Avg:  6m 52s | Max: 20m 08s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 38m 59s | Avg: 12m 59s | Max: 23m 44s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 26m | Avg: 28m 40s | Max: 45m 50s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 50s | Avg:  4m 50s | Max:  4m 50s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 21m 52s | Avg:  4m 22s | Max:  5m 18s
      🟩 14                 Pass: 100%/4   | Total: 32m 57s | Avg:  8m 14s | Max: 17m 43s | Hits:  99%/1852  
      🟩 17                 Pass: 100%/12  | Total:  1h 38m | Avg:  8m 14s | Max: 20m 08s | Hits:  99%/3704  
      🟩 20                 Pass: 100%/23  | Total:  3h 41m | Avg:  9m 36s | Max: 45m 50s | Hits:  99%/3704  
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 02m | Avg: 4m 42s | Max: 20m 35s | Hits: 92%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  1h 51m | Avg:  5m 05s | Max: 20m 35s | Hits:  92%/312   
      🟩 arm64              Pass: 100%/4   | Total: 10m 33s | Avg:  2m 38s | Max:  2m 42s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 13m 42s | Avg:  4m 34s | Max:  8m 09s | Hits:  92%/156   
      🟩 12.5               Pass: 100%/2   | Total: 10m 22s | Avg:  5m 11s | Max:  5m 11s
      🟩 12.6               Pass: 100%/21  | Total:  1h 38m | Avg:  4m 41s | Max: 20m 35s | Hits:  92%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 13m 42s | Avg:  4m 34s | Max:  8m 09s | Hits:  92%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 22s | Avg:  5m 11s | Max:  5m 11s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  1h 38m | Avg:  4m 41s | Max: 20m 35s | Hits:  92%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 02m | Avg:  4m 42s | Max: 20m 35s | Hits:  92%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 03s | Avg:  3m 03s | Max:  3m 03s
      🟩 Clang10            Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s
      🟩 Clang11            Pass: 100%/1   | Total:  2m 57s | Avg:  2m 57s | Max:  2m 57s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 00s | Avg:  3m 00s | Max:  3m 00s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 02s | Avg:  3m 02s | Max:  3m 02s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 14s | Avg:  3m 14s | Max:  3m 14s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 16s | Avg:  3m 16s | Max:  3m 16s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 11s | Avg:  3m 11s | Max:  3m 11s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 11s | Avg:  3m 11s | Max:  3m 11s
      🟩 Clang18            Pass: 100%/4   | Total: 23m 48s | Avg:  5m 57s | Max: 15m 31s
      🟩 GCC9               Pass: 100%/1   | Total:  2m 30s | Avg:  2m 30s | Max:  2m 30s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 06s | Avg:  3m 06s | Max:  3m 06s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 10s | Avg:  3m 10s | Max:  3m 10s
      🟩 GCC12              Pass: 100%/2   | Total: 23m 36s | Avg: 11m 48s | Max: 20m 35s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 54s | Avg:  2m 43s | Max:  2m 53s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  8m 09s | Avg:  8m 09s | Max:  8m 09s | Hits:  92%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  8m 38s | Avg:  8m 38s | Max:  8m 38s | Hits:  92%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 22s | Avg:  5m 11s | Max:  5m 11s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total: 52m 06s | Avg:  4m 00s | Max: 15m 31s
      🟩 GCC                Pass: 100%/9   | Total: 43m 16s | Avg:  4m 48s | Max: 20m 35s
      🟩 MSVC               Pass: 100%/2   | Total: 16m 47s | Avg:  8m 23s | Max:  8m 38s | Hits:  92%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 22s | Avg:  5m 11s | Max:  5m 11s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 02m | Avg:  4m 42s | Max: 20m 35s | Hits:  92%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 26m | Avg:  3m 36s | Max:  8m 38s | Hits:  92%/312   
      🟩 Test               Pass: 100%/2   | Total: 36m 06s | Avg: 18m 03s | Max: 20m 35s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 45s | Avg:  2m 45s | Max:  2m 45s
      🟩 90a                Pass: 100%/1   | Total:  2m 53s | Avg:  2m 53s | Max:  2m 53s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 18m 40s | Avg:  3m 06s | Max:  5m 11s
      🟩 20                 Pass: 100%/20  | Total:  1h 43m | Avg:  5m 11s | Max: 20m 35s | Hits:  92%/312   
    
  • 🟩 cccl: Pass: 100%/6 | Total: 24m 41s | Avg: 4m 06s | Max: 4m 33s

    🟩 cpu
      🟩 amd64              Pass: 100%/6   | Total: 24m 41s | Avg:  4m 06s | Max:  4m 33s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  7m 10s | Avg:  3m 35s | Max:  3m 57s
      🟩 12.0               Pass: 100%/2   | Total:  8m 29s | Avg:  4m 14s | Max:  4m 21s
      🟩 12.6               Pass: 100%/2   | Total:  9m 02s | Avg:  4m 31s | Max:  4m 33s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  7m 10s | Avg:  3m 35s | Max:  3m 57s
      🟩 nvcc12.0           Pass: 100%/2   | Total:  8m 29s | Avg:  4m 14s | Max:  4m 21s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 02s | Avg:  4m 31s | Max:  4m 33s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/6   | Total: 24m 41s | Avg:  4m 06s | Max:  4m 33s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 57s | Avg:  3m 57s | Max:  3m 57s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 21s | Avg:  4m 21s | Max:  4m 21s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 29s | Avg:  4m 29s | Max:  4m 29s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 13s | Avg:  3m 13s | Max:  3m 13s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 08s | Avg:  4m 08s | Max:  4m 08s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 33s | Avg:  4m 33s | Max:  4m 33s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/3   | Total: 12m 47s | Avg:  4m 15s | Max:  4m 29s
      🟩 GCC                Pass: 100%/3   | Total: 11m 54s | Avg:  3m 58s | Max:  4m 33s
    🟩 gpu
      🟩 v100               Pass: 100%/6   | Total: 24m 41s | Avg:  4m 06s | Max:  4m 33s
    🟩 jobs
      🟩 Infra              Pass: 100%/6   | Total: 24m 41s | Avg:  4m 06s | Max:  4m 33s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 8m 58s | Avg: 4m 29s | Max: 6m 54s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 58s | Avg:  4m 29s | Max:  6m 54s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  8m 58s | Avg:  4m 29s | Max:  6m 54s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 58s | Avg:  4m 29s | Max:  6m 54s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  8m 58s | Avg:  4m 29s | Max:  6m 54s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  8m 58s | Avg:  4m 29s | Max:  6m 54s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  8m 58s | Avg:  4m 29s | Max:  6m 54s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  8m 58s | Avg:  4m 29s | Max:  6m 54s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 04s | Avg:  2m 04s | Max:  2m 04s
      🟩 Test               Pass: 100%/1   | Total:  6m 54s | Avg:  6m 54s | Max:  6m 54s
    
  • 🟩 python: Pass: 100%/1 | Total: 41m 23s | Avg: 41m 23s | Max: 41m 23s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 41m 23s | Avg: 41m 23s | Max: 41m 23s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 41m 23s | Avg: 41m 23s | Max: 41m 23s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 41m 23s | Avg: 41m 23s | Max: 41m 23s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 41m 23s | Avg: 41m 23s | Max: 41m 23s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 41m 23s | Avg: 41m 23s | Max: 41m 23s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 41m 23s | Avg: 41m 23s | Max: 41m 23s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 41m 23s | Avg: 41m 23s | Max: 41m 23s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 41m 23s | Avg: 41m 23s | Max: 41m 23s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
+/- python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 176)

# Runner
125 linux-amd64-cpu16
25 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Copy link
Contributor

🟩 CI finished in 1h 18m: Pass: 100%/176 | Total: 1d 04h | Avg: 9m 52s | Max: 52m 11s | Hits: 87%/22530
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 11h 56m | Avg: 14m 55s | Max: 45m 24s | Hits: 71%/9814

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total: 11h 43m | Avg: 15m 17s | Max: 45m 24s | Hits:  71%/9814  
      🟩 arm64              Pass: 100%/2   | Total: 12m 53s | Avg:  6m 26s | Max:  9m 35s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 13m | Avg: 10m 26s | Max: 29m 58s | Hits:  35%/2239  
      🟩 12.5               Pass: 100%/2   | Total: 38m 27s | Avg: 19m 13s | Max: 29m 48s
      🟩 12.6               Pass: 100%/39  | Total: 10h 04m | Avg: 15m 30s | Max: 45m 24s | Hits:  82%/7575  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 04m | Avg: 16m 02s | Max: 20m 00s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 13m | Avg: 10m 26s | Max: 29m 58s | Hits:  35%/2239  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 38m 27s | Avg: 19m 13s | Max: 29m 48s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  9h 00m | Avg: 15m 26s | Max: 45m 24s | Hits:  82%/7575  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 04m | Avg: 16m 02s | Max: 20m 00s
      🟩 nvcc               Pass: 100%/44  | Total: 10h 52m | Avg: 14m 49s | Max: 45m 24s | Hits:  71%/9814  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 24m 06s | Avg:  6m 01s | Max: 13m 22s
      🟩 Clang10            Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 Clang11            Pass: 100%/1   | Total: 22m 24s | Avg: 22m 24s | Max: 22m 24s
      🟩 Clang12            Pass: 100%/1   | Total: 15m 11s | Avg: 15m 11s | Max: 15m 11s
      🟩 Clang13            Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
      🟩 Clang14            Pass: 100%/1   | Total: 21m 17s | Avg: 21m 17s | Max: 21m 17s
      🟩 Clang15            Pass: 100%/1   | Total: 15m 44s | Avg: 15m 44s | Max: 15m 44s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 15s | Avg:  4m 15s | Max:  4m 15s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 51m | Avg: 13m 54s | Max: 20m 00s
      🟩 GCC6               Pass: 100%/2   | Total: 26m 17s | Avg: 13m 08s | Max: 18m 06s
      🟩 GCC7               Pass: 100%/2   | Total: 18m 31s | Avg:  9m 15s | Max: 15m 14s
      🟩 GCC8               Pass: 100%/1   | Total: 18m 51s | Avg: 18m 51s | Max: 18m 51s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 23s | Avg:  4m 47s | Max:  8m 29s
      🟩 GCC10              Pass: 100%/1   | Total: 23m 22s | Avg: 23m 22s | Max: 23m 22s
      🟩 GCC11              Pass: 100%/1   | Total: 23m 15s | Avg: 23m 15s | Max: 23m 15s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 39s | Avg:  3m 39s | Max:  3m 39s
      🟩 GCC13              Pass: 100%/10  | Total:  3h 41m | Avg: 22m 08s | Max: 45m 24s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  5m 53s | Avg:  5m 53s | Max:  5m 53s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 29m 58s | Avg: 29m 58s | Max: 29m 58s | Hits:  35%/2239  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 12m 18s | Avg: 12m 18s | Max: 12m 18s | Hits:  99%/2476  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 41m 34s | Avg: 20m 47s | Max: 29m 08s | Hits:  74%/5099  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 38m 27s | Avg: 19m 13s | Max: 29m 48s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/20  | Total:  3h 58m | Avg: 11m 55s | Max: 22m 24s
      🟩 GCC                Pass: 100%/21  | Total:  5h 49m | Avg: 16m 39s | Max: 45m 24s
      🟩 Intel              Pass: 100%/1   | Total:  5m 53s | Avg:  5m 53s | Max:  5m 53s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 23m | Avg: 20m 57s | Max: 29m 58s | Hits:  71%/9814  
      🟩 NVHPC              Pass: 100%/2   | Total: 38m 27s | Avg: 19m 13s | Max: 29m 48s
    🟩 gpu
      🟩 v100               Pass: 100%/48  | Total: 11h 56m | Avg: 14m 55s | Max: 45m 24s | Hits:  71%/9814  
    🟩 jobs
      🟩 Build              Pass: 100%/41  | Total:  8h 26m | Avg: 12m 21s | Max: 29m 58s | Hits:  71%/9814  
      🟩 NVRTC              Pass: 100%/4   | Total:  2h 50m | Avg: 42m 44s | Max: 45m 24s
      🟩 Test               Pass: 100%/2   | Total: 36m 32s | Avg: 18m 16s | Max: 18m 42s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 54s | Avg:  1m 54s | Max:  1m 54s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 12m 40s | Avg: 12m 40s | Max: 12m 40s
      🟩 90a                Pass: 100%/2   | Total: 16m 15s | Avg:  8m 07s | Max: 12m 24s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total:  1h 16m | Avg: 12m 46s | Max: 36m 52s
      🟩 14                 Pass: 100%/5   | Total:  1h 41m | Avg: 20m 17s | Max: 43m 21s | Hits:  35%/2239  
      🟩 17                 Pass: 100%/13  | Total:  3h 19m | Avg: 15m 22s | Max: 45m 20s | Hits:  99%/4952  
      🟩 20                 Pass: 100%/23  | Total:  5h 36m | Avg: 14m 37s | Max: 45m 24s | Hits:  51%/2623  
    
  • 🟩 cub: Pass: 100%/47 | Total: 7h 23m | Avg: 9m 25s | Max: 52m 11s | Hits: 99%/3144

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  7h 13m | Avg:  9m 38s | Max: 52m 11s | Hits:  99%/3144  
      🟩 arm64              Pass: 100%/2   | Total:  9m 34s | Avg:  4m 47s | Max:  5m 01s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 40m 58s | Avg:  5m 51s | Max: 14m 34s | Hits:  99%/786   
      🟩 12.5               Pass: 100%/2   | Total: 17m 59s | Avg:  8m 59s | Max:  9m 10s
      🟩 12.6               Pass: 100%/38  | Total:  6h 24m | Avg: 10m 06s | Max: 52m 11s | Hits:  99%/2358  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  4m 27s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 40m 58s | Avg:  5m 51s | Max: 14m 34s | Hits:  99%/786   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 59s | Avg:  8m 59s | Max:  9m 10s
      🟩 nvcc12.6           Pass: 100%/36  | Total:  6h 15m | Avg: 10m 25s | Max: 52m 11s | Hits:  99%/2358  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  4m 27s
      🟩 nvcc               Pass: 100%/45  | Total:  7h 14m | Avg:  9m 39s | Max: 52m 11s | Hits:  99%/3144  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 21m 54s | Avg:  5m 28s | Max:  6m 29s
      🟩 Clang10            Pass: 100%/1   | Total:  7m 00s | Avg:  7m 00s | Max:  7m 00s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 16s | Avg:  5m 16s | Max:  5m 16s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 41s | Avg:  5m 41s | Max:  5m 41s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 38s | Avg:  5m 38s | Max:  5m 38s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 44s | Avg:  5m 44s | Max:  5m 44s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 33s | Avg:  5m 33s | Max:  5m 33s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 36s | Avg:  5m 36s | Max:  5m 36s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 12m | Avg: 10m 18s | Max: 24m 49s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 41s | Avg:  4m 20s | Max:  4m 37s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 38s | Avg:  5m 19s | Max:  5m 24s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 37s | Avg:  5m 37s | Max:  5m 37s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 19s | Avg:  4m 46s | Max:  5m 52s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 33s | Avg:  5m 33s | Max:  5m 33s
      🟩 GCC12              Pass: 100%/3   | Total: 27m 41s | Avg:  9m 13s | Max: 17m 29s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 27m | Avg: 18m 27s | Max: 52m 11s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 33s | Avg:  6m 33s | Max:  6m 33s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s | Hits:  99%/786   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 12m 12s | Avg: 12m 12s | Max: 12m 12s | Hits:  99%/786   
      🟩 MSVC14.39          Pass: 100%/2   | Total: 25m 49s | Avg: 12m 54s | Max: 12m 56s | Hits:  99%/1572  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 59s | Avg:  8m 59s | Max:  9m 10s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 20m | Avg:  7m 22s | Max: 24m 49s
      🟩 GCC                Pass: 100%/21  | Total:  3h 45m | Avg: 10m 45s | Max: 52m 11s
      🟩 Intel              Pass: 100%/1   | Total:  6m 33s | Avg:  6m 33s | Max:  6m 33s
      🟩 MSVC               Pass: 100%/4   | Total: 52m 35s | Avg: 13m 08s | Max: 14m 34s | Hits:  99%/3144  
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 59s | Avg:  8m 59s | Max:  9m 10s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 21m 34s | Avg: 10m 47s | Max: 17m 29s
      🟩 v100               Pass: 100%/45  | Total:  7h 01m | Avg:  9m 22s | Max: 52m 11s | Hits:  99%/3144  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 57m | Avg:  7m 26s | Max: 52m 11s | Hits:  99%/3144  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 16m 34s | Avg: 16m 34s | Max: 16m 34s
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 45s | Avg: 18m 45s | Max: 18m 45s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 02m | Avg: 20m 58s | Max: 24m 49s
      🟩 TestGPU            Pass: 100%/2   | Total: 47m 11s | Avg: 23m 35s | Max: 24m 11s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 21m 34s | Avg: 10m 47s | Max: 17m 29s
      🟩 90a                Pass: 100%/1   | Total:  4m 31s | Avg:  4m 31s | Max:  4m 31s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 23m 55s | Avg:  4m 47s | Max:  6m 09s
      🟩 14                 Pass: 100%/4   | Total: 31m 04s | Avg:  7m 46s | Max: 14m 34s | Hits:  99%/786   
      🟩 17                 Pass: 100%/12  | Total:  1h 24m | Avg:  7m 02s | Max: 12m 56s | Hits:  99%/1572  
      🟩 20                 Pass: 100%/26  | Total:  5h 03m | Avg: 11m 40s | Max: 52m 11s | Hits:  99%/786   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 6h 16m | Avg: 8m 10s | Max: 23m 28s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 25m 33s | Avg: 12m 46s | Max: 19m 48s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  6h 06m | Avg:  8m 20s | Max: 23m 28s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  5m 02s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 43m 43s | Avg:  6m 14s | Max: 17m 26s | Hits:  99%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 27m 08s | Avg: 13m 34s | Max: 13m 38s
      🟩 12.6               Pass: 100%/37  | Total:  5h 05m | Avg:  8m 15s | Max: 23m 28s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  5m 04s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 43m 43s | Avg:  6m 14s | Max: 17m 26s | Hits:  99%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 27m 08s | Avg: 13m 34s | Max: 13m 38s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  4h 55m | Avg:  8m 26s | Max: 23m 28s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  5m 04s
      🟩 nvcc               Pass: 100%/44  | Total:  6h 06m | Avg:  8m 19s | Max: 23m 28s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 21m 40s | Avg:  5m 25s | Max:  6m 14s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 48s | Avg:  6m 48s | Max:  6m 48s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 Clang12            Pass: 100%/1   | Total:  4m 59s | Avg:  4m 59s | Max:  4m 59s
      🟩 Clang13            Pass: 100%/1   | Total:  4m 59s | Avg:  4m 59s | Max:  4m 59s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 01s | Avg:  5m 01s | Max:  5m 01s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 Clang18            Pass: 100%/7   | Total: 55m 41s | Avg:  7m 57s | Max: 23m 28s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 21s | Avg:  4m 10s | Max:  4m 28s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 52s | Avg:  4m 56s | Max:  5m 22s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 28s | Avg:  5m 28s | Max:  5m 28s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 06s | Avg:  4m 42s | Max:  6m 01s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 12s | Avg:  5m 12s | Max:  5m 12s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 56s | Avg:  5m 56s | Max:  5m 56s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 59s | Avg:  5m 59s | Max:  5m 59s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 17m | Avg:  9m 40s | Max: 22m 23s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  7m 06s | Avg:  7m 06s | Max:  7m 06s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 17m 26s | Avg: 17m 26s | Max: 17m 26s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 16m 36s | Avg: 16m 36s | Max: 16m 36s | Hits:  99%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 55m 15s | Avg: 18m 25s | Max: 22m 28s | Hits:  99%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 27m 08s | Avg: 13m 34s | Max: 13m 38s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 00m | Avg:  6m 20s | Max: 23m 28s
      🟩 GCC                Pass: 100%/19  | Total:  2h 12m | Avg:  6m 57s | Max: 22m 23s
      🟩 Intel              Pass: 100%/1   | Total:  7m 06s | Avg:  7m 06s | Max:  7m 06s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 29m | Avg: 17m 51s | Max: 22m 28s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 27m 08s | Avg: 13m 34s | Max: 13m 38s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  6h 16m | Avg:  8m 10s | Max: 23m 28s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 32m | Avg:  6m 48s | Max: 17m 26s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 38m 05s | Avg: 12m 41s | Max: 22m 28s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 05m | Avg: 21m 53s | Max: 23m 28s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 20s | Avg:  4m 20s | Max:  4m 20s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 22m 11s | Avg:  4m 26s | Max:  5m 35s
      🟩 14                 Pass: 100%/4   | Total: 33m 30s | Avg:  8m 22s | Max: 17m 26s | Hits:  99%/1852  
      🟩 17                 Pass: 100%/12  | Total:  1h 36m | Avg:  8m 04s | Max: 16m 36s | Hits:  99%/3704  
      🟩 20                 Pass: 100%/23  | Total:  3h 18m | Avg:  8m 37s | Max: 23m 28s | Hits:  99%/3704  
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 00m | Avg: 4m 37s | Max: 15m 57s | Hits: 92%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  1h 49m | Avg:  4m 59s | Max: 15m 57s | Hits:  92%/312   
      🟩 arm64              Pass: 100%/4   | Total: 10m 22s | Avg:  2m 35s | Max:  2m 41s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 13m 59s | Avg:  4m 39s | Max:  8m 17s | Hits:  92%/156   
      🟩 12.5               Pass: 100%/2   | Total: 11m 17s | Avg:  5m 38s | Max:  5m 43s
      🟩 12.6               Pass: 100%/21  | Total:  1h 34m | Avg:  4m 31s | Max: 15m 57s | Hits:  92%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 13m 59s | Avg:  4m 39s | Max:  8m 17s | Hits:  92%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 11m 17s | Avg:  5m 38s | Max:  5m 43s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  1h 34m | Avg:  4m 31s | Max: 15m 57s | Hits:  92%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 00m | Avg:  4m 37s | Max: 15m 57s | Hits:  92%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 06s | Avg:  3m 06s | Max:  3m 06s
      🟩 Clang10            Pass: 100%/1   | Total:  3m 44s | Avg:  3m 44s | Max:  3m 44s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s
      🟩 Clang12            Pass: 100%/1   | Total:  2m 59s | Avg:  2m 59s | Max:  2m 59s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 13s | Avg:  3m 13s | Max:  3m 13s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 09s | Avg:  3m 09s | Max:  3m 09s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 19s | Avg:  3m 19s | Max:  3m 19s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s
      🟩 Clang18            Pass: 100%/4   | Total: 23m 14s | Avg:  5m 48s | Max: 14m 49s
      🟩 GCC9               Pass: 100%/1   | Total:  2m 36s | Avg:  2m 36s | Max:  2m 36s
      🟩 GCC10              Pass: 100%/1   | Total:  2m 56s | Avg:  2m 56s | Max:  2m 56s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 14s | Avg:  3m 14s | Max:  3m 14s
      🟩 GCC12              Pass: 100%/2   | Total: 19m 07s | Avg:  9m 33s | Max: 15m 57s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 43s | Avg:  2m 40s | Max:  2m 49s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  8m 17s | Avg:  8m 17s | Max:  8m 17s | Hits:  92%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 19s | Avg:  9m 19s | Max:  9m 19s | Hits:  92%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 11m 17s | Avg:  5m 38s | Max:  5m 43s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total: 52m 42s | Avg:  4m 03s | Max: 14m 49s
      🟩 GCC                Pass: 100%/9   | Total: 38m 36s | Avg:  4m 17s | Max: 15m 57s
      🟩 MSVC               Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max:  9m 19s | Hits:  92%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 11m 17s | Avg:  5m 38s | Max:  5m 43s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 00m | Avg:  4m 37s | Max: 15m 57s | Hits:  92%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 29m | Avg:  3m 43s | Max:  9m 19s | Hits:  92%/312   
      🟩 Test               Pass: 100%/2   | Total: 30m 46s | Avg: 15m 23s | Max: 15m 57s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 46s | Avg:  2m 46s | Max:  2m 46s
      🟩 90a                Pass: 100%/1   | Total:  2m 49s | Avg:  2m 49s | Max:  2m 49s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 19m 26s | Avg:  3m 14s | Max:  5m 43s
      🟩 20                 Pass: 100%/20  | Total:  1h 40m | Avg:  5m 02s | Max: 15m 57s | Hits:  92%/312   
    
  • 🟩 cccl: Pass: 100%/6 | Total: 24m 09s | Avg: 4m 01s | Max: 4m 32s

    🟩 cpu
      🟩 amd64              Pass: 100%/6   | Total: 24m 09s | Avg:  4m 01s | Max:  4m 32s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  7m 11s | Avg:  3m 35s | Max:  3m 52s
      🟩 12.0               Pass: 100%/2   | Total:  8m 25s | Avg:  4m 12s | Max:  4m 19s
      🟩 12.6               Pass: 100%/2   | Total:  8m 33s | Avg:  4m 16s | Max:  4m 32s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  7m 11s | Avg:  3m 35s | Max:  3m 52s
      🟩 nvcc12.0           Pass: 100%/2   | Total:  8m 25s | Avg:  4m 12s | Max:  4m 19s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 33s | Avg:  4m 16s | Max:  4m 32s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/6   | Total: 24m 09s | Avg:  4m 01s | Max:  4m 32s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 19s | Avg:  4m 19s | Max:  4m 19s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 32s | Avg:  4m 32s | Max:  4m 32s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 19s | Avg:  3m 19s | Max:  3m 19s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 06s | Avg:  4m 06s | Max:  4m 06s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 01s | Avg:  4m 01s | Max:  4m 01s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/3   | Total: 12m 43s | Avg:  4m 14s | Max:  4m 32s
      🟩 GCC                Pass: 100%/3   | Total: 11m 26s | Avg:  3m 48s | Max:  4m 06s
    🟩 gpu
      🟩 v100               Pass: 100%/6   | Total: 24m 09s | Avg:  4m 01s | Max:  4m 32s
    🟩 jobs
      🟩 Infra              Pass: 100%/6   | Total: 24m 09s | Avg:  4m 01s | Max:  4m 32s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 29s | Avg: 5m 14s | Max: 8m 30s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 29s | Avg:  5m 14s | Max:  8m 30s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 29s | Avg:  5m 14s | Max:  8m 30s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 29s | Avg:  5m 14s | Max:  8m 30s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 29s | Avg:  5m 14s | Max:  8m 30s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 29s | Avg:  5m 14s | Max:  8m 30s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 29s | Avg:  5m 14s | Max:  8m 30s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 29s | Avg:  5m 14s | Max:  8m 30s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
      🟩 Test               Pass: 100%/1   | Total:  8m 30s | Avg:  8m 30s | Max:  8m 30s
    
  • 🟩 python: Pass: 100%/1 | Total: 45m 56s | Avg: 45m 56s | Max: 45m 56s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 45m 56s | Avg: 45m 56s | Max: 45m 56s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 45m 56s | Avg: 45m 56s | Max: 45m 56s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 45m 56s | Avg: 45m 56s | Max: 45m 56s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 45m 56s | Avg: 45m 56s | Max: 45m 56s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 45m 56s | Avg: 45m 56s | Max: 45m 56s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 45m 56s | Avg: 45m 56s | Max: 45m 56s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 45m 56s | Avg: 45m 56s | Max: 45m 56s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 45m 56s | Avg: 45m 56s | Max: 45m 56s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
+/- python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 176)

# Runner
125 linux-amd64-cpu16
25 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Review
Development

Successfully merging this pull request may close these issues.

[FEA]: Introduce Python module with CCCL headers
5 participants