ptx: Add add_ptx_instruction.py #3190

Open: wants to merge 1 commit into main

Conversation

bernhardmgruber
Contributor

This file helps create the necessary structure for new PTX instructions.
We should find a better place for it though.
Contributor

@ahendriksen left a comment

LGTM!

@bernhardmgruber changed the title from "ptx: Add add_instruction.py" to "ptx: Add add_ptx_instruction.py" on Dec 18, 2024
@bernhardmgruber
Contributor Author

I mostly wonder where we should put the script. It looks a bit odd at the repository root. Does anybody have any suggestions?

Contributor

🟩 CI finished in 1h 55m: Pass: 100%/176 | Total: 1d 03h | Avg: 9m 17s | Max: 1h 26m | Hits: 99%/22510
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 9h 21m | Avg: 11m 41s | Max: 35m 21s | Hits: 98%/9814

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total:  9h 09m | Avg: 11m 57s | Max: 35m 21s | Hits:  98%/9814  
      🟩 arm64              Pass: 100%/2   | Total: 11m 40s | Avg:  5m 50s | Max:  7m 24s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 10m | Avg: 10m 07s | Max: 20m 43s | Hits:  98%/2239  
      🟩 12.5               Pass: 100%/2   | Total: 26m 41s | Avg: 13m 20s | Max: 18m 40s
      🟩 12.6               Pass: 100%/39  | Total:  7h 43m | Avg: 11m 53s | Max: 35m 21s | Hits:  98%/7575  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 03m | Avg: 15m 58s | Max: 19m 12s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 10m | Avg: 10m 07s | Max: 20m 43s | Hits:  98%/2239  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 26m 41s | Avg: 13m 20s | Max: 18m 40s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  6h 40m | Avg: 11m 25s | Max: 35m 21s | Hits:  98%/7575  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 03m | Avg: 15m 58s | Max: 19m 12s
      🟩 nvcc               Pass: 100%/44  | Total:  8h 17m | Avg: 11m 18s | Max: 35m 21s | Hits:  98%/9814  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 25m 03s | Avg:  6m 15s | Max:  7m 54s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 56s | Avg:  4m 56s | Max:  4m 56s
      🟩 Clang11            Pass: 100%/1   | Total:  4m 32s | Avg:  4m 32s | Max:  4m 32s
      🟩 Clang12            Pass: 100%/1   | Total: 15m 17s | Avg: 15m 17s | Max: 15m 17s
      🟩 Clang13            Pass: 100%/1   | Total: 14m 54s | Avg: 14m 54s | Max: 14m 54s
      🟩 Clang14            Pass: 100%/1   | Total:  7m 03s | Avg:  7m 03s | Max:  7m 03s
      🟩 Clang15            Pass: 100%/1   | Total: 16m 12s | Avg: 16m 12s | Max: 16m 12s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 35s | Avg:  4m 35s | Max:  4m 35s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 43s | Avg:  4m 43s | Max:  4m 43s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 33m | Avg: 11m 44s | Max: 19m 12s
      🟩 GCC6               Pass: 100%/2   | Total: 10m 31s | Avg:  5m 15s | Max:  7m 32s
      🟩 GCC7               Pass: 100%/2   | Total: 15m 16s | Avg:  7m 38s | Max: 11m 55s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 41s | Avg:  3m 41s | Max:  3m 41s
      🟩 GCC9               Pass: 100%/3   | Total: 28m 03s | Avg:  9m 21s | Max: 20m 43s
      🟩 GCC10              Pass: 100%/1   | Total:  4m 30s | Avg:  4m 30s | Max:  4m 30s
      🟩 GCC11              Pass: 100%/1   | Total: 20m 15s | Avg: 20m 15s | Max: 20m 15s
      🟩 GCC12              Pass: 100%/1   | Total: 15m 53s | Avg: 15m 53s | Max: 15m 53s
      🟩 GCC13              Pass: 100%/10  | Total:  2h 56m | Avg: 17m 40s | Max: 35m 21s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  5m 56s | Avg:  5m 56s | Max:  5m 56s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 20m 36s | Avg: 20m 36s | Max: 20m 36s | Hits:  98%/2239  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 13m 41s | Avg: 13m 41s | Max: 13m 41s | Hits:  99%/2476  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 28m 38s | Avg: 14m 19s | Max: 14m 57s | Hits:  98%/5099  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 26m 41s | Avg: 13m 20s | Max: 18m 40s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/20  | Total:  3h 11m | Avg:  9m 33s | Max: 19m 12s
      🟩 GCC                Pass: 100%/21  | Total:  4h 34m | Avg: 13m 05s | Max: 35m 21s
      🟩 Intel              Pass: 100%/1   | Total:  5m 56s | Avg:  5m 56s | Max:  5m 56s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 02m | Avg: 15m 43s | Max: 20m 36s | Hits:  98%/9814  
      🟩 NVHPC              Pass: 100%/2   | Total: 26m 41s | Avg: 13m 20s | Max: 18m 40s
    🟩 gpu
      🟩 v100               Pass: 100%/48  | Total:  9h 21m | Avg: 11m 41s | Max: 35m 21s | Hits:  98%/9814  
    🟩 jobs
      🟩 Build              Pass: 100%/41  | Total:  6h 40m | Avg:  9m 45s | Max: 20m 43s | Hits:  98%/9814  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 54m | Avg: 28m 44s | Max: 35m 21s
      🟩 Test               Pass: 100%/2   | Total: 44m 23s | Avg: 22m 11s | Max: 27m 44s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 55s | Avg:  1m 55s | Max:  1m 55s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 37s | Avg: 13m 37s | Max: 13m 37s
      🟩 90a                Pass: 100%/2   | Total: 17m 07s | Avg:  8m 33s | Max: 13m 28s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total:  1h 14m | Avg: 12m 24s | Max: 26m 34s
      🟩 14                 Pass: 100%/5   | Total: 58m 29s | Avg: 11m 41s | Max: 22m 05s | Hits:  98%/2239  
      🟩 17                 Pass: 100%/13  | Total:  2h 26m | Avg: 11m 16s | Max: 30m 59s | Hits:  98%/4952  
      🟩 20                 Pass: 100%/23  | Total:  4h 40m | Avg: 12m 10s | Max: 35m 21s | Hits:  98%/2623  
    
  • 🟩 cub: Pass: 100%/47 | Total: 8h 19m | Avg: 10m 37s | Max: 1h 26m | Hits: 99%/3124

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  8h 09m | Avg: 10m 52s | Max:  1h 26m | Hits:  99%/3124  
      🟩 arm64              Pass: 100%/2   | Total: 10m 03s | Avg:  5m 01s | Max:  5m 06s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 41m 03s | Avg:  5m 51s | Max: 15m 11s | Hits:  99%/781   
      🟩 12.5               Pass: 100%/2   | Total: 18m 54s | Avg:  9m 27s | Max:  9m 30s
      🟩 12.6               Pass: 100%/38  | Total:  7h 19m | Avg: 11m 34s | Max:  1h 26m | Hits:  99%/2343  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 59s | Avg:  4m 29s | Max:  4m 47s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 41m 03s | Avg:  5m 51s | Max: 15m 11s | Hits:  99%/781   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 54s | Avg:  9m 27s | Max:  9m 30s
      🟩 nvcc12.6           Pass: 100%/36  | Total:  7h 10m | Avg: 11m 57s | Max:  1h 26m | Hits:  99%/2343  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 59s | Avg:  4m 29s | Max:  4m 47s
      🟩 nvcc               Pass: 100%/45  | Total:  8h 10m | Avg: 10m 54s | Max:  1h 26m | Hits:  99%/3124  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 21m 17s | Avg:  5m 19s | Max:  6m 16s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 32s | Avg:  6m 32s | Max:  6m 32s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 33s | Avg:  5m 33s | Max:  5m 33s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 51s | Avg:  5m 51s | Max:  5m 51s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 49s | Avg:  5m 49s | Max:  5m 49s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 25s | Avg:  5m 25s | Max:  5m 25s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 56s | Avg:  5m 56s | Max:  5m 56s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 49s | Avg:  5m 49s | Max:  5m 49s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 16s | Avg:  5m 16s | Max:  5m 16s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 37m | Avg: 13m 59s | Max: 41m 22s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 20s | Avg:  4m 10s | Max:  4m 18s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 19s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 28s | Avg:  5m 28s | Max:  5m 28s
      🟩 GCC9               Pass: 100%/3   | Total: 13m 49s | Avg:  4m 36s | Max:  5m 22s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 GCC12              Pass: 100%/3   | Total: 27m 14s | Avg:  9m 04s | Max: 16m 54s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 56m | Avg: 22m 06s | Max:  1h 26m
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 36s | Avg:  6m 36s | Max:  6m 36s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 11s | Avg: 15m 11s | Max: 15m 11s | Hits:  99%/781   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 14m 10s | Avg: 14m 10s | Max: 14m 10s | Hits:  99%/781   
      🟩 MSVC14.39          Pass: 100%/2   | Total: 26m 40s | Avg: 13m 20s | Max: 13m 33s | Hits:  99%/1562  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 54s | Avg:  9m 27s | Max:  9m 30s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 45m | Avg:  8m 42s | Max: 41m 22s
      🟩 GCC                Pass: 100%/21  | Total:  4h 12m | Avg: 12m 01s | Max:  1h 26m
      🟩 Intel              Pass: 100%/1   | Total:  6m 36s | Avg:  6m 36s | Max:  6m 36s
      🟩 MSVC               Pass: 100%/4   | Total: 56m 01s | Avg: 14m 00s | Max: 15m 11s | Hits:  99%/3124  
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 54s | Avg:  9m 27s | Max:  9m 30s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 21m 14s | Avg: 10m 37s | Max: 16m 54s
      🟩 v100               Pass: 100%/45  | Total:  7h 58m | Avg: 10m 37s | Max:  1h 26m | Hits:  99%/3124  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 14m | Avg:  6m 21s | Max: 15m 11s | Hits:  99%/3124  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 17m 02s | Avg: 17m 02s | Max: 17m 02s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 55s | Avg: 19m 55s | Max: 19m 55s
      🟩 HostLaunch         Pass: 100%/3   | Total:  2h 15m | Avg: 45m 16s | Max:  1h 26m
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 12m | Avg: 36m 25s | Max: 41m 22s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 21m 14s | Avg: 10m 37s | Max: 16m 54s
      🟩 90a                Pass: 100%/1   | Total:  4m 26s | Avg:  4m 26s | Max:  4m 26s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 23m 30s | Avg:  4m 42s | Max:  5m 56s
      🟩 14                 Pass: 100%/4   | Total: 31m 04s | Avg:  7m 46s | Max: 15m 11s | Hits:  99%/781   
      🟩 17                 Pass: 100%/12  | Total:  1h 25m | Avg:  7m 06s | Max: 14m 10s | Hits:  99%/1562  
      🟩 20                 Pass: 100%/26  | Total:  5h 59m | Avg: 13m 50s | Max:  1h 26m | Hits:  99%/781   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 6h 24m | Avg: 8m 22s | Max: 27m 54s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 33s | Avg:  8m 46s | Max: 11m 48s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  6h 15m | Avg:  8m 31s | Max: 27m 54s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 42s | Avg:  4m 51s | Max:  5m 01s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 43m 37s | Avg:  6m 13s | Max: 18m 25s | Hits:  99%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 28m 11s | Avg: 14m 05s | Max: 14m 38s
      🟩 12.6               Pass: 100%/37  | Total:  5h 13m | Avg:  8m 27s | Max: 27m 54s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 12s | Avg:  5m 06s | Max:  5m 18s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 43m 37s | Avg:  6m 13s | Max: 18m 25s | Hits:  99%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 28m 11s | Avg: 14m 05s | Max: 14m 38s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  5h 02m | Avg:  8m 39s | Max: 27m 54s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 12s | Avg:  5m 06s | Max:  5m 18s
      🟩 nvcc               Pass: 100%/44  | Total:  6h 14m | Avg:  8m 31s | Max: 27m 54s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 21m 32s | Avg:  5m 23s | Max:  6m 30s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 22s | Avg:  6m 22s | Max:  6m 22s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 00s | Avg:  5m 00s | Max:  5m 00s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 04s | Avg:  5m 04s | Max:  5m 04s
      🟩 Clang13            Pass: 100%/1   | Total:  6m 04s | Avg:  6m 04s | Max:  6m 04s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 49s | Avg:  5m 49s | Max:  5m 49s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 52s | Avg:  5m 52s | Max:  5m 52s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 01m | Avg:  8m 43s | Max: 27m 54s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 47s | Avg:  3m 53s | Max:  4m 03s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 06s | Avg:  5m 03s | Max:  5m 26s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 11s | Avg:  5m 11s | Max:  5m 11s
      🟩 GCC9               Pass: 100%/3   | Total: 13m 45s | Avg:  4m 35s | Max:  5m 42s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 13s | Avg:  5m 13s | Max:  5m 13s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 57s | Avg:  5m 57s | Max:  5m 57s
      🟩 GCC12              Pass: 100%/1   | Total:  6m 09s | Avg:  6m 09s | Max:  6m 09s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 12m | Avg:  9m 05s | Max: 25m 23s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  7m 32s | Avg:  7m 32s | Max:  7m 32s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 18m 25s | Avg: 18m 25s | Max: 18m 25s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 16m 16s | Avg: 16m 16s | Max: 16m 16s | Hits:  99%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 59m 59s | Avg: 19m 59s | Max: 24m 16s | Hits:  99%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 28m 11s | Avg: 14m 05s | Max: 14m 38s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 07m | Avg:  6m 43s | Max: 27m 54s
      🟩 GCC                Pass: 100%/19  | Total:  2h 06m | Avg:  6m 40s | Max: 25m 23s
      🟩 Intel              Pass: 100%/1   | Total:  7m 32s | Avg:  7m 32s | Max:  7m 32s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 34m | Avg: 18m 56s | Max: 24m 16s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 28m 11s | Avg: 14m 05s | Max: 14m 38s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  6h 24m | Avg:  8m 22s | Max: 27m 54s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 40m | Avg:  7m 01s | Max: 18m 25s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 39m 11s | Avg: 13m 03s | Max: 24m 16s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 05m | Avg: 21m 41s | Max: 27m 54s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 45s | Avg:  4m 45s | Max:  4m 45s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 22m 08s | Avg:  4m 25s | Max:  5m 40s
      🟩 14                 Pass: 100%/4   | Total: 34m 24s | Avg:  8m 36s | Max: 18m 25s | Hits:  99%/1852  
      🟩 17                 Pass: 100%/12  | Total:  1h 39m | Avg:  8m 19s | Max: 18m 21s | Hits:  99%/3704  
      🟩 20                 Pass: 100%/23  | Total:  3h 30m | Avg:  9m 10s | Max: 27m 54s | Hits:  99%/3704  
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 01m | Avg: 4m 41s | Max: 16m 14s | Hits: 92%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  1h 50m | Avg:  5m 02s | Max: 16m 14s | Hits:  92%/312   
      🟩 arm64              Pass: 100%/4   | Total: 10m 55s | Avg:  2m 43s | Max:  3m 12s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 15m 03s | Avg:  5m 01s | Max:  9m 05s | Hits:  92%/156   
      🟩 12.5               Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 33s
      🟩 12.6               Pass: 100%/21  | Total:  1h 35m | Avg:  4m 33s | Max: 16m 14s | Hits:  92%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 15m 03s | Avg:  5m 01s | Max:  9m 05s | Hits:  92%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 33s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  1h 35m | Avg:  4m 33s | Max: 16m 14s | Hits:  92%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 01m | Avg:  4m 41s | Max: 16m 14s | Hits:  92%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  2m 59s | Avg:  2m 59s | Max:  2m 59s
      🟩 Clang10            Pass: 100%/1   | Total:  3m 45s | Avg:  3m 45s | Max:  3m 45s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s
      🟩 Clang12            Pass: 100%/1   | Total:  2m 56s | Avg:  2m 56s | Max:  2m 56s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 05s | Avg:  3m 05s | Max:  3m 05s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 05s | Avg:  3m 05s | Max:  3m 05s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 16s | Avg:  3m 16s | Max:  3m 16s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 14s | Avg:  3m 14s | Max:  3m 14s
      🟩 Clang18            Pass: 100%/4   | Total: 24m 18s | Avg:  6m 04s | Max: 15m 50s
      🟩 GCC9               Pass: 100%/1   | Total:  2m 59s | Avg:  2m 59s | Max:  2m 59s
      🟩 GCC10              Pass: 100%/1   | Total:  2m 55s | Avg:  2m 55s | Max:  2m 55s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 00s | Avg:  3m 00s | Max:  3m 00s
      🟩 GCC12              Pass: 100%/2   | Total: 19m 40s | Avg:  9m 50s | Max: 16m 14s
      🟩 GCC13              Pass: 100%/4   | Total: 11m 23s | Avg:  2m 50s | Max:  3m 12s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  9m 05s | Avg:  9m 05s | Max:  9m 05s | Hits:  92%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  8m 56s | Avg:  8m 56s | Max:  8m 56s | Hits:  92%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 33s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total: 52m 54s | Avg:  4m 04s | Max: 15m 50s
      🟩 GCC                Pass: 100%/9   | Total: 39m 57s | Avg:  4m 26s | Max: 16m 14s
      🟩 MSVC               Pass: 100%/2   | Total: 18m 01s | Avg:  9m 00s | Max:  9m 05s | Hits:  92%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 33s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 01m | Avg:  4m 41s | Max: 16m 14s | Hits:  92%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 29m | Avg:  3m 44s | Max:  9m 05s | Hits:  92%/312   
      🟩 Test               Pass: 100%/2   | Total: 32m 04s | Avg: 16m 02s | Max: 16m 14s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 52s | Avg:  2m 52s | Max:  2m 52s
      🟩 90a                Pass: 100%/1   | Total:  2m 46s | Avg:  2m 46s | Max:  2m 46s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 19m 35s | Avg:  3m 15s | Max:  5m 33s
      🟩 20                 Pass: 100%/20  | Total:  1h 42m | Avg:  5m 06s | Max: 16m 14s | Hits:  92%/312   
    
  • 🟩 cccl: Pass: 100%/6 | Total: 26m 54s | Avg: 4m 29s | Max: 5m 14s

    🟩 cpu
      🟩 amd64              Pass: 100%/6   | Total: 26m 54s | Avg:  4m 29s | Max:  5m 14s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  8m 24s | Avg:  4m 12s | Max:  4m 18s
      🟩 12.0               Pass: 100%/2   | Total:  9m 30s | Avg:  4m 45s | Max:  5m 14s
      🟩 12.6               Pass: 100%/2   | Total:  9m 00s | Avg:  4m 30s | Max:  4m 52s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  8m 24s | Avg:  4m 12s | Max:  4m 18s
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 30s | Avg:  4m 45s | Max:  5m 14s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 00s | Avg:  4m 30s | Max:  4m 52s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/6   | Total: 26m 54s | Avg:  4m 29s | Max:  5m 14s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 06s | Avg:  4m 06s | Max:  4m 06s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
      🟩 GCC6               Pass: 100%/1   | Total:  4m 18s | Avg:  4m 18s | Max:  4m 18s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 16s | Avg:  4m 16s | Max:  4m 16s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 08s | Avg:  4m 08s | Max:  4m 08s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/3   | Total: 14m 12s | Avg:  4m 44s | Max:  5m 14s
      🟩 GCC                Pass: 100%/3   | Total: 12m 42s | Avg:  4m 14s | Max:  4m 18s
    🟩 gpu
      🟩 v100               Pass: 100%/6   | Total: 26m 54s | Avg:  4m 29s | Max:  5m 14s
    🟩 jobs
      🟩 Infra              Pass: 100%/6   | Total: 26m 54s | Avg:  4m 29s | Max:  5m 14s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 8m 54s | Avg: 4m 27s | Max: 6m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 54s | Avg:  4m 27s | Max:  6m 55s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  8m 54s | Avg:  4m 27s | Max:  6m 55s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 54s | Avg:  4m 27s | Max:  6m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  8m 54s | Avg:  4m 27s | Max:  6m 55s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  8m 54s | Avg:  4m 27s | Max:  6m 55s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  8m 54s | Avg:  4m 27s | Max:  6m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  8m 54s | Avg:  4m 27s | Max:  6m 55s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
      🟩 Test               Pass: 100%/1   | Total:  6m 55s | Avg:  6m 55s | Max:  6m 55s
    
  • 🟩 python: Pass: 100%/1 | Total: 30m 31s | Avg: 30m 31s | Max: 30m 31s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 30m 31s | Avg: 30m 31s | Max: 30m 31s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 30m 31s | Avg: 30m 31s | Max: 30m 31s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 30m 31s | Avg: 30m 31s | Max: 30m 31s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 30m 31s | Avg: 30m 31s | Max: 30m 31s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 30m 31s | Avg: 30m 31s | Max: 30m 31s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 30m 31s | Avg: 30m 31s | Max: 30m 31s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 30m 31s | Avg: 30m 31s | Max: 30m 31s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 30m 31s | Avg: 30m 31s | Max: 30m 31s
    

👃 Inspect Changes

Modifications in project?

    Project
+/- CCCL Infrastructure
    libcu++
    CUB
    Thrust
    CUDA Experimental
    python
    CCCL C Parallel Library
    Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 176)

# Runner
125 linux-amd64-cpu16
25 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Projects
Status: In Review

2 participants