Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ceil_div return common type and optmize #3229

Merged
merged 20 commits into from
Jan 23, 2025

Conversation

fbusato
Copy link
Contributor

@fbusato fbusato commented Jan 1, 2025

Fixes #2845, #2391

Description

ceil_div returns the resulting type of the operation and has been optimized for CUDA

Features

  • Add concept-like macro
  • Use common type for the output
  • Handle enumerator for a, b, and both
  • Document performance considerations

#### DO NOT MERGE

  • require C++17
  • breaking change in the API

@fbusato fbusato requested review from a team as code owners January 1, 2025 01:44
@fbusato fbusato requested review from wmaxey and alliepiper January 1, 2025 01:44
Copy link
Collaborator

@miscco miscco left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I find the previous implementation much simpler, please keep signed and unsigned separate

@fbusato
Copy link
Contributor Author

fbusato commented Jan 2, 2025

I find the previous implementation much simpler, please keep signed and unsigned separate

I don't quite agree...With two separate functions we need to duplicate 15 lines of code, which is not great.

template <class _Tp,
          class _Up,
          _CUDA_VSTD::enable_if_t<_CCCL_TRAIT(_CUDA_VSTD::is_integral, _Tp), int> = 0,
          _CUDA_VSTD::enable_if_t<_CCCL_TRAIT(_CUDA_VSTD::is_integral, _Up), int> = 0>
_CCCL_NODISCARD _LIBCUDACXX_HIDE_FROM_ABI _CCCL_CONSTEXPR_CXX14 decltype(_Tp{} / _Up{})
ceil_div(const _Tp __a, const _Up __b) noexcept
{
  _CCCL_ASSERT(__b > _Up{0}, "cuda::ceil_div: b must be positive");
  using _Common  = decltype(_Tp{} / _Up{});
  using _UCommon = _CUDA_VSTD::make_unsigned_t<_Common>;
  if constexpr (_CUDA_VSTD::is_signed_v<_Tp>)
  {
    _CCCL_ASSERT(__a >= _Tp{0}, "cuda::ceil_div: a must be non negative");
  }
  auto __a1 = static_cast<_UCommon>(__a);
  auto __b1 = static_cast<_UCommon>(__b);

Copy link
Contributor

github-actions bot commented Jan 2, 2025

🟨 CI finished in 2h 02m: Pass: 79%/170 | Total: 3d 02h | Avg: 26m 16s | Max: 1h 23m | Hits: 36%/17647
  • 🟨 libcudacxx: Pass: 72%/48 | Total: 15h 01m | Avg: 18m 46s | Max: 1h 23m | Hits: 30%/7578

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  71%/46  | Total: 14h 17m | Avg: 18m 38s | Max:  1h 23m | Hits:  30%/7578  
      🟩 arm64              Pass: 100%/2   | Total: 43m 46s | Avg: 21m 53s | Max: 22m 50s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 09m | Avg: 17m 24s | Max: 22m 43s
      🔍 nvcc               Pass:  70%/44  | Total: 13h 51m | Avg: 18m 53s | Max:  1h 23m | Hits:  30%/7578  
    🟨 ctk
      🟥 11.1               Pass:   0%/7   | Total: 58m 15s | Avg:  8m 19s | Max: 17m 41s
      🟩 12.5               Pass: 100%/2   | Total: 49m 28s | Avg: 24m 44s | Max: 34m 40s
      🟨 12.6               Pass:  84%/39  | Total: 13h 13m | Avg: 20m 20s | Max:  1h 23m | Hits:  30%/7578  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 09m | Avg: 17m 24s | Max: 22m 43s
      🟥 nvcc11.1           Pass:   0%/7   | Total: 58m 15s | Avg:  8m 19s | Max: 17m 41s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 49m 28s | Avg: 24m 44s | Max: 34m 40s
      🟨 nvcc12.6           Pass:  82%/35  | Total: 12h 03m | Avg: 20m 40s | Max:  1h 23m | Hits:  30%/7578  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total: 26m 32s | Avg:  6m 38s | Max: 17m 41s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 45s | Avg:  4m 45s | Max:  4m 45s
      🟩 Clang11            Pass: 100%/1   | Total:  4m 23s | Avg:  4m 23s | Max:  4m 23s
      🟩 Clang12            Pass: 100%/1   | Total:  4m 03s | Avg:  4m 03s | Max:  4m 03s
      🟩 Clang13            Pass: 100%/1   | Total: 21m 21s | Avg: 21m 21s | Max: 21m 21s
      🟩 Clang14            Pass: 100%/1   | Total: 22m 42s | Avg: 22m 42s | Max: 22m 42s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 30s | Avg:  4m 30s | Max:  4m 30s
      🟩 Clang16            Pass: 100%/1   | Total: 21m 34s | Avg: 21m 34s | Max: 21m 34s
      🟩 Clang17            Pass: 100%/1   | Total: 23m 37s | Avg: 23m 37s | Max: 23m 37s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 44m | Avg: 20m 31s | Max: 44m 04s
      🟥 GCC6               Pass:   0%/2   | Total:  3m 39s | Avg:  1m 49s | Max:  1m 50s
      🟥 GCC7               Pass:   0%/2   | Total: 20m 01s | Avg: 10m 00s | Max: 17m 55s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 37s | Avg:  3m 37s | Max:  3m 37s
      🟨 GCC9               Pass:  33%/3   | Total: 40m 54s | Avg: 13m 38s | Max: 22m 57s
      🟩 GCC10              Pass: 100%/1   | Total:  4m 18s | Avg:  4m 18s | Max:  4m 18s
      🟩 GCC11              Pass: 100%/1   | Total: 23m 35s | Avg: 23m 35s | Max: 23m 35s
      🟩 GCC12              Pass: 100%/1   | Total: 22m 31s | Avg: 22m 31s | Max: 22m 31s
      🟨 GCC13              Pass:  80%/10  | Total:  4h 44m | Avg: 28m 28s | Max:  1h 23m
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 22m 52s | Avg: 22m 52s | Max: 22m 52s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 16m 54s | Avg: 16m 54s | Max: 16m 54s
      🟩 MSVC14.29          Pass: 100%/1   | Total: 37m 57s | Avg: 37m 57s | Max: 37m 57s | Hits:  30%/2477  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 13m | Avg: 36m 30s | Max: 37m 15s | Hits:  29%/5101  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 49m 28s | Avg: 24m 44s | Max: 34m 40s
    🟨 cxx_family
      🟨 Clang              Pass:  80%/20  | Total:  4h 57m | Avg: 14m 52s | Max: 44m 04s
      🟨 GCC                Pass:  61%/21  | Total:  6h 43m | Avg: 19m 12s | Max:  1h 23m
      🟩 Intel              Pass: 100%/1   | Total: 22m 52s | Avg: 22m 52s | Max: 22m 52s
      🟨 MSVC               Pass:  75%/4   | Total:  2h 07m | Avg: 31m 58s | Max: 37m 57s | Hits:  30%/7578  
      🟩 NVHPC              Pass: 100%/2   | Total: 49m 28s | Avg: 24m 44s | Max: 34m 40s
    🟨 jobs
      🟨 Build              Pass:  73%/41  | Total: 10h 46m | Avg: 15m 46s | Max: 37m 57s | Hits:  30%/7578  
      🟨 NVRTC              Pass:  50%/4   | Total:  2h 05m | Avg: 31m 23s | Max: 38m 59s
      🟩 Test               Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 23m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 54s | Avg:  1m 54s | Max:  1m 54s
    🟨 std
      🟥 11                 Pass:   0%/6   | Total: 42m 32s | Avg:  7m 05s | Max: 32m 46s
      🟥 14                 Pass:   0%/5   | Total:  1h 05m | Avg: 13m 00s | Max: 23m 40s
      🟨 17                 Pass:  84%/13  | Total:  4h 29m | Avg: 20m 46s | Max: 37m 57s | Hits:  30%/4954  
      🟩 20                 Pass: 100%/23  | Total:  8h 41m | Avg: 22m 41s | Max:  1h 23m | Hits:  29%/2624  
    🟨 gpu
      🟨 v100               Pass:  72%/48  | Total: 15h 01m | Avg: 18m 46s | Max:  1h 23m | Hits:  30%/7578  
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s
      🟩 90a                Pass: 100%/2   | Total: 20m 56s | Avg: 10m 28s | Max: 13m 09s
    
  • 🟨 cub: Pass: 76%/47 | Total: 1d 07h | Avg: 40m 03s | Max: 1h 13m | Hits: 27%/2349

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  75%/45  | Total:  1d 05h | Avg: 39m 07s | Max:  1h 13m | Hits:  27%/2349  
      🟩 arm64              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 06m
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 04m
      🔍 nvcc               Pass:  75%/45  | Total:  1d 05h | Avg: 39m 08s | Max:  1h 13m | Hits:  27%/2349  
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 50m 10s | Avg: 25m 05s | Max: 27m 26s
      🔍 v100               Pass:  75%/45  | Total:  1d 06h | Avg: 40m 43s | Max:  1h 13m | Hits:  27%/2349  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  72%/40  | Total:  1d 04h | Avg: 42m 24s | Max:  1h 13m | Hits:  27%/2349  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 45m 00s | Avg: 45m 00s | Max: 45m 00s
      🟩 GraphCapture       Pass: 100%/1   | Total: 20m 38s | Avg: 20m 38s | Max: 20m 38s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 02m | Avg: 20m 54s | Max: 22m 44s
      🟩 TestGPU            Pass: 100%/2   | Total: 57m 57s | Avg: 28m 58s | Max: 29m 03s
    🟨 ctk
      🟥 11.1               Pass:   0%/7   | Total: 33m 37s | Avg:  4m 48s | Max: 18m 21s
      🟩 12.5               Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 13m
      🟨 12.6               Pass:  89%/38  | Total:  1d 04h | Avg: 44m 48s | Max:  1h 06m | Hits:  27%/2349  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 04m
      🟥 nvcc11.1           Pass:   0%/7   | Total: 33m 37s | Avg:  4m 48s | Max: 18m 21s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 13m
      🟨 nvcc12.6           Pass:  88%/36  | Total:  1d 02h | Avg: 43m 55s | Max:  1h 06m | Hits:  27%/2349  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total: 12m 14s | Avg:  3m 03s | Max:  3m 14s
      🟩 Clang10            Pass: 100%/1   | Total: 58m 01s | Avg: 58m 01s | Max: 58m 01s
      🟩 Clang11            Pass: 100%/1   | Total: 59m 56s | Avg: 59m 56s | Max: 59m 56s
      🟩 Clang12            Pass: 100%/1   | Total: 52m 27s | Avg: 52m 27s | Max: 52m 27s
      🟩 Clang13            Pass: 100%/1   | Total: 53m 29s | Avg: 53m 29s | Max: 53m 29s
      🟩 Clang14            Pass: 100%/1   | Total: 53m 47s | Avg: 53m 47s | Max: 53m 47s
      🟩 Clang15            Pass: 100%/1   | Total: 57m 41s | Avg: 57m 41s | Max: 57m 41s
      🟩 Clang16            Pass: 100%/1   | Total: 53m 41s | Avg: 53m 41s | Max: 53m 41s
      🟩 Clang17            Pass: 100%/1   | Total: 55m 59s | Avg: 55m 59s | Max: 55m 59s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 35m | Avg: 47m 52s | Max:  1h 04m
      🟥 GCC6               Pass:   0%/2   | Total:  4m 06s | Avg:  2m 03s | Max:  2m 04s
      🟥 GCC7               Pass:   0%/2   | Total:  5m 53s | Avg:  2m 56s | Max:  2m 57s
      🟩 GCC8               Pass: 100%/1   | Total: 58m 49s | Avg: 58m 49s | Max: 58m 49s
      🟨 GCC9               Pass:  33%/3   | Total: 59m 09s | Avg: 19m 43s | Max: 53m 48s
      🟩 GCC10              Pass: 100%/1   | Total: 56m 43s | Avg: 56m 43s | Max: 56m 43s
      🟩 GCC11              Pass: 100%/1   | Total: 58m 25s | Avg: 58m 25s | Max: 58m 25s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 45m | Avg: 35m 01s | Max: 54m 53s
      🟩 GCC13              Pass: 100%/8   | Total:  5h 23m | Avg: 40m 23s | Max:  1h 06m
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
      🟥 MSVC14.16          Pass:   0%/1   | Total: 18m 21s | Avg: 18m 21s | Max: 18m 21s
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m | Hits:  28%/783   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m | Hits:  27%/1566  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 13m
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total: 13h 12m | Avg: 41m 42s | Max:  1h 04m
      🟨 GCC                Pass:  71%/21  | Total: 11h 11m | Avg: 31m 57s | Max:  1h 06m
      🟩 Intel              Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
      🟨 MSVC               Pass:  75%/4   | Total:  3h 31m | Avg: 52m 52s | Max:  1h 04m | Hits:  27%/2349  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 13m
    🟨 std
      🟥 11                 Pass:   0%/5   | Total: 13m 41s | Avg:  2m 44s | Max:  3m 14s
      🟥 14                 Pass:   0%/4   | Total: 26m 33s | Avg:  6m 38s | Max: 18m 21s
      🟨 17                 Pass:  83%/12  | Total: 10h 16m | Avg: 51m 21s | Max:  1h 13m | Hits:  28%/1566  
      🟩 20                 Pass: 100%/26  | Total: 20h 26m | Avg: 47m 09s | Max:  1h 12m | Hits:  27%/783   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 50m 10s | Avg: 25m 05s | Max: 27m 26s
      🟩 90a                Pass: 100%/1   | Total: 27m 02s | Avg: 27m 02s | Max: 27m 02s
    
  • 🟨 thrust: Pass: 76%/46 | Total: 1d 00h | Avg: 32m 06s | Max: 1h 15m | Hits: 43%/7408

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  75%/44  | Total: 23h 21m | Avg: 31m 50s | Max:  1h 15m | Hits:  43%/7408  
      🟩 arm64              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 59s | Max: 41m 06s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 09m | Avg: 34m 56s | Max: 38m 30s
      🔍 nvcc               Pass:  75%/44  | Total: 23h 27m | Avg: 31m 58s | Max:  1h 15m | Hits:  43%/7408  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  72%/40  | Total: 22h 23m | Avg: 33m 35s | Max:  1h 15m | Hits:  24%/5556  
      🟩 TestCPU            Pass: 100%/3   | Total: 38m 05s | Avg: 12m 41s | Max: 21m 56s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 35m | Avg: 31m 42s | Max: 54m 30s
    🟨 ctk
      🟥 11.1               Pass:   0%/7   | Total: 36m 15s | Avg:  5m 10s | Max: 25m 00s
      🟩 12.5               Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 15m
      🟨 12.6               Pass:  89%/37  | Total: 21h 37m | Avg: 35m 03s | Max:  1h 12m | Hits:  43%/7408  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 09m | Avg: 34m 56s | Max: 38m 30s
      🟥 nvcc11.1           Pass:   0%/7   | Total: 36m 15s | Avg:  5m 10s | Max: 25m 00s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 15m
      🟨 nvcc12.6           Pass:  88%/35  | Total: 20h 27m | Avg: 35m 04s | Max:  1h 12m | Hits:  43%/7408  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total:  8m 43s | Avg:  2m 10s | Max:  2m 26s
      🟩 Clang10            Pass: 100%/1   | Total: 39m 37s | Avg: 39m 37s | Max: 39m 37s
      🟩 Clang11            Pass: 100%/1   | Total: 36m 48s | Avg: 36m 48s | Max: 36m 48s
      🟩 Clang12            Pass: 100%/1   | Total: 36m 20s | Avg: 36m 20s | Max: 36m 20s
      🟩 Clang13            Pass: 100%/1   | Total: 34m 48s | Avg: 34m 48s | Max: 34m 48s
      🟩 Clang14            Pass: 100%/1   | Total: 41m 57s | Avg: 41m 57s | Max: 41m 57s
      🟩 Clang15            Pass: 100%/1   | Total: 43m 30s | Avg: 43m 30s | Max: 43m 30s
      🟩 Clang16            Pass: 100%/1   | Total: 40m 17s | Avg: 40m 17s | Max: 40m 17s
      🟩 Clang17            Pass: 100%/1   | Total: 40m 59s | Avg: 40m 59s | Max: 40m 59s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 32m | Avg: 30m 23s | Max: 40m 27s
      🟥 GCC6               Pass:   0%/2   | Total:  3m 42s | Avg:  1m 51s | Max:  1m 54s
      🟥 GCC7               Pass:   0%/2   | Total:  4m 20s | Avg:  2m 10s | Max:  2m 11s
      🟩 GCC8               Pass: 100%/1   | Total: 41m 50s | Avg: 41m 50s | Max: 41m 50s
      🟨 GCC9               Pass:  33%/3   | Total: 44m 23s | Avg: 14m 47s | Max: 40m 50s
      🟩 GCC10              Pass: 100%/1   | Total: 43m 10s | Avg: 43m 10s | Max: 43m 10s
      🟩 GCC11              Pass: 100%/1   | Total: 41m 24s | Avg: 41m 24s | Max: 41m 24s
      🟩 GCC12              Pass: 100%/1   | Total: 46m 46s | Avg: 46m 46s | Max: 46m 46s
      🟩 GCC13              Pass: 100%/8   | Total:  4h 22m | Avg: 32m 49s | Max: 54m 30s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 25m 00s | Avg: 25m 00s | Max: 25m 00s
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m | Hits:  24%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 46m | Avg: 55m 38s | Max:  1h 12m | Hits:  49%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 15m
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total:  8h 55m | Avg: 28m 11s | Max: 43m 30s
      🟨 GCC                Pass:  68%/19  | Total:  8h 08m | Avg: 25m 41s | Max: 54m 30s
      🟩 Intel              Pass: 100%/1   | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s
      🟨 MSVC               Pass:  80%/5   | Total:  4h 19m | Avg: 51m 54s | Max:  1h 12m | Hits:  43%/7408  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 15m
    🟨 std
      🟥 11                 Pass:   0%/5   | Total: 10m 05s | Avg:  2m 01s | Max:  2m 17s
      🟥 14                 Pass:   0%/4   | Total: 31m 23s | Avg:  7m 50s | Max: 25m 00s
      🟨 17                 Pass:  83%/12  | Total:  8h 10m | Avg: 40m 50s | Max:  1h 12m | Hits:  24%/3704  
      🟩 20                 Pass: 100%/23  | Total: 14h 54m | Avg: 38m 54s | Max:  1h 15m | Hits:  62%/3704  
    🟨 gpu
      🟨 v100               Pass:  76%/46  | Total:  1d 00h | Avg: 32m 06s | Max:  1h 15m | Hits:  43%/7408  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 50m 54s | Avg: 25m 27s | Max: 35m 34s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 26m 46s | Avg: 26m 46s | Max: 26m 46s
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 46m | Avg: 6m 25s | Max: 34m 19s | Hits: 90%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 32m | Avg:  6m 57s | Max: 34m 19s | Hits:  90%/312   
      🟩 arm64              Pass: 100%/4   | Total: 13m 55s | Avg:  3m 28s | Max:  3m 41s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 16m 58s | Avg:  5m 39s | Max:  9m 53s | Hits:  91%/156   
      🟩 12.5               Pass: 100%/2   | Total: 12m 04s | Avg:  6m 02s | Max:  6m 15s
      🟩 12.6               Pass: 100%/21  | Total:  2h 17m | Avg:  6m 33s | Max: 34m 19s | Hits:  90%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 16m 58s | Avg:  5m 39s | Max:  9m 53s | Hits:  91%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 04s | Avg:  6m 02s | Max:  6m 15s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  2h 17m | Avg:  6m 33s | Max: 34m 19s | Hits:  90%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 46m | Avg:  6m 25s | Max: 34m 19s | Hits:  90%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 42s | Avg:  3m 42s | Max:  3m 42s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 37s | Avg:  4m 37s | Max:  4m 37s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 09s | Avg:  4m 09s | Max:  4m 09s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 02s | Avg:  4m 02s | Max:  4m 02s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 54s | Avg:  3m 54s | Max:  3m 54s
      🟩 Clang18            Pass: 100%/4   | Total: 34m 48s | Avg:  8m 42s | Max: 23m 59s
      🟩 GCC9               Pass: 100%/1   | Total:  3m 23s | Avg:  3m 23s | Max:  3m 23s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 37s | Avg:  3m 37s | Max:  3m 37s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 GCC12              Pass: 100%/2   | Total: 38m 27s | Avg: 19m 13s | Max: 34m 19s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 50s | Avg:  3m 27s | Max:  3m 41s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  9m 53s | Avg:  9m 53s | Max:  9m 53s | Hits:  91%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 25s | Avg: 11m 25s | Max: 11m 25s | Hits:  90%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 04s | Avg:  6m 02s | Max:  6m 15s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total:  1h 10m | Avg:  5m 24s | Max: 23m 59s
      🟩 GCC                Pass: 100%/9   | Total:  1h 03m | Avg:  7m 01s | Max: 34m 19s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 18s | Avg: 10m 39s | Max: 11m 25s | Hits:  90%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 04s | Avg:  6m 02s | Max:  6m 15s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 46m | Avg:  6m 25s | Max: 34m 19s | Hits:  90%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 48m | Avg:  4m 31s | Max: 11m 25s | Hits:  90%/312   
      🟩 Test               Pass: 100%/2   | Total: 58m 18s | Avg: 29m 09s | Max: 34m 19s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 03s | Avg:  3m 03s | Max:  3m 03s
      🟩 90a                Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 22m 54s | Avg:  3m 49s | Max:  5m 49s
      🟩 20                 Pass: 100%/20  | Total:  2h 23m | Avg:  7m 11s | Max: 34m 19s | Hits:  90%/312   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 55s | Avg: 4m 57s | Max: 7m 34s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 21s | Avg:  2m 21s | Max:  2m 21s
      🟩 Test               Pass: 100%/1   | Total:  7m 34s | Avg:  7m 34s | Max:  7m 34s
    
  • 🟩 python: Pass: 100%/1 | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 170)

# Runner
125 linux-amd64-cpu16
19 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@fbusato fbusato self-assigned this Jan 2, 2025
Copy link
Contributor

github-actions bot commented Jan 8, 2025

🟨 CI finished in 2h 27m: Pass: 79%/170 | Total: 2d 18h | Avg: 23m 36s | Max: 1h 24m | Hits: 15%/17650
  • 🟨 libcudacxx: Pass: 72%/48 | Total: 8h 52m | Avg: 11m 05s | Max: 44m 54s | Hits: 11%/7590

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  71%/46  | Total:  8h 45m | Avg: 11m 24s | Max: 44m 54s | Hits:  11%/7590  
      🟩 arm64              Pass: 100%/2   | Total:  7m 00s | Avg:  3m 30s | Max:  3m 36s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 06m | Avg: 16m 30s | Max: 21m 29s
      🔍 nvcc               Pass:  70%/44  | Total:  7h 46m | Avg: 10m 35s | Max: 44m 54s | Hits:  11%/7590  
    🟨 ctk
      🟥 11.1               Pass:   0%/7   | Total: 36m 01s | Avg:  5m 08s | Max: 21m 41s
      🟩 12.5               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 39s
      🟨 12.6               Pass:  84%/39  | Total:  7h 10m | Avg: 11m 01s | Max: 44m 54s | Hits:  11%/7590  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 06m | Avg: 16m 30s | Max: 21m 29s
      🟥 nvcc11.1           Pass:   0%/7   | Total: 36m 01s | Avg:  5m 08s | Max: 21m 41s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 39s
      🟨 nvcc12.6           Pass:  82%/35  | Total:  6h 04m | Avg: 10m 24s | Max: 44m 54s | Hits:  11%/7590  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total: 12m 54s | Avg:  3m 13s | Max:  5m 04s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 42s | Avg:  4m 42s | Max:  4m 42s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s
      🟩 Clang12            Pass: 100%/1   | Total:  4m 16s | Avg:  4m 16s | Max:  4m 16s
      🟩 Clang13            Pass: 100%/1   | Total:  4m 12s | Avg:  4m 12s | Max:  4m 12s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 05s | Avg:  4m 05s | Max:  4m 05s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 37s | Avg:  4m 37s | Max:  4m 37s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 31s | Avg:  4m 31s | Max:  4m 31s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 36m | Avg: 12m 02s | Max: 21m 29s
      🟥 GCC6               Pass:   0%/2   | Total:  3m 42s | Avg:  1m 51s | Max:  2m 00s
      🟥 GCC7               Pass:   0%/2   | Total:  5m 27s | Avg:  2m 43s | Max:  3m 21s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
      🟨 GCC9               Pass:  33%/3   | Total:  8m 41s | Avg:  2m 53s | Max:  3m 40s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 39s | Avg:  3m 39s | Max:  3m 39s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 50s | Avg:  3m 50s | Max:  3m 50s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s
      🟨 GCC13              Pass:  80%/10  | Total:  2h 02m | Avg: 12m 16s | Max: 26m 14s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 25m 33s | Avg: 25m 33s | Max: 25m 33s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 21m 41s | Avg: 21m 41s | Max: 21m 41s
      🟩 MSVC14.29          Pass: 100%/1   | Total: 38m 36s | Avg: 38m 36s | Max: 38m 36s | Hits:  11%/2481  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 20m | Avg: 40m 22s | Max: 44m 54s | Hits:  11%/5109  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 39s
    🟨 cxx_family
      🟨 Clang              Pass:  80%/20  | Total:  2h 23m | Avg:  7m 11s | Max: 21m 29s
      🟨 GCC                Pass:  61%/21  | Total:  2h 35m | Avg:  7m 24s | Max: 26m 14s
      🟩 Intel              Pass: 100%/1   | Total: 25m 33s | Avg: 25m 33s | Max: 25m 33s
      🟨 MSVC               Pass:  75%/4   | Total:  2h 21m | Avg: 35m 15s | Max: 44m 54s | Hits:  11%/7590  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 39s
    🟨 jobs
      🟨 Build              Pass:  73%/41  | Total:  6h 46m | Avg:  9m 55s | Max: 44m 54s | Hits:  11%/7590  
      🟨 NVRTC              Pass:  50%/4   | Total:  1h 28m | Avg: 22m 11s | Max: 26m 14s
      🟩 Test               Pass: 100%/2   | Total: 34m 28s | Avg: 17m 14s | Max: 17m 53s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 58s | Avg:  1m 58s | Max:  1m 58s
    🟨 std
      🟥 11                 Pass:   0%/6   | Total: 30m 42s | Avg:  5m 07s | Max: 20m 35s
      🟥 14                 Pass:   0%/5   | Total: 53m 16s | Avg: 10m 39s | Max: 21m 41s
      🟨 17                 Pass:  84%/13  | Total:  3h 25m | Avg: 15m 48s | Max: 38m 36s | Hits:  11%/4962  
      🟩 20                 Pass: 100%/23  | Total:  4h 00m | Avg: 10m 27s | Max: 44m 54s | Hits:  10%/2628  
    🟨 gpu
      🟨 v100               Pass:  72%/48  | Total:  8h 52m | Avg: 11m 05s | Max: 44m 54s | Hits:  11%/7590  
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 12m 44s | Avg: 12m 44s | Max: 12m 44s
      🟩 90a                Pass: 100%/2   | Total: 17m 17s | Avg:  8m 38s | Max: 13m 22s
    
  • 🟨 cub: Pass: 76%/47 | Total: 1d 06h | Avg: 39m 34s | Max: 1h 11m | Hits: 2%/2340

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  75%/45  | Total:  1d 05h | Avg: 38m 41s | Max:  1h 11m | Hits:   2%/2340  
      🟩 arm64              Pass: 100%/2   | Total:  1h 59m | Avg: 59m 33s | Max:  1h 02m
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m
      🔍 nvcc               Pass:  75%/45  | Total:  1d 04h | Avg: 38m 39s | Max:  1h 11m | Hits:   2%/2340  
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 41m 01s | Avg: 20m 30s | Max: 24m 53s
      🔍 v100               Pass:  75%/45  | Total:  1d 06h | Avg: 40m 25s | Max:  1h 11m | Hits:   2%/2340  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  72%/40  | Total:  1d 04h | Avg: 43m 05s | Max:  1h 11m | Hits:   2%/2340  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 27s | Avg: 18m 27s | Max: 18m 27s
      🟩 GraphCapture       Pass: 100%/1   | Total: 14m 27s | Avg: 14m 27s | Max: 14m 27s
      🟩 HostLaunch         Pass: 100%/3   | Total: 52m 58s | Avg: 17m 39s | Max: 18m 38s
      🟩 TestGPU            Pass: 100%/2   | Total: 50m 28s | Avg: 25m 14s | Max: 25m 45s
    🟨 ctk
      🟥 11.1               Pass:   0%/7   | Total: 34m 02s | Avg:  4m 51s | Max: 19m 19s
      🟩 12.5               Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 11m
      🟨 12.6               Pass:  89%/38  | Total:  1d 04h | Avg: 44m 21s | Max:  1h 11m | Hits:   2%/2340  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m
      🟥 nvcc11.1           Pass:   0%/7   | Total: 34m 02s | Avg:  4m 51s | Max: 19m 19s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 11m
      🟨 nvcc12.6           Pass:  88%/36  | Total:  1d 02h | Avg: 43m 28s | Max:  1h 11m | Hits:   2%/2340  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total: 12m 20s | Avg:  3m 05s | Max:  3m 20s
      🟩 Clang10            Pass: 100%/1   | Total: 57m 21s | Avg: 57m 21s | Max: 57m 21s
      🟩 Clang11            Pass: 100%/1   | Total: 54m 25s | Avg: 54m 25s | Max: 54m 25s
      🟩 Clang12            Pass: 100%/1   | Total: 55m 48s | Avg: 55m 48s | Max: 55m 48s
      🟩 Clang13            Pass: 100%/1   | Total: 54m 40s | Avg: 54m 40s | Max: 54m 40s
      🟩 Clang14            Pass: 100%/1   | Total: 55m 17s | Avg: 55m 17s | Max: 55m 17s
      🟩 Clang15            Pass: 100%/1   | Total: 59m 36s | Avg: 59m 36s | Max: 59m 36s
      🟩 Clang16            Pass: 100%/1   | Total: 57m 02s | Avg: 57m 02s | Max: 57m 02s
      🟩 Clang17            Pass: 100%/1   | Total: 59m 11s | Avg: 59m 11s | Max: 59m 11s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 36m | Avg: 48m 06s | Max:  1h 01m
      🟥 GCC6               Pass:   0%/2   | Total:  3m 47s | Avg:  1m 53s | Max:  1m 56s
      🟥 GCC7               Pass:   0%/2   | Total:  6m 13s | Avg:  3m 06s | Max:  3m 13s
      🟩 GCC8               Pass: 100%/1   | Total: 55m 20s | Avg: 55m 20s | Max: 55m 20s
      🟨 GCC9               Pass:  33%/3   | Total:  1h 03m | Avg: 21m 10s | Max: 58m 18s
      🟩 GCC10              Pass: 100%/1   | Total: 57m 23s | Avg: 57m 23s | Max: 57m 23s
      🟩 GCC11              Pass: 100%/1   | Total: 57m 35s | Avg: 57m 35s | Max: 57m 35s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 45m | Avg: 35m 15s | Max:  1h 04m
      🟩 GCC13              Pass: 100%/8   | Total:  4h 43m | Avg: 35m 29s | Max:  1h 04m
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟥 MSVC14.16          Pass:   0%/1   | Total: 19m 19s | Avg: 19m 19s | Max: 19m 19s
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m | Hits:   2%/780   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 11m | Hits:   2%/1560  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 11m
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total: 13h 22m | Avg: 42m 13s | Max:  1h 01m
      🟨 GCC                Pass:  71%/21  | Total: 10h 33m | Avg: 30m 09s | Max:  1h 04m
      🟩 Intel              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟨 MSVC               Pass:  75%/4   | Total:  3h 43m | Avg: 55m 50s | Max:  1h 11m | Hits:   2%/2340  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 11m
    🟨 std
      🟥 11                 Pass:   0%/5   | Total: 13m 59s | Avg:  2m 47s | Max:  3m 16s
      🟥 14                 Pass:   0%/4   | Total: 27m 30s | Avg:  6m 52s | Max: 19m 19s
      🟨 17                 Pass:  83%/12  | Total: 10h 11m | Avg: 50m 58s | Max:  1h 11m | Hits:   2%/1560  
      🟩 20                 Pass: 100%/26  | Total: 20h 06m | Avg: 46m 24s | Max:  1h 11m | Hits:   2%/780   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 41m 01s | Avg: 20m 30s | Max: 24m 53s
      🟩 90a                Pass: 100%/1   | Total: 26m 42s | Avg: 26m 42s | Max: 26m 42s
    
  • 🟨 thrust: Pass: 76%/46 | Total: 1d 00h | Avg: 31m 19s | Max: 1h 24m | Hits: 25%/7408

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  75%/44  | Total: 22h 45m | Avg: 31m 01s | Max:  1h 24m | Hits:  25%/7408  
      🟩 arm64              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 47s | Max: 40m 20s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 14m | Avg: 37m 19s | Max: 37m 55s
      🔍 nvcc               Pass:  75%/44  | Total: 22h 46m | Avg: 31m 03s | Max:  1h 24m | Hits:  25%/7408  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  72%/40  | Total: 22h 45m | Avg: 34m 08s | Max:  1h 24m | Hits:   0%/5556  
      🟩 TestCPU            Pass: 100%/3   | Total: 37m 34s | Avg: 12m 31s | Max: 22m 27s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 37m 54s | Avg: 12m 38s | Max: 13m 21s
    🟨 ctk
      🟥 11.1               Pass:   0%/7   | Total: 35m 21s | Avg:  5m 03s | Max: 24m 16s
      🟩 12.5               Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 14m
      🟨 12.6               Pass:  89%/37  | Total: 21h 01m | Avg: 34m 05s | Max:  1h 24m | Hits:  25%/7408  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 14m | Avg: 37m 19s | Max: 37m 55s
      🟥 nvcc11.1           Pass:   0%/7   | Total: 35m 21s | Avg:  5m 03s | Max: 24m 16s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 14m
      🟨 nvcc12.6           Pass:  88%/35  | Total: 19h 46m | Avg: 33m 53s | Max:  1h 24m | Hits:  25%/7408  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total:  8m 20s | Avg:  2m 05s | Max:  2m 16s
      🟩 Clang10            Pass: 100%/1   | Total: 39m 02s | Avg: 39m 02s | Max: 39m 02s
      🟩 Clang11            Pass: 100%/1   | Total: 40m 12s | Avg: 40m 12s | Max: 40m 12s
      🟩 Clang12            Pass: 100%/1   | Total: 38m 47s | Avg: 38m 47s | Max: 38m 47s
      🟩 Clang13            Pass: 100%/1   | Total: 39m 00s | Avg: 39m 00s | Max: 39m 00s
      🟩 Clang14            Pass: 100%/1   | Total: 41m 32s | Avg: 41m 32s | Max: 41m 32s
      🟩 Clang15            Pass: 100%/1   | Total: 39m 02s | Avg: 39m 02s | Max: 39m 02s
      🟩 Clang16            Pass: 100%/1   | Total: 42m 09s | Avg: 42m 09s | Max: 42m 09s
      🟩 Clang17            Pass: 100%/1   | Total: 39m 00s | Avg: 39m 00s | Max: 39m 00s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 28m | Avg: 29m 50s | Max: 40m 19s
      🟥 GCC6               Pass:   0%/2   | Total:  3m 35s | Avg:  1m 47s | Max:  1m 59s
      🟥 GCC7               Pass:   0%/2   | Total:  4m 11s | Avg:  2m 05s | Max:  2m 15s
      🟩 GCC8               Pass: 100%/1   | Total: 35m 56s | Avg: 35m 56s | Max: 35m 56s
      🟨 GCC9               Pass:  33%/3   | Total: 43m 37s | Avg: 14m 32s | Max: 40m 01s
      🟩 GCC10              Pass: 100%/1   | Total: 40m 38s | Avg: 40m 38s | Max: 40m 38s
      🟩 GCC11              Pass: 100%/1   | Total: 44m 57s | Avg: 44m 57s | Max: 44m 57s
      🟩 GCC12              Pass: 100%/1   | Total: 42m 45s | Avg: 42m 45s | Max: 42m 45s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 41m | Avg: 27m 39s | Max: 44m 21s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 51m 09s | Avg: 51m 09s | Max: 51m 09s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 24m 16s | Avg: 24m 16s | Max: 24m 16s
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m | Hits:   0%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 56m | Avg: 58m 55s | Max:  1h 24m | Hits:  33%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 14m
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total:  8h 55m | Avg: 28m 12s | Max: 42m 09s
      🟨 GCC                Pass:  68%/19  | Total:  7h 16m | Avg: 22m 59s | Max: 44m 57s
      🟩 Intel              Pass: 100%/1   | Total: 51m 09s | Avg: 51m 09s | Max: 51m 09s
      🟨 MSVC               Pass:  80%/5   | Total:  4h 32m | Avg: 54m 29s | Max:  1h 24m | Hits:  25%/7408  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 14m
    🟨 std
      🟥 11                 Pass:   0%/5   | Total:  9m 56s | Avg:  1m 59s | Max:  2m 10s
      🟥 14                 Pass:   0%/4   | Total: 30m 23s | Avg:  7m 35s | Max: 24m 16s
      🟨 17                 Pass:  83%/12  | Total:  8h 22m | Avg: 41m 51s | Max:  1h 11m | Hits:   0%/3704  
      🟩 20                 Pass: 100%/23  | Total: 14h 11m | Avg: 37m 02s | Max:  1h 24m | Hits:  49%/3704  
    🟨 gpu
      🟨 v100               Pass:  76%/46  | Total:  1d 00h | Avg: 31m 19s | Max:  1h 24m | Hits:  25%/7408  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 46m 21s | Avg: 23m 10s | Max: 34m 39s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 29m 40s | Avg: 29m 40s | Max: 29m 40s
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 25m | Avg: 5m 34s | Max: 19m 40s | Hits: 9%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 11m | Avg:  5m 57s | Max: 19m 40s | Hits:   9%/312   
      🟩 arm64              Pass: 100%/4   | Total: 14m 02s | Avg:  3m 30s | Max:  3m 40s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 18m 52s | Avg:  6m 17s | Max: 11m 49s | Hits:   9%/156   
      🟩 12.5               Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 31s
      🟩 12.6               Pass: 100%/21  | Total:  1h 53m | Avg:  5m 24s | Max: 19m 40s | Hits:   9%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 18m 52s | Avg:  6m 17s | Max: 11m 49s | Hits:   9%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 31s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  1h 53m | Avg:  5m 24s | Max: 19m 40s | Hits:   9%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 25m | Avg:  5m 34s | Max: 19m 40s | Hits:   9%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 23s | Avg:  4m 23s | Max:  4m 23s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 42s | Avg:  3m 42s | Max:  3m 42s
      🟩 Clang13            Pass: 100%/1   | Total:  4m 02s | Avg:  4m 02s | Max:  4m 02s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 56s | Avg:  3m 56s | Max:  3m 56s
      🟩 Clang18            Pass: 100%/4   | Total: 30m 19s | Avg:  7m 34s | Max: 19m 40s
      🟩 GCC9               Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 55s | Avg:  3m 55s | Max:  3m 55s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s
      🟩 GCC12              Pass: 100%/2   | Total: 19m 10s | Avg:  9m 35s | Max: 15m 10s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 27s | Avg:  3m 21s | Max:  3m 40s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 49s | Avg: 11m 49s | Max: 11m 49s | Hits:   9%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 23s | Avg: 11m 23s | Max: 11m 23s | Hits:   9%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 31s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total:  1h 05m | Avg:  5m 02s | Max: 19m 40s
      🟩 GCC                Pass: 100%/9   | Total: 43m 51s | Avg:  4m 52s | Max: 15m 10s
      🟩 MSVC               Pass: 100%/2   | Total: 23m 12s | Avg: 11m 36s | Max: 11m 49s | Hits:   9%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 31s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 25m | Avg:  5m 34s | Max: 19m 40s | Hits:   9%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 50m | Avg:  4m 35s | Max: 11m 49s | Hits:   9%/312   
      🟩 Test               Pass: 100%/2   | Total: 34m 50s | Avg: 17m 25s | Max: 19m 40s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s
      🟩 90a                Pass: 100%/1   | Total:  3m 03s | Avg:  3m 03s | Max:  3m 03s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 23m 18s | Avg:  3m 53s | Max:  6m 02s
      🟩 20                 Pass: 100%/20  | Total:  2h 01m | Avg:  6m 05s | Max: 19m 40s | Hits:   9%/312   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 54s | Avg: 4m 57s | Max: 7m 42s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s
      🟩 Test               Pass: 100%/1   | Total:  7m 42s | Avg:  7m 42s | Max:  7m 42s
    
  • 🟩 python: Pass: 100%/1 | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 170)

# Runner
125 linux-amd64-cpu16
19 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@fbusato fbusato requested a review from miscco January 9, 2025 01:01
Copy link
Contributor

github-actions bot commented Jan 9, 2025

🟨 CI finished in 2h 28m: Pass: 78%/164 | Total: 2d 15h | Avg: 23m 08s | Max: 1h 16m | Hits: 419%/15310
  • 🟨 cub: Pass: 71%/45 | Total: 1d 05h | Avg: 38m 54s | Max: 1h 11m

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  69%/43  | Total:  1d 03h | Avg: 37m 55s | Max:  1h 11m
      🟩 arm64              Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 04m
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 57m | Avg: 58m 41s | Max: 58m 52s
      🔍 nvcc               Pass:  69%/43  | Total:  1d 03h | Avg: 37m 59s | Max:  1h 11m
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 40m 41s | Avg: 20m 20s | Max: 24m 38s
      🔍 v100               Pass:  69%/43  | Total:  1d 04h | Avg: 39m 46s | Max:  1h 11m
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  65%/38  | Total:  1d 02h | Avg: 42m 16s | Max:  1h 11m
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 59s | Avg: 18m 59s | Max: 18m 59s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 59s | Avg: 15m 59s | Max: 15m 59s
      🟩 HostLaunch         Pass: 100%/3   | Total: 56m 59s | Avg: 18m 59s | Max: 22m 09s
      🟩 TestGPU            Pass: 100%/2   | Total: 52m 27s | Avg: 26m 13s | Max: 29m 15s
    🟨 ctk
      🟥 11.1               Pass:   0%/6   | Total: 15m 52s | Avg:  2m 38s | Max:  2m 52s
      🟩 12.5               Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 11m
      🟨 12.6               Pass:  81%/37  | Total:  1d 02h | Avg: 43m 01s | Max:  1h 08m
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 57m | Avg: 58m 41s | Max: 58m 52s
      🟥 nvcc11.1           Pass:   0%/6   | Total: 15m 52s | Avg:  2m 38s | Max:  2m 52s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 11m
      🟨 nvcc12.6           Pass:  80%/35  | Total:  1d 00h | Avg: 42m 07s | Max:  1h 08m
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total: 12m 09s | Avg:  3m 02s | Max:  3m 23s
      🟩 Clang10            Pass: 100%/1   | Total: 55m 19s | Avg: 55m 19s | Max: 55m 19s
      🟩 Clang11            Pass: 100%/1   | Total: 57m 36s | Avg: 57m 36s | Max: 57m 36s
      🟩 Clang12            Pass: 100%/1   | Total: 53m 50s | Avg: 53m 50s | Max: 53m 50s
      🟩 Clang13            Pass: 100%/1   | Total: 56m 35s | Avg: 56m 35s | Max: 56m 35s
      🟩 Clang14            Pass: 100%/1   | Total: 52m 50s | Avg: 52m 50s | Max: 52m 50s
      🟩 Clang15            Pass: 100%/1   | Total: 59m 25s | Avg: 59m 25s | Max: 59m 25s
      🟩 Clang16            Pass: 100%/1   | Total: 55m 43s | Avg: 55m 43s | Max: 55m 43s
      🟩 Clang17            Pass: 100%/1   | Total: 52m 53s | Avg: 52m 53s | Max: 52m 53s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 39m | Avg: 48m 28s | Max: 59m 16s
      🟥 GCC7               Pass:   0%/4   | Total: 11m 23s | Avg:  2m 50s | Max:  3m 11s
      🟩 GCC8               Pass: 100%/1   | Total: 54m 14s | Avg: 54m 14s | Max: 54m 14s
      🟨 GCC9               Pass:  33%/3   | Total: 58m 22s | Avg: 19m 27s | Max: 53m 11s
      🟩 GCC10              Pass: 100%/1   | Total: 55m 47s | Avg: 55m 47s | Max: 55m 47s
      🟩 GCC11              Pass: 100%/1   | Total: 58m 19s | Avg: 58m 19s | Max: 58m 19s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 37m | Avg: 32m 33s | Max: 56m 59s
      🟩 GCC13              Pass: 100%/8   | Total:  4h 37m | Avg: 34m 37s | Max:  1h 04m
      🟥 MSVC14.29          Pass:   0%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m
      🟥 MSVC14.39          Pass:   0%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 08m
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 11m
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total: 13h 15m | Avg: 41m 52s | Max: 59m 25s
      🟨 GCC                Pass:  71%/21  | Total: 10h 12m | Avg: 29m 10s | Max:  1h 04m
      🟥 MSVC               Pass:   0%/3   | Total:  3h 19m | Avg:  1h 06m | Max:  1h 08m
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 11m
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 40m 41s | Avg: 20m 20s | Max: 24m 38s
      🟩 90a                Pass: 100%/1   | Total: 24m 10s | Avg: 24m 10s | Max: 24m 10s
    🟨 std
      🟥 11                 Pass:   0%/5   | Total: 14m 06s | Avg:  2m 49s | Max:  3m 16s
      🟥 14                 Pass:   0%/2   | Total:  6m 34s | Avg:  3m 17s | Max:  3m 23s
      🟨 17                 Pass:  58%/12  | Total:  9h 08m | Avg: 45m 40s | Max:  1h 11m
      🟨 20                 Pass:  96%/26  | Total: 19h 42m | Avg: 45m 27s | Max:  1h 11m
    
  • 🟨 libcudacxx: Pass: 73%/46 | Total: 8h 29m | Avg: 11m 04s | Max: 32m 21s | Hits: 657%/7590

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  72%/44  | Total:  8h 22m | Avg: 11m 24s | Max: 32m 21s | Hits: 657%/7590  
      🟩 arm64              Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  3m 41s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 05m | Avg: 16m 26s | Max: 20m 20s
      🔍 nvcc               Pass:  71%/42  | Total:  7h 23m | Avg: 10m 33s | Max: 32m 21s | Hits: 657%/7590  
    🟨 ctk
      🟥 11.1               Pass:   0%/6   | Total: 26m 14s | Avg:  4m 22s | Max: 14m 40s
      🟩 12.5               Pass: 100%/2   | Total: 38m 46s | Avg: 19m 23s | Max: 30m 26s
      🟨 12.6               Pass:  84%/38  | Total:  7h 24m | Avg: 11m 41s | Max: 32m 21s | Hits: 657%/7590  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 05m | Avg: 16m 26s | Max: 20m 20s
      🟥 nvcc11.1           Pass:   0%/6   | Total: 26m 14s | Avg:  4m 22s | Max: 14m 40s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 38m 46s | Avg: 19m 23s | Max: 30m 26s
      🟨 nvcc12.6           Pass:  82%/34  | Total:  6h 18m | Avg: 11m 07s | Max: 32m 21s | Hits: 657%/7590  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total: 12m 06s | Avg:  3m 01s | Max:  4m 23s
      🟩 Clang10            Pass: 100%/1   | Total:  5m 17s | Avg:  5m 17s | Max:  5m 17s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s
      🟩 Clang13            Pass: 100%/1   | Total: 20m 31s | Avg: 20m 31s | Max: 20m 31s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 55s | Avg:  3m 55s | Max:  3m 55s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 34s | Avg:  4m 34s | Max:  4m 34s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 30s | Avg:  4m 30s | Max:  4m 30s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 46m | Avg: 13m 19s | Max: 20m 20s
      🟥 GCC7               Pass:   0%/4   | Total: 21m 54s | Avg:  5m 28s | Max: 14m 40s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 34s | Avg:  3m 34s | Max:  3m 34s
      🟨 GCC9               Pass:  33%/3   | Total:  8m 29s | Avg:  2m 49s | Max:  4m 00s
      🟩 GCC10              Pass: 100%/1   | Total:  4m 01s | Avg:  4m 01s | Max:  4m 01s
      🟩 GCC11              Pass: 100%/1   | Total: 20m 39s | Avg: 20m 39s | Max: 20m 39s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 41s | Avg:  3m 41s | Max:  3m 41s
      🟨 GCC13              Pass:  80%/10  | Total:  2h 35m | Avg: 15m 33s | Max: 26m 31s
      🟩 MSVC14.29          Pass: 100%/1   | Total: 26m 43s | Avg: 26m 43s | Max: 26m 43s | Hits: 681%/2481  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 56m 18s | Avg: 28m 09s | Max: 32m 21s | Hits: 646%/5109  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 38m 46s | Avg: 19m 23s | Max: 30m 26s
    🟨 cxx_family
      🟨 Clang              Pass:  80%/20  | Total:  2h 49m | Avg:  8m 28s | Max: 20m 31s
      🟨 GCC                Pass:  61%/21  | Total:  3h 37m | Avg: 10m 22s | Max: 26m 31s
      🟩 MSVC               Pass: 100%/3   | Total:  1h 23m | Avg: 27m 40s | Max: 32m 21s | Hits: 657%/7590  
      🟩 NVHPC              Pass: 100%/2   | Total: 38m 46s | Avg: 19m 23s | Max: 30m 26s
    🟨 jobs
      🟨 Build              Pass:  74%/39  | Total:  6h 08m | Avg:  9m 27s | Max: 32m 21s | Hits: 657%/7590  
      🟨 NVRTC              Pass:  50%/4   | Total:  1h 34m | Avg: 23m 34s | Max: 25m 56s
      🟩 Test               Pass: 100%/2   | Total: 44m 05s | Avg: 22m 02s | Max: 26m 31s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 53s | Avg:  1m 53s | Max:  1m 53s
    🟨 std
      🟥 11                 Pass:   0%/6   | Total: 31m 24s | Avg:  5m 14s | Max: 22m 13s
      🟥 14                 Pass:   0%/3   | Total: 30m 01s | Avg: 10m 00s | Max: 21m 59s
      🟨 17                 Pass:  76%/13  | Total:  2h 57m | Avg: 13m 39s | Max: 30m 26s | Hits: 682%/4962  
      🟩 20                 Pass: 100%/23  | Total:  4h 28m | Avg: 11m 39s | Max: 32m 21s | Hits: 612%/2628  
    🟨 gpu
      🟨 v100               Pass:  73%/46  | Total:  8h 29m | Avg: 11m 04s | Max: 32m 21s | Hits: 657%/7590  
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 43s | Avg: 13m 43s | Max: 13m 43s
      🟩 90a                Pass: 100%/2   | Total: 16m 18s | Avg:  8m 09s | Max: 12m 26s
    
  • 🟨 thrust: Pass: 77%/44 | Total: 22h 30m | Avg: 30m 41s | Max: 1h 16m | Hits: 168%/7408

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  76%/42  | Total: 21h 10m | Avg: 30m 15s | Max:  1h 16m | Hits: 168%/7408  
      🟩 arm64              Pass: 100%/2   | Total:  1h 19m | Avg: 39m 48s | Max: 41m 55s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 05m | Avg: 32m 41s | Max: 33m 21s
      🔍 nvcc               Pass:  76%/42  | Total: 21h 25m | Avg: 30m 35s | Max:  1h 16m | Hits: 168%/7408  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  73%/38  | Total: 20h 57m | Avg: 33m 04s | Max:  1h 16m | Hits: 102%/5556  
      🟩 TestCPU            Pass: 100%/3   | Total: 52m 23s | Avg: 17m 27s | Max: 37m 18s | Hits: 365%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 41m 04s | Avg: 13m 41s | Max: 15m 06s
    🟨 ctk
      🟥 11.1               Pass:   0%/6   | Total: 10m 42s | Avg:  1m 47s | Max:  2m 01s
      🟩 12.5               Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 16m
      🟨 12.6               Pass:  88%/36  | Total: 19h 55m | Avg: 33m 12s | Max:  1h 13m | Hits: 168%/7408  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 05m | Avg: 32m 41s | Max: 33m 21s
      🟥 nvcc11.1           Pass:   0%/6   | Total: 10m 42s | Avg:  1m 47s | Max:  2m 01s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 16m
      🟨 nvcc12.6           Pass:  88%/34  | Total: 18h 50m | Avg: 33m 14s | Max:  1h 13m | Hits: 168%/7408  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total:  8m 42s | Avg:  2m 10s | Max:  2m 25s
      🟩 Clang10            Pass: 100%/1   | Total: 38m 35s | Avg: 38m 35s | Max: 38m 35s
      🟩 Clang11            Pass: 100%/1   | Total: 40m 18s | Avg: 40m 18s | Max: 40m 18s
      🟩 Clang12            Pass: 100%/1   | Total: 40m 09s | Avg: 40m 09s | Max: 40m 09s
      🟩 Clang13            Pass: 100%/1   | Total: 38m 09s | Avg: 38m 09s | Max: 38m 09s
      🟩 Clang14            Pass: 100%/1   | Total: 36m 11s | Avg: 36m 11s | Max: 36m 11s
      🟩 Clang15            Pass: 100%/1   | Total: 36m 51s | Avg: 36m 51s | Max: 36m 51s
      🟩 Clang16            Pass: 100%/1   | Total: 38m 49s | Avg: 38m 49s | Max: 38m 49s
      🟩 Clang17            Pass: 100%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 22m | Avg: 28m 56s | Max: 39m 58s
      🟥 GCC7               Pass:   0%/4   | Total:  7m 38s | Avg:  1m 54s | Max:  2m 07s
      🟩 GCC8               Pass: 100%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
      🟨 GCC9               Pass:  33%/3   | Total: 45m 51s | Avg: 15m 17s | Max: 42m 35s
      🟩 GCC10              Pass: 100%/1   | Total: 41m 45s | Avg: 41m 45s | Max: 41m 45s
      🟩 GCC11              Pass: 100%/1   | Total: 40m 32s | Avg: 40m 32s | Max: 40m 32s
      🟩 GCC12              Pass: 100%/1   | Total: 44m 28s | Avg: 44m 28s | Max: 44m 28s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 36m | Avg: 27m 01s | Max: 42m 43s
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 13m | Avg:  1h 13m | Max:  1h 13m | Hits: 104%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 55m | Avg: 58m 36s | Max:  1h 11m | Hits: 189%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 16m
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total:  8h 40m | Avg: 27m 22s | Max: 40m 18s
      🟨 GCC                Pass:  68%/19  | Total:  7h 16m | Avg: 22m 57s | Max: 44m 28s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 09m | Avg:  1h 02m | Max:  1h 13m | Hits: 168%/7408  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 16m
    🟨 std
      🟥 11                 Pass:   0%/5   | Total:  9m 51s | Avg:  1m 58s | Max:  2m 16s
      🟥 14                 Pass:   0%/2   | Total:  4m 32s | Avg:  2m 16s | Max:  2m 25s
      🟨 17                 Pass:  75%/12  | Total:  7h 27m | Avg: 37m 19s | Max:  1h 13m | Hits: 102%/3704  
      🟩 20                 Pass: 100%/23  | Total: 14h 02m | Avg: 36m 37s | Max:  1h 16m | Hits: 233%/3704  
    🟨 gpu
      🟨 v100               Pass:  77%/44  | Total: 22h 30m | Avg: 30m 41s | Max:  1h 16m | Hits: 168%/7408  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 45m 49s | Avg: 22m 54s | Max: 33m 56s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 24m 16s | Avg: 24m 16s | Max: 24m 16s
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 30m | Avg: 5m 46s | Max: 19m 04s | Hits: 574%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 15m | Avg:  6m 10s | Max: 19m 04s | Hits: 574%/312   
      🟩 arm64              Pass: 100%/4   | Total: 14m 20s | Avg:  3m 35s | Max:  3m 43s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 18m 56s | Avg:  6m 18s | Max: 11m 34s | Hits: 574%/156   
      🟩 12.5               Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 21s
      🟩 12.6               Pass: 100%/21  | Total:  1h 58m | Avg:  5m 39s | Max: 19m 04s | Hits: 574%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 18m 56s | Avg:  6m 18s | Max: 11m 34s | Hits: 574%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 21s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  1h 58m | Avg:  5m 39s | Max: 19m 04s | Hits: 574%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 30m | Avg:  5m 46s | Max: 19m 04s | Hits: 574%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 38s | Avg:  3m 38s | Max:  3m 38s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 22s | Avg:  4m 22s | Max:  4m 22s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 45s | Avg:  3m 45s | Max:  3m 45s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 12s | Avg:  4m 12s | Max:  4m 12s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 47s | Avg:  3m 47s | Max:  3m 47s
      🟩 Clang18            Pass: 100%/4   | Total: 30m 09s | Avg:  7m 32s | Max: 19m 04s
      🟩 GCC9               Pass: 100%/1   | Total:  3m 44s | Avg:  3m 44s | Max:  3m 44s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 37s | Avg:  3m 37s | Max:  3m 37s
      🟩 GCC12              Pass: 100%/2   | Total: 22m 41s | Avg: 11m 20s | Max: 18m 37s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 56s | Avg:  3m 29s | Max:  3m 43s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 34s | Avg: 11m 34s | Max: 11m 34s | Hits: 574%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 50s | Avg: 12m 50s | Max: 12m 50s | Hits: 574%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 21s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total:  1h 05m | Avg:  5m 02s | Max: 19m 04s
      🟩 GCC                Pass: 100%/9   | Total: 47m 49s | Avg:  5m 18s | Max: 18m 37s
      🟩 MSVC               Pass: 100%/2   | Total: 24m 24s | Avg: 12m 12s | Max: 12m 50s | Hits: 574%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 21s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 30m | Avg:  5m 46s | Max: 19m 04s | Hits: 574%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 52m | Avg:  4m 41s | Max: 12m 50s | Hits: 574%/312   
      🟩 Test               Pass: 100%/2   | Total: 37m 41s | Avg: 18m 50s | Max: 19m 04s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 13s | Avg:  3m 13s | Max:  3m 13s
      🟩 90a                Pass: 100%/1   | Total:  3m 20s | Avg:  3m 20s | Max:  3m 20s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 23m 52s | Avg:  3m 58s | Max:  6m 12s
      🟩 20                 Pass: 100%/20  | Total:  2h 06m | Avg:  6m 19s | Max: 19m 04s | Hits: 574%/312   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 8m 50s | Avg: 4m 25s | Max: 6m 33s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s
      🟩 Test               Pass: 100%/1   | Total:  6m 33s | Avg:  6m 33s | Max:  6m 33s
    
  • 🟩 python: Pass: 100%/1 | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 164)

# Runner
122 linux-amd64-cpu16
19 linux-amd64-gpu-v100-latest-1
12 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@fbusato fbusato added the 3.0 Targeted for 3.0 release label Jan 16, 2025
@fbusato fbusato changed the title [DO NOT MERGE] ceil_div return common type and optmize ceil_div return common type and optmize Jan 21, 2025
Copy link
Contributor

🟩 CI finished in 5h 36m: Pass: 100%/135 | Total: 2d 18h | Avg: 29m 35s | Max: 1h 44m | Hits: 360%/23291
  • 🟩 cub: Pass: 100%/38 | Total: 1d 08h | Avg: 51m 39s | Max: 1h 44m | Hits: 48%/3528

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total:  1d 06h | Avg: 51m 32s | Max:  1h 44m | Hits:  48%/3528  
      🟩 arm64              Pass: 100%/2   | Total:  1h 47m | Avg: 53m 47s | Max: 53m 49s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 42m | Avg: 56m 34s | Max:  1h 06m | Hits:  72%/882   
      🟩 12.5               Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 08m
      🟩 12.6               Pass: 100%/31  | Total:  1d 01h | Avg: 49m 50s | Max:  1h 44m | Hits:  40%/2646  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 46m | Avg: 53m 17s | Max: 53m 58s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 42m | Avg: 56m 34s | Max:  1h 06m | Hits:  72%/882   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 08m
      🟩 nvcc12.6           Pass: 100%/29  | Total: 23h 58m | Avg: 49m 36s | Max:  1h 44m | Hits:  40%/2646  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 46m | Avg: 53m 17s | Max: 53m 58s
      🟩 nvcc               Pass: 100%/36  | Total:  1d 06h | Avg: 51m 34s | Max:  1h 44m | Hits:  48%/3528  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 34m | Avg: 53m 38s | Max: 54m 28s
      🟩 Clang15            Pass: 100%/1   | Total: 50m 59s | Avg: 50m 59s | Max: 50m 59s
      🟩 Clang16            Pass: 100%/1   | Total: 54m 35s | Avg: 54m 35s | Max: 54m 35s
      🟩 Clang17            Pass: 100%/1   | Total: 52m 33s | Avg: 52m 33s | Max: 52m 33s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 11m | Avg: 44m 25s | Max: 53m 58s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 40m | Avg: 50m 17s | Max: 50m 54s
      🟩 GCC8               Pass: 100%/1   | Total: 51m 12s | Avg: 51m 12s | Max: 51m 12s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 50m | Avg: 55m 05s | Max: 57m 50s
      🟩 GCC10              Pass: 100%/1   | Total: 52m 40s | Avg: 52m 40s | Max: 52m 40s
      🟩 GCC11              Pass: 100%/1   | Total: 55m 24s | Avg: 55m 24s | Max: 55m 24s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 40m | Avg: 33m 21s | Max: 54m 59s
      🟩 GCC13              Pass: 100%/8   | Total:  6h 39m | Avg: 49m 54s | Max:  1h 44m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 10m | Hits:  56%/1764  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  40%/1764  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 08m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total: 11h 23m | Avg: 48m 50s | Max: 54m 35s
      🟩 GCC                Pass: 100%/18  | Total: 14h 29m | Avg: 48m 17s | Max:  1h 44m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 35m | Avg:  1h 08m | Max:  1h 11m | Hits:  48%/3528  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 08m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 45m 06s | Avg: 22m 33s | Max: 25m 42s
      🟩 v100               Pass: 100%/36  | Total:  1d 07h | Avg: 53m 16s | Max:  1h 44m | Hits:  48%/3528  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  1d 04h | Avg: 54m 12s | Max:  1h 11m | Hits:  48%/3528  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 56m 39s | Avg: 56m 39s | Max: 56m 39s
      🟩 GraphCapture       Pass: 100%/1   | Total:  1h 44m | Avg:  1h 44m | Max:  1h 44m
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 12m | Avg: 24m 00s | Max: 29m 58s
      🟩 TestGPU            Pass: 100%/2   | Total: 49m 52s | Avg: 24m 56s | Max: 26m 50s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 45m 06s | Avg: 22m 33s | Max: 25m 42s
      🟩 90a                Pass: 100%/1   | Total: 21m 34s | Avg: 21m 34s | Max: 21m 34s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total: 13h 27m | Avg: 57m 39s | Max:  1h 11m | Hits:  51%/2646  
      🟩 20                 Pass: 100%/24  | Total: 19h 16m | Avg: 48m 10s | Max:  1h 44m | Hits:  39%/882   
    
  • 🟩 libcudacxx: Pass: 100%/37 | Total: 9h 15m | Avg: 15m 01s | Max: 30m 52s | Hits: 653%/10061

    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total:  8h 34m | Avg: 14m 41s | Max: 30m 52s | Hits: 653%/10061 
      🟩 arm64              Pass: 100%/2   | Total: 41m 29s | Avg: 20m 44s | Max: 21m 18s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 09m | Avg: 13m 48s | Max: 25m 12s | Hits: 687%/2470  
      🟩 12.5               Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max:  8m 40s
      🟩 12.6               Pass: 100%/30  | Total:  7h 49m | Avg: 15m 38s | Max: 30m 52s | Hits: 642%/7591  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 09m | Avg: 17m 27s | Max: 22m 43s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 09m | Avg: 13m 48s | Max: 25m 12s | Hits: 687%/2470  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max:  8m 40s
      🟩 nvcc12.6           Pass: 100%/26  | Total:  6h 39m | Avg: 15m 22s | Max: 30m 52s | Hits: 642%/7591  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 09m | Avg: 17m 27s | Max: 22m 43s
      🟩 nvcc               Pass: 100%/33  | Total:  8h 05m | Avg: 14m 43s | Max: 30m 52s | Hits: 653%/10061 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 30m 32s | Avg:  7m 38s | Max: 17m 45s
      🟩 Clang15            Pass: 100%/1   | Total: 19m 19s | Avg: 19m 19s | Max: 19m 19s
      🟩 Clang16            Pass: 100%/1   | Total: 16m 57s | Avg: 16m 57s | Max: 16m 57s
      🟩 Clang17            Pass: 100%/1   | Total: 21m 27s | Avg: 21m 27s | Max: 21m 27s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 12m | Avg: 16m 35s | Max: 22m 43s
      🟩 GCC7               Pass: 100%/2   | Total: 22m 15s | Avg: 11m 07s | Max: 18m 28s
      🟩 GCC8               Pass: 100%/1   | Total: 20m 46s | Avg: 20m 46s | Max: 20m 46s
      🟩 GCC9               Pass: 100%/2   | Total:  7m 47s | Avg:  3m 53s | Max:  4m 01s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 58s | Avg:  3m 58s | Max:  3m 58s
      🟩 GCC11              Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 38s | Avg:  4m 38s | Max:  4m 38s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 20m | Avg: 17m 35s | Max: 25m 07s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 53m 49s | Avg: 26m 54s | Max: 28m 37s | Hits: 654%/4950  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 59m 25s | Avg: 29m 42s | Max: 30m 52s | Hits: 652%/5111  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max:  8m 40s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/15  | Total:  3h 41m | Avg: 14m 44s | Max: 22m 43s
      🟩 GCC                Pass: 100%/16  | Total:  3h 24m | Avg: 12m 45s | Max: 25m 07s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 53m | Avg: 28m 18s | Max: 30m 52s | Hits: 653%/10061 
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max:  8m 40s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total:  9h 15m | Avg: 15m 01s | Max: 30m 52s | Hits: 653%/10061 
    🟩 jobs
      🟩 Build              Pass: 100%/32  | Total:  7h 49m | Avg: 14m 41s | Max: 30m 52s | Hits: 653%/10061 
      🟩 NVRTC              Pass: 100%/2   | Total: 47m 38s | Avg: 23m 49s | Max: 23m 50s
      🟩 Test               Pass: 100%/2   | Total: 36m 04s | Avg: 18m 02s | Max: 19m 14s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 57s | Avg:  1m 57s | Max:  1m 57s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 39s | Avg: 13m 39s | Max: 13m 39s
      🟩 90a                Pass: 100%/2   | Total: 16m 39s | Avg:  8m 19s | Max: 12m 59s
    🟩 std
      🟩 17                 Pass: 100%/15  | Total:  3h 57m | Avg: 15m 50s | Max: 28m 37s | Hits: 665%/7430  
      🟩 20                 Pass: 100%/21  | Total:  5h 16m | Avg: 15m 02s | Max: 30m 52s | Hits: 619%/2631  
    
  • 🟩 thrust: Pass: 100%/37 | Total: 21h 41m | Avg: 35m 10s | Max: 1h 20m | Hits: 156%/9180

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 43m 13s | Avg: 21m 36s | Max: 28m 52s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total: 20h 43m | Avg: 35m 32s | Max:  1h 20m | Hits: 156%/9180  
      🟩 arm64              Pass: 100%/2   | Total: 57m 46s | Avg: 28m 53s | Max: 30m 14s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 07m | Avg: 37m 26s | Max:  1h 05m | Hits:  97%/1836  
      🟩 12.5               Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m
      🟩 12.6               Pass: 100%/30  | Total: 16h 18m | Avg: 32m 37s | Max:  1h 20m | Hits: 171%/7344  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 59m 39s | Avg: 29m 49s | Max: 30m 37s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 07m | Avg: 37m 26s | Max:  1h 05m | Hits:  97%/1836  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m
      🟩 nvcc12.6           Pass: 100%/28  | Total: 15h 19m | Avg: 32m 49s | Max:  1h 20m | Hits: 171%/7344  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 59m 39s | Avg: 29m 49s | Max: 30m 37s
      🟩 nvcc               Pass: 100%/35  | Total: 20h 41m | Avg: 35m 28s | Max:  1h 20m | Hits: 156%/9180  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 01m | Avg: 30m 17s | Max: 31m 04s
      🟩 Clang15            Pass: 100%/1   | Total: 34m 09s | Avg: 34m 09s | Max: 34m 09s
      🟩 Clang16            Pass: 100%/1   | Total: 33m 31s | Avg: 33m 31s | Max: 33m 31s
      🟩 Clang17            Pass: 100%/1   | Total: 34m 07s | Avg: 34m 07s | Max: 34m 07s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 57m | Avg: 25m 18s | Max: 32m 14s
      🟩 GCC7               Pass: 100%/2   | Total: 57m 32s | Avg: 28m 46s | Max: 29m 07s
      🟩 GCC8               Pass: 100%/1   | Total: 31m 12s | Avg: 31m 12s | Max: 31m 12s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 05m | Avg: 32m 38s | Max: 33m 12s
      🟩 GCC10              Pass: 100%/1   | Total: 31m 51s | Avg: 31m 51s | Max: 31m 51s
      🟩 GCC11              Pass: 100%/1   | Total: 38m 59s | Avg: 38m 59s | Max: 38m 59s
      🟩 GCC12              Pass: 100%/1   | Total: 37m 05s | Avg: 37m 05s | Max: 37m 05s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 02m | Avg: 22m 51s | Max: 33m 29s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 07m | Hits: 111%/3672  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  3h 08m | Avg:  1h 02m | Max:  1h 20m | Hits: 187%/5508  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  6h 40m | Avg: 28m 34s | Max: 34m 09s
      🟩 GCC                Pass: 100%/16  | Total:  7h 24m | Avg: 27m 48s | Max: 38m 59s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 21m | Avg:  1h 04m | Max:  1h 20m | Hits: 156%/9180  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total: 21h 41m | Avg: 35m 10s | Max:  1h 20m | Hits: 156%/9180  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total: 19h 55m | Avg: 38m 33s | Max:  1h 20m | Hits: 104%/7344  
      🟩 TestCPU            Pass: 100%/3   | Total: 50m 46s | Avg: 16m 55s | Max: 35m 03s | Hits: 365%/1836  
      🟩 TestGPU            Pass: 100%/3   | Total: 55m 37s | Avg: 18m 32s | Max: 22m 33s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 17m 09s | Avg: 17m 09s | Max: 17m 09s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  9h 38m | Avg: 41m 17s | Max:  1h 13m | Hits: 107%/5508  
      🟩 20                 Pass: 100%/21  | Total: 11h 20m | Avg: 32m 23s | Max:  1h 20m | Hits: 231%/3672  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 1h 55m | Avg: 5m 46s | Max: 18m 54s | Hits: 383%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 45m | Avg:  6m 34s | Max: 18m 54s | Hits: 383%/522   
      🟩 arm64              Pass: 100%/4   | Total: 10m 30s | Avg:  2m 37s | Max:  2m 42s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits: 383%/261   
      🟩 12.5               Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 34s
      🟩 12.6               Pass: 100%/17  | Total:  1h 32m | Avg:  5m 28s | Max: 18m 54s | Hits: 383%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits: 383%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 34s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 32m | Avg:  5m 28s | Max: 18m 54s | Hits: 383%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  1h 55m | Avg:  5m 46s | Max: 18m 54s | Hits: 383%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 10s | Avg:  3m 10s | Max:  3m 10s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s
      🟩 Clang18            Pass: 100%/4   | Total: 26m 47s | Avg:  6m 41s | Max: 18m 14s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 19s | Avg:  3m 19s | Max:  3m 19s
      🟩 GCC12              Pass: 100%/2   | Total: 22m 16s | Avg: 11m 08s | Max: 18m 54s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 51s | Avg:  2m 42s | Max:  3m 00s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits: 383%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 01s | Avg: 13m 01s | Max: 13m 01s | Hits: 383%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 34s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 40m 12s | Avg:  5m 01s | Max: 18m 14s
      🟩 GCC                Pass: 100%/8   | Total: 39m 44s | Avg:  4m 58s | Max: 18m 54s
      🟩 MSVC               Pass: 100%/2   | Total: 24m 38s | Avg: 12m 19s | Max: 13m 01s | Hits: 383%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 34s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  1h 55m | Avg:  5m 46s | Max: 18m 54s | Hits: 383%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 18m | Avg:  4m 21s | Max: 13m 01s | Hits: 383%/522   
      🟩 Test               Pass: 100%/2   | Total: 37m 08s | Avg: 18m 34s | Max: 18m 54s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 40s | Avg:  2m 40s | Max:  2m 40s
      🟩 90a                Pass: 100%/1   | Total:  3m 00s | Avg:  3m 00s | Max:  3m 00s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 13m 28s | Avg:  3m 22s | Max:  5m 34s
      🟩 20                 Pass: 100%/16  | Total:  1h 42m | Avg:  6m 23s | Max: 18m 54s | Hits: 383%/522   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 04s | Avg: 5m 02s | Max: 8m 08s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 56s | Avg:  1m 56s | Max:  1m 56s
      🟩 Test               Pass: 100%/1   | Total:  8m 08s | Avg:  8m 08s | Max:  8m 08s
    
  • 🟩 python: Pass: 100%/1 | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 135)

# Runner
92 linux-amd64-cpu16
17 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@fbusato fbusato merged commit 0b5a5d8 into NVIDIA:main Jan 23, 2025
147 of 150 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
3.0 Targeted for 3.0 release
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

[FEA]: ceil_div should return the resulting type of its operation
2 participants