Releases: rapidsai/rmm
v24.04.00
🚨 Breaking Changes
- Accept stream argument in DeviceMemoryResource allocate/deallocate (#1494) @wence-
- Replace all internal usage of `get_upstream` with `get_upstream_resource` (#1491) @miscco
- Deprecate rmm::mr::device_memory_resource::supports_streams() (#1452) @harrism
- Remove deprecated rmm::detail::available_device_memory (#1438) @harrism
- Make device_memory_resource::supports_streams() not pure virtual. Remove derived implementations and calls in RMM (#1437) @harrism
- Deprecate rmm::mr::device_memory_resource::get_mem_info() and supports_get_mem_info(). (#1436) @harrism
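Making `supports_streams()` no longer pure virtual (#1437) means derived resources are not forced to override it. The sketch below illustrates that general C++ pattern with hypothetical stand-in classes (`resource_base`, `simple_resource`, `stream_resource`); it is not RMM's actual code, and the default return value of `false` is an assumption made only for this illustration.

```cpp
// Hypothetical stand-in for a memory-resource base class (not RMM's real type).
class resource_base {
 public:
  virtual ~resource_base() = default;

  // A pure virtual declaration (`= 0`) would force every derived class to
  // implement this query. With a default body, overriding becomes optional.
  // The default of `false` is assumed here for illustration only.
  [[nodiscard]] virtual bool supports_streams() const noexcept { return false; }
};

// A derived class that simply inherits the default behaviour.
class simple_resource final : public resource_base {};

// A derived class that still chooses to override the query.
class stream_resource final : public resource_base {
 public:
  [[nodiscard]] bool supports_streams() const noexcept override { return true; }
};
```

Under these assumptions, `simple_resource{}.supports_streams()` yields the base-class default, while `stream_resource{}.supports_streams()` yields the overridden value.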
🐛 Bug Fixes
- Fix search path for torch allocator in editable installs and ensure CUDA support is available (#1498) @vyasr
- Accept stream argument in DeviceMemoryResource allocate/deallocate (#1494) @wence-
- Run STATISTICS_TEST and TRACKING_TEST in serial to avoid OOM errors. (#1487) @bdice
📖 Documentation
🚀 New Features
- Replace all internal usage of `get_upstream` with `get_upstream_resource` (#1491) @miscco
- Add complete set of resource ref aliases (#1479) @nvdbaranec
- Automate include grouping using clang-format (#1463) @harrism
- Add `get_upstream_resource` to resource adaptors (#1456) @miscco
- Deprecate rmm::mr::device_memory_resource::supports_streams() (#1452) @harrism
- Remove duplicated memory_resource_tests (#1451) @miscco
- Change `rmm::exec_policy` to take `async_resource_ref` (#1449) @miscco
- Change `device_scalar` to take `async_resource_ref` (#1447) @miscco
- Add device_async_resource_ref convenience alias (#1441) @harrism
- Remove deprecated rmm::detail::available_device_memory (#1438) @harrism
- Make device_memory_resource::supports_streams() not pure virtual. Remove derived implementations and calls in RMM (#1437) @harrism
- Deprecate rmm::mr::device_memory_resource::get_mem_info() and supports_get_mem_info(). (#1436) @harrism
- Support CUDA 12.2 (#1419) @jameslamb
🛠️ Improvements
- Use `conda env create --yes` instead of `--force` (#1509) @bdice
- Add upper bound to prevent usage of NumPy 2 (#1501) @bdice
- Remove hard-coding of RAPIDS version where possible (#1496) @KyleFromNVIDIA
- Require NumPy 1.23+ (#1488) @jakirkham
- Use `rmm::device_async_resource_ref` in multi_stream_allocation benchmark (#1482) @miscco
- Update devcontainers to CUDA Toolkit 12.2 (#1470) @trxcllnt
- Add support for Python 3.11 (#1469) @jameslamb
- target branch-24.04 for GitHub Actions workflows (#1468) @jameslamb
- [FEA]: Use `std::optional` instead of `thrust::optional` (#1464) @miscco
- Add environment-agnostic scripts for running ctests and pytests (#1462) @trxcllnt
- Ensure that `ctest` is called with `--no-tests=error`. (#1460) @bdice
- Update ops-bot.yaml (#1458) @AyodeAwe
- Adopt the `rmm::device_async_resource_ref` alias (#1454) @miscco
- Refactor error.hpp out of detail (#1439) @lamarrr
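As a small illustration of the `std::optional` idiom referenced in #1464, the sketch below uses a hypothetical helper, `effective_max_pool_size`, that treats an absent value as "no limit". The function name and its semantics are assumptions chosen for the example and are not part of RMM's API.

```cpp
#include <cstddef>
#include <limits>
#include <optional>

// Hypothetical helper (not an RMM function): returns the configured limit,
// or the largest representable size when no limit was supplied.
inline std::size_t effective_max_pool_size(std::optional<std::size_t> maximum_pool_size)
{
  return maximum_pool_size.value_or(std::numeric_limits<std::size_t>::max());
}
```

A caller can pass `std::nullopt` to request the unbounded behaviour, or a concrete `std::size_t` to cap the size.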
v24.02.00
🚨 Breaking Changes
- Make device_memory_resource::do_get_mem_info() and supports_get_mem_info() not pure virtual. Remove derived implementations and calls in RMM (#1430) @harrism
- Deprecate detail::available_device_memory, most detail/aligned.hpp utilities, and optional pool_memory_resource initial size (#1424) @harrism
- Require explicit pool size in `pool_memory_resource` and move some things out of detail namespace (#1417) @harrism
- Remove HTML builds of librmm (#1415) @vyasr
- Update to CCCL 2.2.0. (#1404) @bdice
- Switch to scikit-build-core (#1287) @vyasr
🐛 Bug Fixes
- Exclude tests from builds (#1459) @vyasr
- Update CODEOWNERS (#1410) @raydouglass
- Correct signatures for torch allocator plug in (#1407) @wence-
- Fix Arena MR to support simultaneous access by PTDS and other streams (#1395) @tgravescs
- Fix else-after-throw clang tidy error (#1391) @harrism
📖 Documentation
- remove references to setup.py in docs (#1420) @jameslamb
- Remove HTML builds of librmm (#1415) @vyasr
- Update GPU support docs to drop Pascal (#1413) @harrism
🚀 New Features
- Make device_memory_resource::do_get_mem_info() and supports_get_mem_info() not pure virtual. Remove derived implementations and calls in RMM (#1430) @harrism
- Deprecate detail::available_device_memory, most detail/aligned.hpp utilities, and optional pool_memory_resource initial size (#1424) @harrism
- Add a host-pinned memory resource that can be used as upstream for `pool_memory_resource`. (#1392) @harrism
🛠️ Improvements
- Remove usages of rapids-env-update (#1423) @KyleFromNVIDIA
- Refactor CUDA versions in dependencies.yaml. (#1422) @bdice
- Require explicit pool size in `pool_memory_resource` and move some things out of detail namespace (#1417) @harrism
- Update dependencies.yaml to support CUDA 12.*. (#1414) @bdice
- Define python dependency range as a matrix fallback. (#1409) @bdice
- Use latest cuda-python within CUDA major version. (#1406) @bdice
- Update to CCCL 2.2.0. (#1404) @bdice
- Remove RMM_BUILD_WHEELS and standardize Python builds (#1401) @vyasr
- Update to fmt 10.1.1 and spdlog 1.12.0. (#1374) @bdice
- Switch to scikit-build-core (#1287) @vyasr
v23.12.00
🚨 Breaking Changes
- Document minimum CUDA version of 11.4 (#1385) @harrism
- Store and set the correct CUDA device in device_buffer (#1370) @harrism
- Use `cuda::mr::memory_resource` instead of raw `device_memory_resource` (#1095) @miscco
🐛 Bug Fixes
- Update actions/labeler to v4 (#1397) @raydouglass
- Backport arena MR fix for simultaneous access by PTDS and other streams (#1396) @bdice
- Deliberately leak PTDS thread_local events in stream ordered mr (#1375) @wence-
- Add missing CUDA 12 dependencies and fix dlopen library names (#1366) @vyasr
📖 Documentation
- Document minimum CUDA version of 11.4 (#1385) @harrism
- Fix more doxygen issues (#1367) @vyasr
- Add groups to the doxygen docs (#1358) @vyasr
- Enable doxygen XML and fix issues (#1348) @vyasr
🚀 New Features
- Make internally stored default argument values public (#1373) @vyasr
- Store and set the correct CUDA device in device_buffer (#1370) @harrism
- Update rapids-cmake functions to non-deprecated signatures (#1357) @robertmaynard
- Generate unified Python/C++ docs (#1324) @vyasr
- Use `cuda::mr::memory_resource` instead of raw `device_memory_resource` (#1095) @miscco
🛠️ Improvements
- Silence false gcc warning (#1381) @miscco
- Build concurrency for nightly and merge triggers (#1380) @bdice
- Update `shared-action-workflows` references (#1363) @AyodeAwe
- Use branch-23.12 workflows. (#1360) @bdice
- Update devcontainers to 23.12 (#1355) @raydouglass
- Generate proper, consistent nightly versions for pip and conda packages (#1347) @vyasr
- RMM: Build CUDA 12.0 ARM conda packages. (#1330) @bdice
v23.10.00
🚨 Breaking Changes
🐛 Bug Fixes
- Compile cdef public functions from torch_allocator with C ABI (#1350) @wence-
- Make doxygen only a conda dependency. (#1344) @bdice
- Use `conda mambabuild` not `mamba mambabuild` (#1338) @wence-
- Fix stream_ordered_memory_resource attempt to record event in stream from another device (#1333) @harrism
📖 Documentation
- Clean up headers in CMakeLists.txt. (#1341) @bdice
- Add pre-commit hook to validate doxygen (#1334) @vyasr
- Fix doxygen warnings (#1317) @vyasr
- Treat warnings as errors in Python documentation (#1316) @vyasr
🚀 New Features
🛠️ Improvements
- Update image names (#1346) @AyodeAwe
- Update to clang 16.0.6. (#1343) @bdice
- Update doxygen to 1.9.1 (#1337) @vyasr
- Simplify wheel build scripts and allow alphas of RAPIDS dependencies (#1335) @divyegala
- Use `copy-pr-bot` (#1329) @ajschmidt8
- Add RMM devcontainers (#1328) @trxcllnt
- Add Python bindings for `limiting_resource_adaptor` (#1327) @pentschev
- Fix missing jQuery error in docs (#1321) @AyodeAwe
- Use fetch_rapids.cmake. (#1319) @bdice
- Update to Cython 3.0.0 (#1313) @vyasr
- Branch 23.10 merge 23.08 (#1312) @vyasr
- Branch 23.10 merge 23.08 (#1309) @vyasr
v23.08.00
🚨 Breaking Changes
- Stop invoking setup.py (#1300) @vyasr
- Remove now-deprecated top-level allocator functions (#1281) @wence-
- Remove padding from device_memory_resource (#1278) @vyasr
🐛 Bug Fixes
- Fix typo in wheels-test.yaml. (#1310) @bdice
- Add a missing '#include <array>' in logger.hpp (#1295) @valgur
- Use gbench `thread_index()` accessor to fix replay bench compilation (#1293) @harrism
- Ensure logger tests don't generate temp directories in build dir (#1289) @robertmaynard
🚀 New Features
🛠️ Improvements
- Switch to new CI wheel building pipeline (#1305) @vyasr
- Revert CUDA 12.0 CI workflows to branch-23.08. (#1303) @bdice
- Update linters: remove flake8, add ruff, update cython-lint (#1302) @vyasr
- Adding identify minimum version requirement (#1301) @hyperbolic2346
- Stop invoking setup.py (#1300) @vyasr
- Use cuda-version to constrain cudatoolkit. (#1296) @bdice
- Update to CMake 3.26.4 (#1291) @vyasr
- use rapids-upload-docs script (#1288) @AyodeAwe
- Reorder parameters in RMM_EXPECTS (#1286) @vyasr
- Remove documentation build scripts for Jenkins (#1285) @ajschmidt8
- Remove padding from device_memory_resource (#1278) @vyasr
- Unpin scikit-build upper bound (#1275) @vyasr
- RMM: Build CUDA 12 packages (#1223) @bdice
v23.06.00
🚨 Breaking Changes
🐛 Bug Fixes
- Ensure Logger tests aren't run in parallel (#1277) @robertmaynard
- Pin to scikit-build<0.17.2. (#1262) @bdice
🛠️ Improvements
- Require Numba 0.57.0+ & NumPy 1.21.0+ (#1279) @jakirkham
- Align test_cpp.sh with conventions in other RAPIDS repos. (#1269) @bdice
- Switch back to using primary shared-action-workflows branch (#1268) @vyasr
- Update recipes to GTest version >=1.13.0 (#1263) @bdice
- Support CUDA 12.0 for pip wheels (#1259) @bdice
- Add build vars (#1258) @AyodeAwe
- Enable sccache hits from local builds (#1257) @AyodeAwe
- Revert to branch-23.06 for shared-action-workflows (#1256) @shwina
- run docs builds nightly too (#1255) @AyodeAwe
- Build wheels using new single image workflow (#1254) @vyasr
- Update minimum Python version to Python 3.9 (#1252) @shwina
- Remove usage of rapids-get-rapids-version-from-git (#1251) @jjacobelli
- Remove wheel pytest verbosity (#1249) @sevagh
- Update clang-format to 16.0.1. (#1246) @bdice
- Remove uses-setup-env-vars (#1242) @vyasr
- Move RMM_LOGGING_ASSERT into separate header (#1241) @ahendriksen
- Use ARC V2 self-hosted runners for GPU jobs (#1239) @jjacobelli
v23.04.00
🐛 Bug Fixes
- Remove MANIFEST.in use auto-generated one for sdists and package_data for wheels (#1233) @vyasr
- Fix update-version.sh. (#1227) @vyasr
- Specify include_package_data to setup (#1218) @vyasr
- Revert changes overriding rapids-cmake repo. (#1209) @bdice
- Synchronize stream in `DeviceBuffer.c_from_unique_ptr` constructor (#1100) @shwina
🚀 New Features
- Use rapids-cmake parallel testing feature (#1183) @robertmaynard
🛠️ Improvements
- Stop setting package version attribute in wheels (#1236) @vyasr
- Add codespell as a linter (#1231) @bdice
- Pass `AWS_SESSION_TOKEN` and `SCCACHE_S3_USE_SSL` vars to conda build (#1230) @ajschmidt8
- Update to GCC 11 (#1228) @bdice
- Fix some minor oversights in the conversion to pyproject.toml (#1226) @vyasr
- Remove pickle compatibility layer in tests for Python < 3.8. (#1224) @bdice
- Move external allocators into rmm.allocators module to defer imports (#1221) @wence-
- Generate pyproject.toml dependencies using dfg (#1219) @vyasr
- Run rapids-dependency-file-generator via pre-commit (#1217) @vyasr
- Skip docs job in nightly runs (#1215) @AyodeAwe
- CI: Remove specification of manual stage for check_style.sh script. (#1214) @csadorf
- Use script rather than environment variable to modify package names (#1212) @vyasr
- Reduce error handling verbosity in CI tests scripts (#1204) @AjayThorve
- Update shared workflow branches (#1203) @ajschmidt8
- Use date in build string instead of in the version. (#1195) @bdice
- Stop using versioneer to manage versions (#1190) @vyasr
- Update to spdlog>=1.11.0, fmt>=9.1.0. (#1177) @bdice
- Migrate as much as possible to `pyproject.toml` (#1151) @jakirkham
v23.02.00
🐛 Bug Fixes
- pre-commit: Update isort version to 5.12.0 (#1197) @wence-
- Revert "Upgrade to spdlog 1.10 (#1173)" (#1176) @bdice
- Ensure `UpstreamResourceAdaptor` is not cleared by the Python GC (#1170) @shwina
📖 Documentation
🚀 New Features
🛠️ Improvements
- Update shared workflow branches (#1201) @ajschmidt8
- Fix update-version.sh (#1199) @raydouglass
- Use CTK 118/cp310 branch of wheel workflows (#1193) @sevagh
- Update `build.yaml` workflow to reduce verbosity (#1192) @AyodeAwe
- Fix `build.yaml` workflow (#1191) @ajschmidt8
- add docs_build step (#1189) @AyodeAwe
- Upkeep/wheel param cleanup (#1187) @sevagh
- Update workflows for nightly tests (#1186) @ajschmidt8
- Build CUDA `11.8` and Python `3.10` Packages (#1184) @ajschmidt8
- Build wheels alongside conda CI (#1182) @sevagh
- Update conda recipes. (#1180) @bdice
- Update PR Workflow (#1174) @ajschmidt8
- Upgrade to spdlog 1.10 (#1173) @kkraus14
- Enable `codecov` (#1171) @ajschmidt8
- Add support for Python 3.10. (#1166) @bdice
- Update pre-commit hooks (#1154) @bdice
v22.10.00
🐛 Bug Fixes
- Ensure consistent spdlog dependency target no matter the source (#1101) @robertmaynard
- Remove cuda event deadlocking issues in device mr tests (#1097) @robertmaynard
- Propagate exceptions raised in Python callback functions (#1096) @madsbk
- Avoid unused parameter warnings in do_get_mem_info (#1084) @fkallen
- Use rapids-cmake 22.10 best practice for RAPIDS.cmake location (#1083) @robertmaynard
📖 Documentation
- Document that minimum required CMake version is now 3.23.1 (#1098) @robertmaynard
- Fix docs for module-level API (#1091) @bdice
- Improve DeviceBuffer docs. (#1090) @bdice
- Branch 22.10 merge 22.08 (#1089) @harrism
- Improve docs formatting and update links. (#1086) @bdice
- Add resources section to README. (#1085) @bdice
- Simplify PR template. (#1080) @bdice
🚀 New Features
- Add `gdb` pretty-printers for rmm types (#1088) @upsj
- Support using THRUST_WRAPPED_NAMESPACE (#1077) @robertmaynard
🛠️ Improvements
- GH Actions - Enforce `checks` before builds run (#1125) @ajschmidt8
- Update GH Action Workflows (#1123) @ajschmidt8
- Add `cudatoolkit` versions to `dependencies.yaml` (#1119) @ajschmidt8
- Remove `rmm` installation from `librmm` tests (#1117) @ajschmidt8
- Add GitHub Actions workflows (#1104) @Ethyling
- `build.sh`: accept `--help` (#1093) @madsbk
- Move clang dependency to conda develop packages. (#1092) @bdice
- Add device_uvector::reserve and device_buffer::reserve (#1079) @upsj
- Bifurcate Dependency Lists (#1073) @ajschmidt8
[NIGHTLY] v22.12.00
🐛 Bug Fixes
- Don't use CMake 3.25.0 as it has a show stopping FindCUDAToolkit bug (#1162) @robertmaynard
- Relax test for async memory pool IPC handle support (#1130) @bdice
📖 Documentation
- Use rapidsai CODE_OF_CONDUCT.md (#1159) @bdice
- Fix doxygen formatting for set_stream. (#1153) @bdice
- Document required Python dependencies to build from source (#1146) @ccoulombe
- fix failed automerge (Branch 22.12 merge 22.10) (#1131) @harrism
🚀 New Features
🛠️ Improvements
- Align version with wheel version (#1161) @sevagh
- Add `ninja` & Update CI environment variables (#1155) @ajschmidt8
- Remove CUDA 11.0 from dependencies.yaml. (#1152) @bdice
- Update dependencies schema. (#1147) @bdice
- Enable sccache for python build (#1145) @Ethyling
- Remove Jenkins scripts (#1143) @ajschmidt8
- Use `ninja` in GitHub Actions (#1142) @ajschmidt8
- Switch to using rapids-cmake for gbench. (#1139) @vyasr
- Remove stale labeler (#1137) @raydouglass
- Add a public `copy` API to `DeviceBuffer` (#1128) @galipremsagar
- Format gdb script. (#1127) @bdice