v3.0.4
This is an OpenCL 3.0 release update.
Noteworthy Changes
- added enum and API tracing for many new extensions, including all known KHR and EXT extensions
- added many performance improvements to reduce overhead, especially with event profiling and chrome tracing
- added many improvements to tracing and logging
- added kernel capture and replay support for easier debugging
- added the ability to dump modified program source or options for inspection
- added support for chrome tracing flow events
- added support for chrome tracing using the Perfetto UI
- added an override for the CL_DEVICE_IL_VERSION query
- added support for querying kernel register counts
- added the ability to dump buffer and image hashes
- added the ability to demangle the names of C++ kernels
- added the ability to track transfer sizes
- added the ability to flush files after every write
- added the ability to generate report data at regular intervals
- added many new cliloader options, including an option to dump available MDAPI metrics
- added a check to verify proper permissions for MDAPI counters
- added the ability to set the MDAPI time-based sampling period and buffer size
Supported Operating Systems
- Windows
- Linux (tested Ubuntu 22.04)
- OSX
- Android (experimental)
- FreeBSD (experimental)
Supported OpenCL Versions
- Up to OpenCL 3.0
New Contributors
- @trbauer made their first contribution in #270
- @isanghao made their first contribution in #293
- @Novermars made their first contribution in #294
- @ivvenevt made their first contribution in #330
- @karolherbst made their first contribution in #348
Full Changelog: v3.0.3...v3.0.4