Report HIP occupancy-driven grid sizes in Stream-K CkProfiler output #2340

Draft: wants to merge 6 commits into base branch develop

Conversation

ozturkosu
Contributor

Proposed changes

This update adds grid dimension tracking to Stream-K profiling in the CK Profiler. The grid dimensions, which are determined by hipOccupancyMaxActiveBlocksPerMultiprocessor, are now explicitly tracked and stored during kernel execution. This makes Stream-K operations easier to debug, profile, and analyze at runtime.
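
For readers unfamiliar with occupancy-driven launches, here is a minimal sketch of how such a grid size could be derived and recorded. It is not the code in this PR. It assumes the grid is sized as (active blocks per compute unit) x (number of compute units), which is a common pattern for persistent Stream-K kernels but is not confirmed by the description above. `StreamKGridInfo`, `make_grid_info`, and `format_grid_info` are hypothetical names. In a real HIP program the two inputs would come from `hipOccupancyMaxActiveBlocksPerMultiprocessor` and `hipDeviceProp_t::multiProcessorCount`; here they are plain integers so the sketch compiles without a GPU.

```cpp
#include <cassert>
#include <cstdint>
#include <sstream>
#include <string>

// Hypothetical record of the launch geometry a profiler could store and print.
struct StreamKGridInfo {
    int blocks_per_cu;        // would come from hipOccupancyMaxActiveBlocksPerMultiprocessor
    int num_cus;              // would come from hipDeviceProp_t::multiProcessorCount
    std::uint32_t grid_size;  // number of workgroups launched
};

// Assumption: the grid is sized so that every compute unit is filled exactly
// to the occupancy limit reported by the runtime.
inline StreamKGridInfo make_grid_info(int blocks_per_cu, int num_cus) {
    assert(blocks_per_cu >= 0 && num_cus > 0);
    const std::uint32_t grid = static_cast<std::uint32_t>(blocks_per_cu) *
                               static_cast<std::uint32_t>(num_cus);
    return StreamKGridInfo{blocks_per_cu, num_cus, grid};
}

// Formats the stored geometry as one line of profiler output.
inline std::string format_grid_info(const StreamKGridInfo& g) {
    std::ostringstream os;
    os << "grid_size = " << g.grid_size << " (" << g.blocks_per_cu
       << " blocks/CU x " << g.num_cus << " CUs)";
    return os.str();
}
```

Storing the two factors alongside the product keeps the reported number interpretable: a change in grid size can be traced to either a different occupancy result or a different device.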

Checklist

Please put an x into the boxes that apply. You can also fill these out after creating the PR. If you're not sure, please don't hesitate to ask.

  • I have run clang-format on all changed files
  • Any dependent changes have been merged

Discussion

If this is a relatively large or complex change, feel free to start a discussion by explaining why you chose the solution you did and what alternatives you considered.

@ozturkosu changed the title from "Add Grid Dimension (Defined by HipOccupancy) Tracking For Stream-K CkProfiler" to "Enhance Stream-K CK Profiler with Grid Dimension (Defined by HipOccupancy) Tracking" on Jun 13, 2025
@ozturkosu self-assigned this on Jun 13, 2025
@ozturkosu changed the title from "Enhance Stream-K CK Profiler with Grid Dimension (Defined by HipOccupancy) Tracking" to "Report HIP occupancy-driven grid sizes in Stream-K profiler output" on Jun 13, 2025
@ozturkosu changed the title from "Report HIP occupancy-driven grid sizes in Stream-K profiler output" to "Report HIP occupancy-driven grid sizes in Stream-K CkProfiler output" on Jun 13, 2025