Commit

Update README.md
cosden authored Jul 13, 2017
1 parent e5d2094 commit ed1ac85
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
@@ -19,9 +19,9 @@ This is intended to use as a example to profile with performance tuning tools su
Short test on head node: `./mm.out 250` (250x250 matrix)
1. Run the provided script to submit a VTune wrapped job to the scheduler
`./submit_to_scheduler`
- This will, by default, run a 500x500 matrix example on a compute node using the VTune "Hotspots" analysis. When it finishes it will create a directory with the results named something like r001hs, where 001 is incremented for each new analysis.
+ This will, by default, run a 500x500 matrix example on a compute node using the VTune "Hotspots" analysis. When it finishes it will create a directory with the results named something like r000hs, where 000 is incremented by 1 for each new analysis.
1. Open the resulting directory with VTune GUI
- e.g. `amplxe-gui r001hs`
+ e.g. `amplxe-gui r000hs`
1. Explore "Bottom-up" and "Top-down Tree" and double click on hotspots to look at line-by-line performance
1. Edit the file "submit.slurm" to increase the matrix size (originally 500) and rerun the analysis. WARNING: don't go over 1500!
1. Challenge: using the knowledge gained from VTune, improve the speed of the code by changing the functions that are called. This is done by changing threshold values set in command line arguments to mm.out in submit.slurm (see more info in comments at top of matmul_test.cpp for setting thresholds).
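
Because each analysis writes to a new numbered directory (r000hs, r001hs, and so on), it can be handy to pick out the most recent one automatically. The following is a minimal sketch, not part of the repository: the function name `latest_result` is hypothetical, and it assumes the result directories sit in the current working directory and follow the three-digit `rNNNhs` pattern described above.

```shell
# Hypothetical helper (not in the repository): print the name of the
# highest-numbered VTune Hotspots result directory in the current directory.
# Assumes the rNNNhs naming described in the README (three digits).
latest_result() {
    latest=""
    # The shell expands the glob in sorted order, so the last directory
    # that matches is the one with the highest number.
    for d in r[0-9][0-9][0-9]hs; do
        # If nothing matches, the pattern stays literal and this test fails.
        [ -d "$d" ] && latest="$d"
    done
    printf '%s\n' "$latest"
}
```

With this defined, the newest results could be opened with `amplxe-gui "$(latest_result)"`. The function prints an empty line when no result directory exists yet.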