Hi,
I have a question about the performance difference between host_memory_bandwidth and host_memory_bandwidth_xrt.
They appear to use the same kernel and similar host code, but the reported performance results are quite different.
For example, with a 32.00 KB buffer, the OpenCL host example reports:
Concurrent Read and Write Throughput = 12.1802 (GB/sec) for buffer size 32.00 KB
Read Throughput = 8.17289 (GB/sec) for buffer size 32.00 KB
Write Throughput = 8.26587 (GB/sec) for buffer size 32.00 KB
while the XRT host example reports:
Concurrent Read and Write Throughput = 12.4713 (GB/sec) for buffer size 32.00 KB
Read Throughput = 0.00397313 (GB/sec) for buffer size 32.00 KB
Write Throughput = 0.0040426 (GB/sec) for buffer size 32.00 KB
What is the main factor contributing to this difference? Does the choice between the XRT and OpenCL host APIs affect performance this significantly?
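To put a number on the gap, the quoted figures can be compared directly. The following is a minimal Python sketch, not part of the repository: the dictionary names and the `slowdown` helper are made up for illustration, and the values are simply copied from the results above. It only quantifies the difference and says nothing about its cause.

```python
# Throughput figures (GB/sec) quoted above for a 32.00 KB buffer.
# Names below are illustrative; they do not come from the examples' host code.
opencl = {"concurrent": 12.1802, "read": 8.17289, "write": 8.26587}
xrt = {"concurrent": 12.4713, "read": 0.00397313, "write": 0.0040426}


def slowdown(metric):
    """Return how many times lower the XRT figure is than the OpenCL figure."""
    return opencl[metric] / xrt[metric]


for metric in opencl:
    print(f"{metric:>10}: OpenCL/XRT = {slowdown(metric):.2f}x")
```

The concurrent figures agree to within a few percent, while the separate read and write figures differ by a factor of roughly 2000, which is why the two sets of results look inconsistent with each other.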
Hi @anonymous1782,
Can you please share the Vitis and XRT versions you are using, along with the platform being targeted?
Thanks, Vishnu
Hi @vishnuchebrolu,
The results shown in my post are not from my own runs. Both come from the README.md files in this repository (OpenCL, XRT).
For reference, I'm using Vitis v2021.2 and XRT v2.12.427.
Thanks,