Multiple buffer input/output support. AMD TLB fix
- VkFFT now supports input/output data stored in multiple buffers, passed as a pointer to an array of buffers together with the number of buffers. VkFFT uses the memory consecutively, logically splitting the data into chunks of the smallest buffer size. This allows data to be split across different memory allocations and mitigates the 4GB single-allocation limit. Sample 10 shows how this works.
- VkFFT can now mitigate TLB misses on big sequences. These misses were detrimental to performance on AMD GPUs before; the fix gives up to 5x performance gains for big systems there. It works by logically splitting the input/output buffer into 16KB chunks if a sequence spans more than 2MB. Two parameters, localPageSize and devicePageSize, control these two sizes. This is also combinable with the multiple-buffer update.
- Updated performance plots.
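The multi-buffer addressing described in this commit message can be pictured with a small sketch. This is not VkFFT's code: `ChunkLocation`, `smallest_size` and `locate` are hypothetical names, and the sketch assumes that each buffer contributes only whole chunks of the smallest buffer size and that chunks are consumed in buffer order. It maps a byte position in the logical, consecutive data to a buffer index and an offset inside that buffer.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical illustration only; none of these names exist in VkFFT. */
typedef struct {
    uint64_t buffer; /* index of the buffer that holds the position */
    uint64_t offset; /* byte offset inside that buffer */
} ChunkLocation;

/* Chunk size assumed to be the size of the smallest buffer. */
static uint64_t smallest_size(const uint64_t *sizes, uint64_t count) {
    assert(count > 0);
    uint64_t smallest = sizes[0];
    for (uint64_t i = 1; i < count; i++) {
        if (sizes[i] < smallest) smallest = sizes[i];
    }
    return smallest;
}

/* Map a logical byte position to (buffer, offset).
 * Returns 0 on success, -1 if the position is past the usable capacity. */
static int locate(const uint64_t *sizes, uint64_t count, uint64_t pos,
                  ChunkLocation *out) {
    uint64_t chunk = smallest_size(sizes, count);
    if (chunk == 0) return -1;           /* an empty buffer leaves no usable chunk */
    uint64_t chunk_index = pos / chunk;  /* which logical chunk */
    uint64_t within = pos % chunk;       /* position inside that chunk */
    for (uint64_t b = 0; b < count; b++) {
        /* Assumption: only whole chunks of a buffer are used. */
        uint64_t chunks_here = sizes[b] / chunk;
        if (chunk_index < chunks_here) {
            out->buffer = b;
            out->offset = chunk_index * chunk + within;
            return 0;
        }
        chunk_index -= chunks_here;      /* skip past this buffer's chunks */
    }
    return -1;
}
```

With buffer sizes of 8, 4 and 12 bytes, the chunk size is 4, so logical position 8 lands at offset 0 of the second buffer and position 12 at offset 0 of the third. The real library resolves this mapping inside its generated shaders; Sample 10 in the repository is the authoritative reference.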
Showing 240 changed files with 14,758 additions and 7,641 deletions.