Added
-
futhark doc
now prints warnings about errors in doc comments (mostly
references to unknown names). -
FutharkScript now permits chained
let
withoutin
, just as in Futhark. -
futhark pkg
now allows the~
character in package paths. -
cuda
backend: explicitly support CC 8.9, 9.0, 10.0, 10.1, and 12.0. -
Profiling now provides source locations for the profiled events. Some things
are not yet accurately tracked, and only the static location is reported
(i.e., no full call stack). -
The AD transformation is now more diligent about propagating attributes from
SOACs in the primal code to SOACs in the differentiated code.
Fixed
-
f64
atomics on NVIDIA GPUs with less than CC 6.0 (Maxwell and older). -
Infinite loop in fusion (#2276).
-
Rare compiler crash during internalisation code that does size coercions on
opaque size-lifted types. -
Missing fusion inside reduction and scan operators (#2283).
-
Incorrect aliasing for memory blocks could cause some optimisations to be
misapplied. (#2288) -
to_bits
/from_bits
not handled by AD (#2292). -
For GPU backends, incorrect code generation for accumulator updates (produced
by AD) for thread-local arrays (#2294). -
Missing consumption check in the termination condition for
while
loops (#2300).