gpu-fp32
ed2bca20 · GPU: use fp32 for non-linear terms. Observed speed-up is about 1.8x · Jun 30, 2023