Sergey Sharybin 61eab743f1 Cycles: Optimization for CMJ in CUDA kernels
Two things:
- Use intrinsics for clz/ctz (ctz is implemented via ffs()).
- Use a faster sqrt() function whose precision is sufficient for
  integer values.
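
The two points above can be sketched on the host side roughly as follows. This is a minimal illustration, not the code from the commit: the helper names are hypothetical, and the GCC/Clang builtins `__builtin_clz` and `__builtin_ffs` stand in for the CUDA intrinsics `__clz()` and `__ffs()`. The square-root helper assumes that "precision is enough for integer values" means a single-precision root followed by truncation to an integer; the correction loops are an addition here to keep the sketch exact for every 32-bit input.

```cpp
#include <cmath>
#include <cstdint>

// Hypothetical helper names, chosen for illustration only.

// Count leading zero bits. __builtin_clz is undefined for 0, so that
// case is handled explicitly.
inline uint32_t count_leading_zeros(uint32_t x)
{
    return x == 0 ? 32u : static_cast<uint32_t>(__builtin_clz(x));
}

// Count trailing zero bits via ffs(): ffs() returns one plus the index
// of the lowest set bit, so ctz(x) == ffs(x) - 1 for non-zero x.
inline uint32_t count_trailing_zeros(uint32_t x)
{
    if (x == 0) {
        return 32u;
    }
    return static_cast<uint32_t>(__builtin_ffs(static_cast<int>(x))) - 1u;
}

// Integer square root from a single-precision sqrt. The float result is
// truncated, then nudged so that r*r <= n < (r+1)*(r+1) holds exactly
// (an assumption of this sketch, not something the commit states).
inline uint32_t integer_sqrt(uint32_t n)
{
    uint64_t r = static_cast<uint64_t>(std::sqrt(static_cast<float>(n)));
    while (r * r > n) {
        --r;
    }
    while ((r + 1) * (r + 1) <= n) {
        ++r;
    }
    return static_cast<uint32_t>(r);
}
```

For example, `count_trailing_zeros(8u)` yields 3 and `integer_sqrt(17u)` yields 4. The explicit zero checks are needed only because the host builtins leave the zero case undefined or return 0.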
2015-03-13 12:38:14 +05:00
926 MiB
Languages
C++ 78.1%
Python 15%
C 2.8%
GLSL 1.8%
CMake 1.2%
Other 0.9%