blender

Author	SHA1	Message	Date
Sergey Sharybin	f18d77b874	Cycles: Restore some lost custom cflags passed to the kernel compilation They were lost during simplification of kernel loading but might be rather crucial for the performance. Also made it so cflags are shared across kernels. Surely it might lead to some unwanted kernel re-compilation but at the same time they might easily run out of sync with the changes in kernel and so.	2015-05-21 14:05:53 +05:00
Sergey Sharybin	148ed4e05e	Cycles: Cleanup, synchronize name across file name, program and kernel names	2015-05-20 23:10:07 +05:00
Sergey Sharybin	6f48df45ee	Cycles: Simplify code around kernel loading	2015-05-20 23:10:07 +05:00
Thomas Dinges	105b87a3f7	Cycles: Enable advanced shading on AMD / OpenCL. That is needed for Motion Blur and Render Passes to work properly. I hope there are no nasty side effects, but we need to test this.	2015-05-17 19:29:33 +02:00
Thomas Dinges	14c2bc53c0	Cleanup: Typos, typos everywhere. :D	2015-05-17 18:32:31 +02:00
Campbell Barton	daeb3069cf	Cleanup: typos	2015-05-17 16:09:32 +10:00
Campbell Barton	31e96cbf96	Cleanup: style, spelling	2015-05-15 23:38:53 +10:00
Sergey Sharybin	c2b9f78415	Cycles: Pass __KERNEL_EXPERIMENTAL__ to OpenCL split kernels Experimental feature set id currently unavailable for megakernel, it'll require some changes to the cache system to distinguish cached regular kernels from cached experimental kernels. Currently unused, but some features will be enabled soon.	2015-05-15 13:22:47 +05:00
Sergey Sharybin	960d7df56f	Cycles: Pass device compute capabilities to kernel via build options This way it's possible to do device-selective feature disabling/enabling. Currently only supported for NVidia devices via OpenCL extension.	2015-05-15 13:22:47 +05:00
Sergey Sharybin	03f9d5a4cf	Cycles: Cleanup, move build options string calculation into the device class This way it's easier to access platform name, device ID and other stuff which might be needed to define build options.	2015-05-15 13:22:47 +05:00
Sergey Sharybin	3c10ec96b5	Cycles: Enable object motion blur on Intel OpenCL platform This required allocating some memory related on object transform needed by ShaderData and currently it is done for all the platforms. Since we're targeting full feature-complete platforms this is rather acceptable at this point and in the future we'll do selective NO_HAIR/NO_SSS/NO_BLUR kernels. This is experimental still and in fact there're some major issues on NVidia platform and it's not really clear if it's a bug in compiler, some uninitizlied variable or other kind of issue.	2015-05-15 00:48:12 +05:00
Sergey Sharybin	03565218d5	Cycles: Various fixes Some stupid fixes like spaces around operator and missing semicolon, plus fix for wrong detecting of ShaderData SOA size. Thar was harmless since there's only one closure array, but still better to fix this.	2015-05-15 00:42:05 +05:00
Sergey Sharybin	f6c6dd44de	Cycles: Remove meaningless ifdef checks for features in device_opencl This file was actually checking for features enabled on CPU and surely all of them were enabled, so removing them does not cause any difference. ideally we'll need to do runtime feature detection and just pass some stuff as NULL to the kernel, or maybe also have variadic kernel entry points which is also possible quite easily.	2015-05-14 23:44:19 +05:00
Sergey Sharybin	93867ae549	Cycles: Cleanup: use generic utility function to set kernel arguments	2015-05-13 19:56:24 +05:00
Sergey Sharybin	51a6bc8faa	Cycles: Inline sizeof of elements needed for the split kernel No need to store them in the class, they're unlikely to be changed and if they do change we're in big trouble anyway. More appropriate approach would be then to typedef this things in kernel_types.h, but still use inlined sizeof(),	2015-05-13 19:56:24 +05:00
Sergey Sharybin	3a2c0ccdd0	Cycles: Correction to opencl whitelist check Was using platform as a device id accidentally.	2015-05-10 20:02:06 +05:00
Sergey Sharybin	136d7a4f62	Cycles: Only whitelist AMD GPU devices in the OpenCL section Only those ones are priority for now, all the rest are still testable if CYCLES_OPENCL_TEST or CYCLES_OPENCL_SPLIT_KERNEL_TEST environment variables are set.	2015-05-09 23:40:26 +05:00
George Kyriazis	7f4479da42	Cycles: OpenCL kernel split This commit contains all the work related on the AMD megakernel split work which was mainly done by Varun Sundar, George Kyriazis and Lenny Wang, plus some help from Sergey Sharybin, Martijn Berger, Thomas Dinges and likely someone else which we're forgetting to mention. Currently only AMD cards are enabled for the new split kernel, but it is possible to force split opencl kernel to be used by setting the following environment variable: CYCLES_OPENCL_SPLIT_KERNEL_TEST=1. Not all the features are supported yet, and that being said no motion blur, camera blur, SSS and volumetrics for now. Also transparent shadows are disabled on AMD device because of some compiler bug. This kernel is also only implements regular path tracing and supporting branched one will take a bit. Branched path tracing is exposed to the interface still, which is a bit misleading and will be hidden there soon. More feature will be enabled once they're ported to the split kernel and tested. Neither regular CPU nor CUDA has any difference, they're generating the same exact code, which means no regressions/improvements there. Based on the research paper: https://research.nvidia.com/sites/default/files/publications/laine2013hpg_paper.pdf Here's the documentation: https://docs.google.com/document/d/1LuXW-CV-sVJkQaEGZlMJ86jZ8FmoPfecaMdR-oiWbUY/edit Design discussion of the patch: https://developer.blender.org/T44197 Differential Revision: https://developer.blender.org/D1200	2015-05-09 19:52:40 +05:00
Sergey Sharybin	2f5dd83759	Cycles: Add some statistics logging Covers number of entities in the scene (objects, meshes etc), also reports sizes of textures being allocated.	2015-04-10 15:37:49 +05:00
Sergey Sharybin	5ff132182d	Cycles: Code cleanup, spaces around keywords This inconsistency drove me totally crazy, it's really confusing when it's inconsistent especially when you work on both Cycles and Blender sides. Shouldn;t cause merge PITA, it's whitespace changes only, Git should be able to merge it nicely.	2015-03-28 00:15:15 +05:00
Sergey Sharybin	585dd26120	Cycles: Code cleanup, prepare for strict C++ flags	2015-03-27 18:23:31 +05:00
Sergey Sharybin	a922be9270	Cycles: Repot CPU and CUDA capabilities to system info operator For CPU it gives available instructions set (SSE, AVX and so). For GPU CUDA it reports most of the attribute values returned by cuDeviceGetAttribute(). Ideally we need to only use set of those which are driver-specific (so we don't clutter system info with values which we can get from GPU specifications and be sure they stay the same because driver can't affect on them).	2015-01-06 14:13:21 +05:00
Sergey Sharybin	1369bd562c	Cycles: Fix compilation error on AVX platforms with -arch-native Was a conflict in headers between clew and util_optimization.h.	2015-01-03 00:11:28 +05:00
Thomas Dinges	ee36e75b85	Cleanup: Fix Cycles Apache header. This was already mixed a bit, but the dot belongs there.	2014-12-25 02:50:24 +01:00
Sergey Sharybin	68f2066602	Cycles: Make OpenCL folks happy to use __KERNEL_DEBUG__ Quite straightforward change, the only annoying thing is that we can't use indentation for include directive just because of the way headers inlineing works for OpenCL. Might do smarter job in path_source_replace_includes() but don't want to spend time on this yet.	2014-10-05 16:00:23 +06:00
Sergey Sharybin	fbed2047c8	Fix wrong track of the memory when doing device vector resize before freeing it This is rather legit case which happens i.e. when having persistent images enabled and session is updating the lookup tables. Now device_memory keeps track of amount of memory being allocated on the device, which makes freeing using the proper allocated size, not the CPU side buffer size.	2014-09-04 17:25:12 +06:00
Dalai Felinto	8d3cc431d7	Fix T41471 Cycles Bake: Setting small tile size results in wrong bake with stripes rather than the expected noise pattern This problem was introduced in `983cbafd18` Basically the issue is that we were not getting a unique index in the baking routine for the RNG (random number generator). Reviewers: sergey Differential Revision: https://developer.blender.org/D749	2014-08-19 11:40:33 +02:00
Martijn Berger	c020bd2e73	Cycles OpenCL error to string removed in favour of the same function in clew.	2014-08-09 14:27:40 +02:00
Sergey Sharybin	77b7e1fe9a	Deduplicate CUDA and OpenCL wranglers For now it was mainly about OpenCL wrangler being duplicated between Cycles and Compositor, but with OpenSubdiv work those wranglers were gonna to be duplicated just once again. This commit makes it so Cycles and Compositor uses wranglers from this repositories: - https://github.com/CudaWrangler/cuew - https://github.com/OpenCLWrangler/clew This repositories are based on the wranglers we used before and they'll be likely continued maintaining by us plus some more players in the market. Pretty much straightforward change with some tricks in the CMake/SCons to make this libs being passed to the linker after all other libraries in order to make OpenSubdiv linked against those wranglers in the future. For those who're worrying about Cycles being less standalone, it's not truth, it's rather more flexible now and in the future different wranglers might be used in Cycles. For now it'll just mean those libs would need to be put into Cycles repository together with some other libs from Blender such as mikkspace. This is mainly platform maintenance commit, should not be any changes to the user space. Reviewers: juicyfruit, dingto, campbellbarton Reviewed By: juicyfruit, dingto, campbellbarton Differential Revision: https://developer.blender.org/D707	2014-08-05 13:57:50 +06:00
Dalai Felinto	fc55c41bba	Cycles Bake: show progress bar during bake Baking progress preview is not possible, in parts due to the way the API was designed. But at least you get to see the progress bar while baking. Reviewers: sergey Differential Revision: https://developer.blender.org/D656	2014-07-25 11:42:53 -03:00
Brecht Van Lommel	e4e58d4612	Fix T40370: cycles CUDA baking timeout with high number of AA samples. Now baking does one AA sample at a time, just like final render. There is also some code for shader antialiasing that solves T40369 but it is disabled for now because there may be unpredictable side effects.	2014-06-06 15:39:04 +02:00
Brecht Van Lommel	69c7522b24	Fix T40379: world MIS causing too much CUDA memory usage. The kernel for baking the world texture was the same as the one used for baking. Now that's separate which allows the kernel to reserve much less memory.	2014-05-27 15:11:32 +02:00
Campbell Barton	dc13969e48	Style cleanup: indentation, braces	2014-05-05 02:19:08 +10:00
Campbell Barton	1618329b00	Code cleanup: style, require ; for cuda_assert, opencl_assert	2014-05-04 03:57:50 +10:00
Campbell Barton	8d16869d83	Code cleanup: Add -Werror=float-conversion to Cycles	2014-05-03 07:31:46 +10:00
Thomas Dinges	f6abc96b6b	Cleanup: Remove OpenCL __MULTI_CLOSURE__ sanity check, not needed anymore after `04a10907dc`.	2014-04-21 18:08:01 +02:00
Martijn Berger	163a3212b4	OpenCL Change opencl_assert to be more like cuda assert where possible. added some extra warnings and feedback if things go wrong	2014-04-07 16:17:20 +02:00
Martijn Berger	dd2dca2f7e	Add support for multiple interpolation modes on cycles image textures All textures are sampled bi-linear currently with the exception of OSL there texture sampling is fixed and set to smart bi-cubic. This patch adds user control to this setting. Added: - bits to DNA / RNA in the form of an enum for supporting multiple interpolations types - changes to the image texture node drawing code ( add enum) - to ImageManager (this needs to know to allocate second texture when interpolation type is different) - to node compiler (pass on interpolation type) - to device tex_alloc this also needs to get the concept of multiple interpolation types - implementation for doing non interpolated lookup for cuda and cpu - implementation where we pass this along to osl ( this makes OSL also do linear untill I add smartcubic to the interface / DNA/ RNA) Reviewers: brecht, dingto Reviewed By: brecht CC: dingto, venomgfx Differential Revision: https://developer.blender.org/D317	2014-03-07 23:16:33 +01:00
Thomas Dinges	ad0a3de3ce	Cycles / OpenCL: Let the OpenCL runtime determine its optimal work-group size automatically, by passing a NULL pointer here. This is recommended in the Intel OpenCL optimization docs (http://software.intel.com/en-us/vcsource/samples/optimizing-opencl) and I can confirm a small performance increase here (1-2% on nVidia OpenCL, up to 8% on Intel OpenCL).	2013-12-24 20:20:57 +01:00
Thomas Dinges	011ae78857	Cycles / OpenCL: Fix compile error on OS X After update to Mac OS X 10.9.1, OpenCL works now on my Intel CPU in the 2013 Macbook Pro (even the entire kernel). The Intel Iris Pro GPU still segfaults here though, even when all flags are disabled (building "clay like" kernel only). Maybe we need the -no-missing-prototypes for AMD hardware still, but I couldn't find a way to distuinguish here.	2013-12-17 09:59:18 +01:00
Martijn Berger	85a0c5d4e1	Cycles: network render code updated for latest changes and improved This actually works somewhat now, although viewport rendering is broken and any kind of network error or connection failure will kill Blender. * Experimental WITH_CYCLES_NETWORK cmake option * Networked Device is shown as an option next to CPU and GPU Compute * Various updates to work with the latest Cycles code * Locks and thread safety for RPC calls and tiles * Refactored pointer mapping code * Fix error in CPU brand string retrieval code This includes work by Doug Gale, Martijn Berger and Brecht Van Lommel. Reviewers: brecht Differential Revision: http://developer.blender.org/D36	2013-12-07 12:26:58 +01:00
Brecht Van Lommel	cbb783f1d6	Fix cycles OpenCL compile error on AMD, and fix assert in debug builds.	2013-10-02 14:41:04 +00:00
Brecht Van Lommel	31e6181187	Fix #36873 : cycles opencl render status show negative sample count.	2013-09-30 12:11:25 +00:00
Brecht Van Lommel	29f6616d60	Cycles: viewport render now takes scene color management settings into account, except for curves, that's still missing from the OpenColorIO GLSL shader. The pixels are stored in a half float texture, converterd from full float with native GPU instructions and SIMD on the CPU, so it should be pretty quick. Using a GLSL shader is useful for GPU render because it avoids a copy through CPU memory.	2013-08-30 23:49:38 +00:00
Brecht Van Lommel	6785874e7a	Fix #36137 : cycles render not using all GPU's when the number of GPU's is larger than the number of CPU threads	2013-08-30 23:09:22 +00:00
Brecht Van Lommel	b9ce231060	Cycles: relicense GNU GPL source code to Apache version 2.0. More information in this post: http://code.blender.org/ Thanks to all contributes for giving their permission!	2013-08-18 14:16:15 +00:00
Brecht Van Lommel	e11e30aadf	Fix Cycles OpenCL issue if context/program creation fails, mistake by me, patch #35866 by Doug Gale to fix it.	2013-06-26 12:24:33 +00:00
Brecht Van Lommel	2e3035dd80	Cycles OpenCL: make displacement and world importance sampling work.	2013-06-21 13:05:08 +00:00
Brecht Van Lommel	16204bd647	Cycles: prepare to make CUDA 5.0 the official version we use * Add CUDA compiler version detection to cmake/scons/runtime * Remove noinline in kernel_shader.h and reenable --use_fast_math if CUDA 5.x is used, these were workarounds for CUDA 4.2 bugs * Change max number of registers to 32 for sm 2.x (based on performance tests from Martijn Berger and confirmed here), and also for NVidia OpenCL. Overall it seems that with these changes and the latest CUDA 5.0 download, that performance is as good as or better than the 2.67b release with the scenes and graphics cards I tested.	2013-06-19 17:54:23 +00:00
Brecht Van Lommel	0ad88d1001	Fix another windows / msvc build error.	2013-06-01 02:39:34 +00:00

1 2

100 Commits