yuzu/src at 66a0cedba39cabba30c756626d7b58bd0e519d8e - mirrors/yuzu - Computernewb Git

ReinUsesLisp 66a0cedba3 shader: Fold integer FMA from Nvidia's pattern	2021-07-26 04:58:02 -03:00

Fold shaders doing "a * b + c" on integers from the pattern generated by Nvidia's GL compiler. On a somewhat complex compute shader it reduces the code size by 16 instructions from 2 matches on Turing GPUs.

On Intel as extracted from KHR_pipeline_executable_properties:

Before the optimization:
```
Instruction Count: 2057
Basic Block Count: 45
Scratch Memory Size: 14752
Spill Count: 232
Fill Count: 261
SEND Count: 610
Cycle Count: 11325
```

After the optimization:
```
Instruction Count: 2046
Basic Block Count: 44
Scratch Memory Size: 13728
Spill Count: 219
Fill Count: 268
SEND Count: 604
Cycle Count: 11367
```

audio_core	Merge pull request #6571 from Kelebek1/Mix	2021-07-12 22:09:05 -07:00
common	Merge pull request #6697 from ameerj/fps-cap	2021-07-25 16:23:44 -07:00
core	Merge pull request #6697 from ameerj/fps-cap	2021-07-25 16:23:44 -07:00
input_common	Merge pull request #6654 from german77/custom_threshold	2021-07-21 20:31:33 -04:00
shader_recompiler	shader: Fold integer FMA from Nvidia's pattern	2021-07-26 04:58:02 -03:00
tests	shader_recompiler,video_core: Cleanup some GCC and Clang errors	2021-07-22 21:51:26 -04:00
video_core	Merge pull request #6585 from ameerj/hades	2021-07-25 11:39:04 -07:00
web_service	web_service: Silence -Wmaybe-uninitialized on httplib.h	2021-07-12 03:30:45 -03:00
yuzu	Merge pull request #6697 from ameerj/fps-cap	2021-07-25 16:23:44 -07:00
yuzu_cmd	Merge pull request #6697 from ameerj/fps-cap	2021-07-25 16:23:44 -07:00
.clang-format
CMakeLists.txt	shader: Initial recompiler work	2021-07-22 21:51:21 -04:00