Revert "i965/fs: Predicate byte scattered writes if needed"
authorFrancisco Jerez <currojerez@riseup.net>
Sun, 25 Feb 2018 00:05:21 +0000 (16:05 -0800)
committerFrancisco Jerez <currojerez@riseup.net>
Fri, 2 Mar 2018 19:28:56 +0000 (11:28 -0800)
This reverts commit a4031bdfa927fb4c3c5d0bdadc70634f3c1a5eac.  It's
redundant with the sample mask predication done at this point by the
common logical send lowering infrastructure, and rather buggy because
it wasn't applying the correct sample mask in shaders using discard,
since the dispatch mask returned by FS_OPCODE_MOV_DISPATCH_TO_FLAGS
doesn't reflect samples discarded by the shader, so it could have led
to data corruption in fragment shader invocations that execute discard
based on a non-dynamically uniform condition.

Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
src/intel/compiler/brw_fs_nir.cpp

index 47247875e8b2ab7f25cc8290aa2d0c7580e9edc2..554d61d71af54087026323764fc31abee31011ed 100644 (file)
@@ -4207,25 +4207,12 @@ fs_visitor::nir_emit_intrinsic(const fs_builder &bld, nir_intrinsic_instr *instr
              * to rely on byte scattered in order to write 16-bit elements.
              * The byte_scattered_write message needs that every written 16-bit
              * type to be aligned 32-bits (stride=2).
-             * Additionally, while on Untyped Surface messages the
-             * bits of the execution mask are ANDed with the corresponding
-             * bits of the Pixel/Sample Mask, that is not the case for byte
-             * scattered writes. That is needed to avoid ssbo stores writing
-             * on helper invocations. So when that can affect, we load the
-             * sample mask, and predicate the send message.
              */
-            brw_predicate pred = BRW_PREDICATE_NONE;
-
-            if (stage == MESA_SHADER_FRAGMENT) {
-               bld.emit(FS_OPCODE_MOV_DISPATCH_TO_FLAGS);
-               pred = BRW_PREDICATE_NORMAL;
-            }
-
             emit_byte_scattered_write(bld, surf_index, offset_reg,
                                       write_src,
                                       1 /* dims */, 1,
                                       bit_size,
-                                      pred);
+                                      BRW_PREDICATE_NONE);
          } else {
             assert(num_components * type_size <= 16);
             assert((num_components * type_size) % 4 == 0);