i965/vec4: Only zero out unused message components when there are any.
authorKenneth Graunke <kenneth@whitecape.org>
Mon, 9 Sep 2013 18:11:03 +0000 (11:11 -0700)
committerKenneth Graunke <kenneth@whitecape.org>
Wed, 11 Sep 2013 00:52:56 +0000 (17:52 -0700)
Otherwise, coordinates with four components would result in a MOV
with a destination writemask that has no channels enabled:

mov(8) g115<1>.F 0D { align16 WE_normal NoDDChk 1Q };

At best, this is stupid: we emit code that shouldn't do anything.
Worse, it apparently causes GPU hangs (observable with Chris's
textureGather test on CubeArrays.)

Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>
Reviewed-by: Ian Romanick <idr@freedesktop.org>
Cc: mesa-stable@lists.freedesktop.org
src/mesa/drivers/dri/i965/brw_vec4_visitor.cpp

index a51b61baf03bc059213a5516a6bad05c6bd1b3d4..86ecd214496536d84fdcd8df100a3ed06315873a 100644 (file)
@@ -2248,8 +2248,10 @@ vec4_visitor::visit(ir_texture *ir)
         emit(MOV(dst_reg(MRF, param_base, ir->coordinate->type, coord_mask),
                  coordinate));
       }
-      emit(MOV(dst_reg(MRF, param_base, ir->coordinate->type, zero_mask),
-              src_reg(0)));
+      if (zero_mask != 0) {
+         emit(MOV(dst_reg(MRF, param_base, ir->coordinate->type, zero_mask),
+                  src_reg(0)));
+      }
       /* Load the shadow comparitor */
       if (ir->shadow_comparitor && ir->op != ir_txd) {
         emit(MOV(dst_reg(MRF, param_base + 1, ir->shadow_comparitor->type,