i965: Don't consider null dst instructions as matching non-null dst.
author: Matt Turner <mattst88@gmail.com>
Mon, 12 Jan 2015 21:58:06 +0000 (13:58 -0800)
committer: Matt Turner <mattst88@gmail.com>
Thu, 15 Jan 2015 18:11:42 +0000 (10:11 -0800)

When performing common subexpression elimination on instructions with
non-null destinations, we emit a MOV to copy the result to a new
register that must have no other uses. In the case of:

   cmp.g.f0.0(8) null:D, vgrf43:F, 0.500000f
   ...
   cmp.g.f0.0(8) vgrf113:D, vgrf43:F, 0.500000f

we put the first instruction in the AEB and decided that we could reuse
its result when we found the second. Unfortunately, that meant we would
emit a MOV from the first instruction's destination, which is null.

Don't do anything if the entry's destination is null and the
instruction's destination is non-null.
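
As a rough illustration of the check described above, here is a minimal
standalone sketch. The Reg and Inst structs, the can_reuse() helper, and
the simplified instructions_match() are hypothetical stand-ins, not the
real i965 classes; the sketch assumes a matcher that compares only the
opcode and sources and ignores the destination register.

```cpp
// Hypothetical, simplified stand-ins for the i965 IR types (not Mesa's classes).
struct Reg {
    int nr;  // virtual register number; a negative value models the null register
    bool is_null() const { return nr < 0; }
};

struct Inst {
    int opcode;
    Reg dst;
    int src0;    // source register number
    float src1;  // immediate operand
};

// Assumed matcher: compares the expression (opcode and sources), not the destination.
static bool instructions_match(const Inst &a, const Inst &b) {
    return a.opcode == b.opcode && a.src0 == b.src0 && a.src1 == b.src1;
}

// Returns true when the AEB entry's result may be reused for `inst`.
// An entry that wrote to null has no result to copy, so it must not be
// reused by an instruction that needs a real destination.
static bool can_reuse(const Inst &inst, const Inst &entry) {
    if (entry.dst.is_null() && !inst.dst.is_null())
        return false;
    return instructions_match(inst, entry);
}
```

With the two cmp instructions from the example, the null-destination
entry is rejected for the vgrf113 instruction, while two instructions
that both write to null (or an entry with a real destination) can still
match.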

Tested-by: Tapani Pälli <tapani.palli@intel.com>
src/mesa/drivers/dri/i965/brw_fs_cse.cpp
src/mesa/drivers/dri/i965/brw_vec4_cse.cpp

diff --git a/src/mesa/drivers/dri/i965/brw_fs_cse.cpp b/src/mesa/drivers/dri/i965/brw_fs_cse.cpp
index f87601ce0ecd48263607cf2fd989fe739cf31a76..11cb327614c677ae55d1f6247d6e8b5b7eb46245 100644
@@ -194,7 +194,8 @@ fs_visitor::opt_cse_local(bblock_t *block)
 
          foreach_in_list_use_after(aeb_entry, entry, &aeb) {
             /* Match current instruction's expression against those in AEB. */
-            if (instructions_match(inst, entry->generator)) {
+            if (!(entry->generator->dst.is_null() && !inst->dst.is_null()) &&
+                instructions_match(inst, entry->generator)) {
                found = true;
                progress = true;
                break;
diff --git a/src/mesa/drivers/dri/i965/brw_vec4_cse.cpp b/src/mesa/drivers/dri/i965/brw_vec4_cse.cpp
index 30a4098b339257b68aff4c14009ad12611b2e9fd..ee50419dc9a354072a9fde6cb49dd77e3be57efb 100644
@@ -157,7 +157,8 @@ vec4_visitor::opt_cse_local(bblock_t *block)
 
          foreach_in_list_use_after(aeb_entry, entry, &aeb) {
             /* Match current instruction's expression against those in AEB. */
-            if (instructions_match(inst, entry->generator)) {
+            if (!(entry->generator->dst.is_null() && !inst->dst.is_null()) &&
+                instructions_match(inst, entry->generator)) {
                found = true;
                progress = true;
                break;