util/ra: Improve ra_set_finalize() performance.
authorEric Anholt <eric@anholt.net>
Mon, 13 Apr 2020 18:14:23 +0000 (11:14 -0700)
committerMarge Bot <eric+marge@anholt.net>
Wed, 29 Apr 2020 19:46:08 +0000 (19:46 +0000)
BITSET_FOR_EACH_SET can walk a sparse set (such as a register class's set
of registers) much faster than just iterating over individual bits.

Improves freedreno startup time (as measured by shader-db ./run
shaders/closed/gputest/triangle on my x86 system) by -4.12679% +/-
1.99006% (n=151)

Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4537>

src/util/register_allocate.c

index 091500710a94bdacab56ff764c3b43900201be7c..c0cd879c9cac85b300988b17e5c6e81973f9b473 100644 (file)
@@ -400,12 +400,9 @@ ra_set_finalize(struct ra_regs *regs, unsigned int **q_values)
             unsigned int rc;
             int max_conflicts = 0;
 
-            for (rc = 0; rc < regs->count; rc++) {
+            BITSET_FOREACH_SET(rc, regs->classes[c]->regs, regs->count) {
                int conflicts = 0;
 
-               if (!reg_belongs_to_class(rc, regs->classes[c]))
-                  continue;
-
                util_dynarray_foreach(&regs->regs[rc].conflict_list,
                                      unsigned int, rbp) {
                   unsigned int rb = *rbp;