git.libre-soc.org Git - mesa.git/commit

author	Kenneth Graunke <kenneth@whitecape.org>
	Tue, 13 Oct 2015 22:30:03 +0000 (15:30 -0700)
committer	Kenneth Graunke <kenneth@whitecape.org>
	Thu, 5 Nov 2015 23:26:07 +0000 (15:26 -0800)
commit	8dcf807cb43383590ba193c7ff20b8a98e4a9f65
tree	e9c05a2c3895e154905e993eddd0f8fd9191c6dc	tree
parent	5ae37ae6151623303300047d7465d199df8199a4	commit \| diff

i965: Fix scalar VS float[] and vec2[] output arrays.

The scalar VS backend has never handled float[] and vec2[] outputs
correctly (my original code was broken).  Outputs need to be padded
out to vec4 slots.

In fs_visitor::nir_setup_outputs(), we tried to process each vec4 slot
by looping from 0 to ALIGN(type_size_scalar(type), 4) / 4.  However,
this is wrong: type_size_scalar() for a float[2] would return 2, or
for vec2[2] it would return 4.  This looked like a single slot, even
though in reality each array element would be stored in separate vec4
slots.

Because of this bug, outputs[] and output_components[] would not get
initialized for the second element's VARYING_SLOT, which meant
emit_urb_writes() would skip writing them.  Nothing used those values,
and dead code elimination threw a party.

To fix this, we introduce a new type_size_vec4_times_4() function which
pads array elements correctly, but still counts in scalar components,
generating correct indices in store_output intrinsics.

Normally, varying packing avoids this problem by turning varyings into
vec4s.  So this doesn't actually fix any Piglit or dEQP tests today.
However, if varying packing is disabled, things would be broken.
Tessellation shaders can't use varying packing, so this fixes various
tcs-input Piglit tests on a branch of mine.

v2: Shorten the implementation of type_size_4x to a single line (caught
    by Connor Abbott), and rename it to type_size_vec4_times_4()
    (renaming suggested by Jason Ekstrand).  Use type_size_vec4
    rather than using type_size_vec4_times_4 and then dividing by 4.

Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>

src/mesa/drivers/dri/i965/brw_fs.cpp		diff \| blob \| history
src/mesa/drivers/dri/i965/brw_fs_nir.cpp		diff \| blob \| history
src/mesa/drivers/dri/i965/brw_nir.c		diff \| blob \| history
src/mesa/drivers/dri/i965/brw_shader.h		diff \| blob \| history