mesa.git
13 years agor300g: avoid stall in no-tcl drawing when mapping vbo
Dave Airlie [Mon, 23 Aug 2010 10:28:02 +0000 (20:28 +1000)]
r300g: avoid stall in no-tcl drawing when mapping vbo

The current code reuses the same vbo over and over; however, after a flush
we'd stall waiting to map the vbo when we should just fire and forget.

On a gears test this brings me from ~620 to ~750 on my rv530 in swtcl mode.

Signed-off-by: Dave Airlie <airlied@redhat.com>
13 years agoglapi: Clean up header inclusions.
Chia-I Wu [Mon, 23 Aug 2010 08:13:12 +0000 (16:13 +0800)]
glapi: Clean up header inclusions.

Do not rely on PUBLIC being defined in glapi.h.  Do not include core
mesa headers.

13 years agomesa: Assorted fixes for es_generator.py on win32.
Chia-I Wu [Sat, 21 Aug 2010 10:20:39 +0000 (18:20 +0800)]
mesa: Assorted fixes for es_generator.py on win32.

Fix mixed use of GL_APIENTRY and GLAPIENTRY.  Parameter list of a function
prototype should never be empty.
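
As a small illustration of the second point (a sketch only, not output of es_generator.py; the function name `example_get_error` is hypothetical): before C23, an empty parameter list in C declares a function with unspecified parameters, so calls are not checked against it, whereas `(void)` gives a real prototype.

```c
/* Hypothetical function, for illustration only. */

/* With (void) this is a true prototype: a call such as
 * example_get_error(1) is rejected at compile time.  With "()" a C
 * compiler (pre-C23) would accept any argument list. */
int example_get_error(void);

int example_get_error(void)
{
    return 0; /* stand-in for "no error" */
}
```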

13 years agoi965: Add sandybridge D0 pci ids
Zhenyu Wang [Mon, 23 Aug 2010 02:16:45 +0000 (10:16 +0800)]
i965: Add sandybridge D0 pci ids

Signed-off-by: Zhenyu Wang <zhenyuw@linux.intel.com>
13 years agomesa: Fix typo in autoconf.in that made talloc cflags still detect at runtime.
Eric Anholt [Mon, 23 Aug 2010 01:53:33 +0000 (18:53 -0700)]
mesa: Fix typo in autoconf.in that made talloc cflags still detect at runtime.

13 years agost/mesa: implement depth-only blit for BlitFramebuffer
Marek Olšák [Sat, 14 Aug 2010 15:47:34 +0000 (08:47 -0700)]
st/mesa: implement depth-only blit for BlitFramebuffer

Signed-off-by: Brian Paul <brianp@vmware.com>
13 years agoutil: implement depth blitting in u_blit
Marek Olšák [Mon, 23 Aug 2010 01:29:32 +0000 (19:29 -0600)]
util: implement depth blitting in u_blit

Signed-off-by: Brian Paul <brianp@vmware.com>
13 years agost/mesa: fix BlitFramebuffer for D24S8 textures
Marek Olšák [Sat, 14 Aug 2010 15:47:32 +0000 (08:47 -0700)]
st/mesa: fix BlitFramebuffer for D24S8 textures

This is the same issue as in the previous patch, but here the Blit is not
implemented for separate depth and stencil buffers at all (such
a configuration is not supported in Gallium) and the code incorrectly treated
a D24S8 texture as two separate buffers, making this Blit a no-op.

Signed-off-by: Brian Paul <brianp@vmware.com>
13 years agost/mesa: added st_is_depth_stencil_combined() function
Brian Paul [Mon, 23 Aug 2010 01:34:53 +0000 (19:34 -0600)]
st/mesa: added st_is_depth_stencil_combined() function

This code is part of a patch by Marek Olšák.

13 years agoglsl: Don't constant-fold in a constant in place of a function outval.
Eric Anholt [Mon, 23 Aug 2010 01:26:42 +0000 (18:26 -0700)]
glsl: Don't constant-fold in a constant in place of a function outval.

13 years agoglsl: Convert constant folding to the rvalue visitor.
Eric Anholt [Mon, 23 Aug 2010 01:15:20 +0000 (18:15 -0700)]
glsl: Convert constant folding to the rvalue visitor.

This should be mostly a noop, except that a plain dereference of a
variable that is not part of a constant expression could now get
"constant folded".  I expect that for all current backends this will
be either a noop, or possibly a win when it provokes more
ir_algebraic.  It'll also ensure that when new features are added,
tree walking will work normally.  Before this, constants weren't
getting folded inside of loops.

13 years agoglsl: Don't tree-graft in an expression in place of a function outval.
Eric Anholt [Mon, 23 Aug 2010 01:25:55 +0000 (18:25 -0700)]
glsl: Don't tree-graft in an expression in place of a function outval.

Fixes: glsl-constant-folding-call-1 (bug #29737)
13 years agost/mesa: fix ReadPixels crashes when reading depth/stencil from a FBO
Brian Paul [Mon, 23 Aug 2010 01:04:47 +0000 (19:04 -0600)]
st/mesa: fix ReadPixels crashes when reading depth/stencil from a FBO

This is based on a patch from Marek Olšák.

NOTE: This is a candidate for the Mesa 7.8 branch.

13 years agomesa: use driver hook for creating new renderbuffers
Brian Paul [Mon, 23 Aug 2010 00:54:50 +0000 (18:54 -0600)]
mesa: use driver hook for creating new renderbuffers

13 years agost/mesa: clean-up pipe_get_transfer() calls
Brian Paul [Mon, 23 Aug 2010 00:48:28 +0000 (18:48 -0600)]
st/mesa: clean-up pipe_get_transfer() calls

13 years agomesa: AC_SUBST the talloc libs/cflags so the ./configure results are saved.
Eric Anholt [Mon, 23 Aug 2010 00:34:18 +0000 (17:34 -0700)]
mesa: AC_SUBST the talloc libs/cflags so the ./configure results are saved.

I had used pkg-config from the Makefile because I didn't want to screw
around with the non-autoconf build, but that doesn't work because the
PKG_CONFIG_PATH or TALLOC_LIBS/TALLOC_CFLAGS that people set at
configure time needs to be respected and may not be present at build
time.

Bug #29585

13 years agonvfx: fix minor memory leak
Luca Barbieri [Sun, 22 Aug 2010 22:16:23 +0000 (00:16 +0200)]
nvfx: fix minor memory leak

13 years agonvfx: support both sprite coord origins
Luca Barbieri [Sun, 22 Aug 2010 21:29:34 +0000 (23:29 +0200)]
nvfx: support both sprite coord origins

Now we lie less when claiming OpenGL 2 support.

Also, the first piglit result group is now all green, except for
fdo25614-genmipmap, which seems to be mesa/st's fault.

13 years agonvfx: use 64-bit bitmasks for temps
Luca Barbieri [Sun, 22 Aug 2010 19:41:49 +0000 (21:41 +0200)]
nvfx: use 64-bit bitmasks for temps

13 years agor600g: fix DB decompression
Jerome Glisse [Sun, 22 Aug 2010 21:13:58 +0000 (17:13 -0400)]
r600g: fix DB decompression

Signed-off-by: Jerome Glisse <jglisse@redhat.com>
13 years agonvfx: Include missing header in nvfx_vertprog.c.
Vinson Lee [Sun, 22 Aug 2010 19:45:04 +0000 (12:45 -0700)]
nvfx: Include missing header in nvfx_vertprog.c.

Include draw_context.h for draw_*_vertex_shader symbols.

Fixes the following GCC warning.
nvfx_vertprog.c: In function 'nvfx_vp_state_create':
nvfx_vertprog.c:1276: warning: implicit declaration of function 'draw_create_vertex_shader'
nvfx_vertprog.c:1276: warning: assignment makes pointer from integer without a cast
nvfx_vertprog.c: In function 'nvfx_vp_state_delete':
nvfx_vertprog.c:1298: warning: implicit declaration of function 'draw_delete_vertex_shader'

13 years agotranslate_sse: add R32G32B32A32_FLOAT -> X8X8X8X8_UNORM for EMIT_4UB
Jakob Bornecrantz [Sun, 22 Aug 2010 17:58:57 +0000 (19:58 +0200)]
translate_sse: add R32G32B32A32_FLOAT -> X8X8X8X8_UNORM for EMIT_4UB

Changed by me to use movd instead of movss to avoid penalties.
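
For reference, the conversion named in the title can be sketched in scalar C. This is not the SSE code path from the commit; the function names are hypothetical, and the clamp-then-round-to-nearest behaviour is an assumption about what a float to 8-bit UNORM conversion should do.

```c
#include <stdint.h>

/* Scalar reference for one channel: clamp to [0, 1], scale to 0..255,
 * round to nearest.  The rounding choice is an assumption. */
static uint8_t float_to_unorm8(float f)
{
    if (!(f > 0.0f))      /* also catches NaN */
        return 0;
    if (f >= 1.0f)
        return 255;
    return (uint8_t)(f * 255.0f + 0.5f);
}

/* Convert one RGBA float attribute to four unsigned bytes. */
void rgba_float_to_4ub(const float src[4], uint8_t dst[4])
{
    int i;

    for (i = 0; i < 4; i++)
        dst[i] = float_to_unorm8(src[i]);
}
```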

13 years agotranslate_sse: refactor constant management
Luca Barbieri [Sun, 22 Aug 2010 16:11:22 +0000 (17:11 +0100)]
translate_sse: refactor constant management

13 years agonvfx: refactor to support multiple fragment program versions
Luca Barbieri [Sun, 22 Aug 2010 14:15:51 +0000 (16:15 +0200)]
nvfx: refactor to support multiple fragment program versions

13 years agonvfx: move stuff around
Luca Barbieri [Sun, 22 Aug 2010 13:48:41 +0000 (15:48 +0200)]
nvfx: move stuff around

13 years agor600g: depth buffer likely needs decompression when used as texture
Jerome Glisse [Sun, 22 Aug 2010 18:22:00 +0000 (14:22 -0400)]
r600g: depth buffer likely needs decompression when used as texture

Before a depth buffer is used as a texture, it needs to be decompressed
(the tile pattern of the db differs from the one used for
colorbuffer-like textures).

Signed-off-by: Jerome Glisse <jglisse@redhat.com>
13 years agoglx/xlib: configurable strict/non-strict buffer size invalidate
Keith Whitwell [Sun, 22 Aug 2010 13:14:55 +0000 (14:14 +0100)]
glx/xlib: configurable strict/non-strict buffer size invalidate

Introduce a new configuration option XMESA_STRICT_INVALIDATE to switch
between swapbuffers-based and glViewport-based buffer invalidation.

Default strict invalidate to false, ie glViewport-based invalidation,
aka ST_MANAGER_BROKEN_INVALIDATE.

This means we will not call XGetGeometry after every swapbuffers,
which allows swapbuffers to remain asynchronous.  For apps running at
100fps with synchronous swapping, a 10% boost is typical.  For gears,
I see closer to 20% speedup.

Note that the work of copying data on swapbuffers doesn't disappear -
this change just allows the X server to execute the PutImage
asynchronously without us being effectively blocked until its completion.

This applies even to llvmpipe's threaded rasterization as the
swapbuffers operation was a large part of the serial component of an
llvmpipe frame.

The downside of this is correctness - applications which don't call
glViewport on window resizes will get incorrect rendering, unless
XMESA_STRICT_INVALIDATE is set.

The ultimate solution would be to have per-frame but asynchronous
invalidation.  Xcb almost looks as if it could provide this, but the
API doesn't seem to quite be there.
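
A minimal sketch of how an option like XMESA_STRICT_INVALIDATE could be read from the environment, defaulting to non-strict as described above. The helper names and the accepted spellings ("1", "true", "yes", ...) are assumptions for illustration, not the code from this commit.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: interpret an environment value as a boolean,
 * falling back to 'dfault' when unset or unrecognised. */
static int parse_bool(const char *value, int dfault)
{
    if (value == NULL)
        return dfault;
    if (!strcmp(value, "1") || !strcmp(value, "true") || !strcmp(value, "yes"))
        return 1;
    if (!strcmp(value, "0") || !strcmp(value, "false") || !strcmp(value, "no"))
        return 0;
    return dfault;
}

/* Default is non-strict (glViewport-based) invalidation. */
int strict_invalidate_enabled(void)
{
    return parse_bool(getenv("XMESA_STRICT_INVALIDATE"), 0);
}
```

An application that does not call glViewport on resize would then be run with the variable set, trading the asynchronous swapbuffers for correct rendering.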

13 years agollvmpipe: reduce size of fragment shader variant key
Keith Whitwell [Sun, 22 Aug 2010 11:31:18 +0000 (12:31 +0100)]
llvmpipe: reduce size of fragment shader variant key

Don't spend as much time comparing them.

13 years agollvmpipe: remove unused member from lp_fragment_shader_variant_key
Keith Whitwell [Sun, 22 Aug 2010 11:16:45 +0000 (12:16 +0100)]
llvmpipe: remove unused member from lp_fragment_shader_variant_key

13 years agollvmpipe: don't clear unused bins
Keith Whitwell [Sun, 22 Aug 2010 10:43:01 +0000 (11:43 +0100)]
llvmpipe: don't clear unused bins

If bins outside the current scene bounds are being corrupted, we'll
need to fix that separately.  Currently seems ok though.

13 years agodraw: reduce the size of the llvm variant key
Keith Whitwell [Sat, 21 Aug 2010 21:51:38 +0000 (22:51 +0100)]
draw: reduce the size of the llvm variant key

13 years agoglx/xlib: remove another XSync
Keith Whitwell [Thu, 19 Aug 2010 23:14:47 +0000 (00:14 +0100)]
glx/xlib: remove another XSync

With this change, xmesa_get_window_size still does one round trip, but
that's better than doing two.

13 years agoglx/xlib: no need to call XSync from XMesaFlush
Keith Whitwell [Thu, 19 Aug 2010 23:08:22 +0000 (00:08 +0100)]
glx/xlib: no need to call XSync from XMesaFlush

Try to eliminate some unnecessary X server round trips.

13 years agonvfx: simplify and correct fragment program update logic
Luca Barbieri [Sun, 22 Aug 2010 10:02:41 +0000 (12:02 +0200)]
nvfx: simplify and correct fragment program update logic

This version should hopefully be much clearer and thus less likely
to be subtly broken.

Also fixes point sprites on nv40 and possibly some other bugs too.

13 years agonvfx: make stipple setting independent of enable
Luca Barbieri [Sun, 22 Aug 2010 09:58:54 +0000 (11:58 +0200)]
nvfx: make stipple setting independent of enable

13 years agonvfx: fix vertex programs
Luca Barbieri [Sun, 22 Aug 2010 12:53:49 +0000 (14:53 +0200)]
nvfx: fix vertex programs

13 years agonvfx: use relocations array for vp constants
Luca Barbieri [Sat, 21 Aug 2010 22:21:55 +0000 (00:21 +0200)]
nvfx: use relocations array for vp constants

13 years agor600g: Don't blindly unmap NULL->size.
Henri Verbeet [Mon, 16 Aug 2010 20:18:37 +0000 (22:18 +0200)]
r600g: Don't blindly unmap NULL->size.

There may actually be something mapped in that range, especially for large
buffers, e.g. the GL Drawable.

13 years agosvga: Do not shortcut NULL surface relocations with SVGA3D_INVALID_ID.
José Fonseca [Sun, 15 Aug 2010 12:36:02 +0000 (13:36 +0100)]
svga: Do not shortcut NULL surface relocations with SVGA3D_INVALID_ID.

How to cope with NULL surface relocations should be entirely at winsys'
discretion.

13 years agoi965: Fix 8-wide FB writes on gen6.
Eric Anholt [Sun, 22 Aug 2010 07:47:45 +0000 (00:47 -0700)]
i965: Fix 8-wide FB writes on gen6.

My merge of Zhenyu's patch on top of my previous patches broke it: my
code expected a simd16 single write, and Zhenyu's simd8 path was
disabled by mine.  Merge the two for success.

13 years agoi965: Fix brw_math1 with scalar argument in gen6 FS.
Eric Anholt [Sun, 22 Aug 2010 07:44:28 +0000 (00:44 -0700)]
i965: Fix brw_math1 with scalar argument in gen6 FS.

The docs claim two conflicting things: One, that a scalar source is
supported.  Two, source hstride must be 1 and width must be exec size.
So splat a constant argument out into a full reg to operate on, since
violating the second set of constraints is clearly failing.

The alternative here might be to do a 1-wide exec on a constant
argument for math1.  It would probably save cycles too.  But I'll
leave that for the glsl2-965 branch.

Fixes glsl-algebraic-div-one-2.shader_test.

13 years agoi965: Fix up WM push constant setup on gen6.
Eric Anholt [Sun, 22 Aug 2010 07:26:09 +0000 (00:26 -0700)]
i965: Fix up WM push constant setup on gen6.

Fixes glsl-algebraic-add-add-1.

13 years agoi965: Use intel->gen >= 6 instead of IS_GEN6.
Eric Anholt [Sun, 22 Aug 2010 06:47:06 +0000 (23:47 -0700)]
i965: Use intel->gen >= 6 instead of IS_GEN6.

13 years agolibgl-xlib: Include missing header in xlib.c.
Vinson Lee [Sun, 22 Aug 2010 07:30:47 +0000 (00:30 -0700)]
libgl-xlib: Include missing header in xlib.c.

Include st_api.h for st_api_create_OpenGL symbol.

13 years agonvfx: Silence unused variable warning.
Vinson Lee [Sun, 22 Aug 2010 07:16:54 +0000 (00:16 -0700)]
nvfx: Silence unused variable warning.

The variable is used but only in the body of an assert.
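
The commit message does not say how the warning was silenced. One common approach, sketched here with hypothetical names, is to reference the variable with a `(void)` cast so it still counts as used when `assert` compiles to nothing under NDEBUG.

```c
#include <assert.h>

/* Hypothetical call whose result is only checked in debug builds. */
static int do_work(int x)
{
    return x >= 0;
}

int run(int x)
{
    int ok = do_work(x);

    assert(ok);   /* compiled out when NDEBUG is defined */
    (void)ok;     /* keeps 'ok' referenced in release builds */

    return x * 2;
}
```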

13 years agomesa: Initialize member variables in ir_to_mesa_src_reg constructor.
Vinson Lee [Sun, 22 Aug 2010 07:09:43 +0000 (00:09 -0700)]
mesa: Initialize member variables in ir_to_mesa_src_reg constructor.

The default constructor did not initialize some member variables.

13 years agomesa: Initialize variables in mesa_src_reg_from_ir_src_reg.
Vinson Lee [Sun, 22 Aug 2010 06:56:24 +0000 (23:56 -0700)]
mesa: Initialize variables in mesa_src_reg_from_ir_src_reg.

13 years agoutil: Use #ifdef instead of #if.
Vinson Lee [Sun, 22 Aug 2010 06:36:30 +0000 (23:36 -0700)]
util: Use #ifdef instead of #if.

This is a typo fix of earlier commit 0f3b3751b8643352dcc242567b3696bd1505df1d.
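
The difference matters when a macro is defined without a value. The sketch below uses a hypothetical macro name; the macro actually involved in the referenced commit is not shown in this log.

```c
/* Hypothetical macro, defined with no value. */
#define EXAMPLE_DEBUG

int debug_enabled(void)
{
/* "#if EXAMPLE_DEBUG" would expand to a bare "#if" with no expression
 * and fail to compile; "#ifdef" only tests whether the name is defined. */
#ifdef EXAMPLE_DEBUG
    return 1;
#else
    return 0;
#endif
}
```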

13 years agoutil: Define dump_cpu only for DEBUG builds.
Vinson Lee [Sun, 22 Aug 2010 06:28:52 +0000 (23:28 -0700)]
util: Define dump_cpu only for DEBUG builds.

dump_cpu is used only when DEBUG is defined.

Fixes the following GCC warning on builds without DEBUG defined.
util/u_cpu_detect.c:76: warning: 'debug_get_option_dump_cpu' defined but not used

13 years agotranslate_sse: Silence uninitialized variable warnings.
Vinson Lee [Sun, 22 Aug 2010 06:24:28 +0000 (23:24 -0700)]
translate_sse: Silence uninitialized variable warnings.

Initialize variables on error paths.

13 years agonvfx: Silence uninitialized variable warnings.
Vinson Lee [Sun, 22 Aug 2010 05:59:46 +0000 (22:59 -0700)]
nvfx: Silence uninitialized variable warnings.

Variables weren't initialized on the error paths.

13 years agoi965g: Silence printf format warnings on 64-bit builds.
Vinson Lee [Sun, 22 Aug 2010 05:45:09 +0000 (22:45 -0700)]
i965g: Silence printf format warnings on 64-bit builds.

13 years agonvfx: Silence uninitialized variable warnings.
Vinson Lee [Sun, 22 Aug 2010 05:09:47 +0000 (22:09 -0700)]
nvfx: Silence uninitialized variable warnings.

Silence the following i686-apple-darwin10-gcc-4.2.1 warnings.
nv04_2d.c: In function 'nv04_region_copy_cpu':
nv04_2d.c:560: warning: 'dswy' may be used uninitialized in this function
nv04_2d.c:559: warning: 'dswx' may be used uninitialized in this function
nv04_2d.c:562: warning: 'sswy' may be used uninitialized in this function
nv04_2d.c:561: warning: 'sswx' may be used uninitialized in this function

13 years agonv50: Silence incompatible pointer type initialization warning.
Vinson Lee [Sun, 22 Aug 2010 05:01:04 +0000 (22:01 -0700)]
nv50: Silence incompatible pointer type initialization warning.

Silence the following GCC warning.
warning: initialization from incompatible pointer type

13 years agonv50: Disable unused code.
Vinson Lee [Sun, 22 Aug 2010 04:42:17 +0000 (21:42 -0700)]
nv50: Disable unused code.

Disable release_hw and emit_mov_from_pred functions as they are
currently not being used.

13 years agoi965g: Fix printf format warning on 32-bit platforms.
Vinson Lee [Sun, 22 Aug 2010 04:27:43 +0000 (21:27 -0700)]
i965g: Fix printf format warning on 32-bit platforms.

Fixes the following GCC warning on 32-bit platforms.
warning: format '%li' expects type 'long int', but argument 4 has type 'int'
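
The warning arises because `long` and `int` have the same size on some platforms but are still distinct types. A small sketch (hypothetical function, not the i965g code) with conversion specifiers that match their arguments:

```c
#include <stdio.h>

/* Format an int and a long with matching conversion specifiers. */
int format_sizes(char *buf, size_t len, int count, long total)
{
    /* %i matches int and %li matches long; passing an int for %li is
     * what produces the warning quoted above. */
    return snprintf(buf, len, "count=%i total=%li", count, total);
}
```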

13 years agoglsl: Silence uninitialized variable warning.
Vinson Lee [Sun, 22 Aug 2010 03:38:07 +0000 (20:38 -0700)]
glsl: Silence uninitialized variable warning.

i686-apple-darwin10-gcc-4.2.1 generated the following warning.
warning: 'score' may be used uninitialized in this function

GCC 4.4.3 on Linux didn't generate the above warning.
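
A sketch of the usual pattern behind this kind of warning, using hypothetical names rather than the actual glsl code: a variable assigned only on some paths of a `switch` gets an initializer so every path returns a defined value.

```c
/* Hypothetical scoring function; names are illustrative only. */
enum match_kind { MATCH_EXACT, MATCH_CONVERTIBLE, MATCH_NONE };

int match_score(enum match_kind kind)
{
    int score = -1;   /* initialized so every path yields a defined value */

    switch (kind) {
    case MATCH_EXACT:
        score = 0;
        break;
    case MATCH_CONVERTIBLE:
        score = 1;
        break;
    case MATCH_NONE:
        break;        /* without the initializer, 'score' would be read unset */
    }
    return score;
}
```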

13 years agor600g: partialy fix texturing from depth buffer + initial support for untiling
Jerome Glisse [Sun, 22 Aug 2010 02:49:22 +0000 (22:49 -0400)]
r600g: partialy fix texturing from depth buffer + initial support for untiling

Partially fix texturing from the depth buffer. The depth buffer is tiled
following a different tile organisation than the color buffer. This
properly sets the tile type & array mode fields of the texture sampler
when sampling from a db resource.

Add initial support for untiling buffers when transferring them;
it's kind of broken, as it corrupts the vertex buffer of the previous
draw.

Signed-off-by: Jerome Glisse <jglisse@redhat.com>
13 years agodraw: Don't assert if indices point outside vertex buffer.
José Fonseca [Sun, 22 Aug 2010 01:26:09 +0000 (02:26 +0100)]
draw: Don't assert if indices point outside vertex buffer.

This is valid input, and asserting here causes the test suites that
verify this to crash.

Also, the assert was wrongly accepting the case

  max_index == vert_info->count

which, IIUC, is the first vertex outside the buffer, assuming
vert_info->count is precise (which often is not the case).
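
The off-by-one can be stated as a tiny predicate. The commit removes the assert rather than adding a check like this, so the function below is purely a hypothetical illustration of the boundary.

```c
/* Valid indices are 0 .. count-1, so max_index == count is already the
 * first vertex outside the buffer. */
int max_index_in_bounds(unsigned max_index, unsigned count)
{
    return max_index < count;
}
```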

13 years agomesa: Removed another unused variable.
José Fonseca [Sat, 21 Aug 2010 14:04:47 +0000 (15:04 +0100)]
mesa: Removed another unused variable.

13 years agoglsl: Silence unused variable warning.
Vinson Lee [Sat, 21 Aug 2010 23:21:41 +0000 (16:21 -0700)]
glsl: Silence unused variable warning.

The variable is actually used but only in the body of an assert.

13 years agoutil: Silence uninitialized variable warnings.
Vinson Lee [Sat, 21 Aug 2010 22:48:25 +0000 (15:48 -0700)]
util: Silence uninitialized variable warnings.

13 years agoglsl: Handle array declarations in function parameters.
Kenneth Graunke [Sat, 21 Aug 2010 22:30:34 +0000 (15:30 -0700)]
glsl: Handle array declarations in function parameters.

The 'vec4[12] foo' style already worked, but the 'vec4 foo[12]' style
did not.  Also, 'vec4[] foo' was wrongly accepted.

Fixes piglit test cases array-19.vert and array-21.vert.

May fix fd.o bug #29684 (or at least part of it).

13 years agonvfx: actually fix it properly
Luca Barbieri [Sat, 21 Aug 2010 21:53:39 +0000 (23:53 +0200)]
nvfx: actually fix it properly

13 years agonvfx: fix incorrect assert
Luca Barbieri [Sat, 21 Aug 2010 21:33:51 +0000 (23:33 +0200)]
nvfx: fix incorrect assert

13 years agoutil: Move loop variable declaration outside for loop.
Vinson Lee [Sat, 21 Aug 2010 21:36:29 +0000 (14:36 -0700)]
util: Move loop variable declaration outside for loop.

Fixes build error with MSVC.
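
MSVC's C compiler of that era did not accept C99 declarations inside the `for` statement. A sketch of the portable C89 form, using a hypothetical function rather than the util code touched by the commit:

```c
/* Sum an array using C89-style declarations. */
int sum_values(const int *values, int n)
{
    int i;            /* declared at the top of the block, not in the for */
    int total = 0;

    for (i = 0; i < n; i++)
        total += values[i];
    return total;
}
```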

13 years agonvfx: Fix SCons build.
Vinson Lee [Sat, 21 Aug 2010 21:29:50 +0000 (14:29 -0700)]
nvfx: Fix SCons build.

Move declarations before code.
Fix void pointer arithmetic.
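
Arithmetic on `void *` is a GNU extension that other compilers reject. A sketch of the portable form, with a hypothetical helper name:

```c
#include <stddef.h>

/* Return a pointer 'offset' bytes into 'base'. */
void *byte_offset(void *base, size_t offset)
{
    /* Cast to a byte pointer first; "base + offset" on a void* is not
     * standard C. */
    return (unsigned char *)base + offset;
}
```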

13 years agonvfx: fix warnings
Luca Barbieri [Sat, 21 Aug 2010 20:48:29 +0000 (22:48 +0200)]
nvfx: fix warnings

13 years agogallivm: Emit DIVPS instead of RCPPS.
José Fonseca [Sat, 21 Aug 2010 20:58:22 +0000 (21:58 +0100)]
gallivm: Emit DIVPS instead of RCPPS.

See comments for detailed rationale.

Thanks to Michal Krol and Zack Rusin for detecting and investigating this
in detail.

13 years agonvfx: enable translate_sse
Luca Barbieri [Sat, 21 Aug 2010 19:29:18 +0000 (21:29 +0200)]
nvfx: enable translate_sse

13 years agoauxiliary: Add missing files to SCons build.
Vinson Lee [Sat, 21 Aug 2010 19:32:17 +0000 (12:32 -0700)]
auxiliary: Add missing files to SCons build.

Add u_linear.c and u_linkages.c to SCons build.
Reorder list of files to be more alphabetical.

13 years agoauxiliary: Reorder list of files in Makefile.
Vinson Lee [Sat, 21 Aug 2010 19:21:59 +0000 (12:21 -0700)]
auxiliary: Reorder list of files in Makefile.

This patch reorders the list of files so that the order is more alphabetic.

13 years agoscons: Fix nvfx build.
Vinson Lee [Sat, 21 Aug 2010 19:00:57 +0000 (12:00 -0700)]
scons: Fix nvfx build.

13 years agonvfx: slightly improve handling of overlong vps
Luca Barbieri [Sat, 21 Aug 2010 18:23:41 +0000 (20:23 +0200)]
nvfx: slightly improve handling of overlong vps

13 years agonvfx: tweak CMP in fp
Luca Barbieri [Sat, 21 Aug 2010 18:14:35 +0000 (20:14 +0200)]
nvfx: tweak CMP in fp

13 years agonvfx: implement CMP in vp
Luca Barbieri [Sat, 21 Aug 2010 18:14:16 +0000 (20:14 +0200)]
nvfx: implement CMP in vp

13 years agonvfx: implement TXL in fp
Luca Barbieri [Sat, 21 Aug 2010 18:07:48 +0000 (20:07 +0200)]
nvfx: implement TXL in fp

13 years agonvfx: implement SSG in fp
Luca Barbieri [Sat, 21 Aug 2010 18:05:04 +0000 (20:05 +0200)]
nvfx: implement SSG in fp

13 years agonvfx: implement DP2 in vp and fp
Luca Barbieri [Sat, 21 Aug 2010 17:43:46 +0000 (19:43 +0200)]
nvfx: implement DP2 in vp and fp

13 years agonvfx: implement TRUNC in vp and fp
Luca Barbieri [Sat, 21 Aug 2010 17:35:06 +0000 (19:35 +0200)]
nvfx: implement TRUNC in vp and fp

13 years agonvfx: implement NOP
Luca Barbieri [Sat, 21 Aug 2010 17:45:06 +0000 (19:45 +0200)]
nvfx: implement NOP

13 years agonvfx: add vertex program control flow
Luca Barbieri [Sat, 21 Aug 2010 16:37:21 +0000 (18:37 +0200)]
nvfx: add vertex program control flow

13 years agonvfx: fix vertex shader headers
Luca Barbieri [Sat, 21 Aug 2010 16:37:01 +0000 (18:37 +0200)]
nvfx: fix vertex shader headers

13 years agonv40: add fragment program control flow
Luca Barbieri [Fri, 20 Aug 2010 19:16:49 +0000 (21:16 +0200)]
nv40: add fragment program control flow

13 years agonvfx: refactor shader assembler
Luca Barbieri [Sat, 21 Aug 2010 10:32:59 +0000 (12:32 +0200)]
nvfx: refactor shader assembler

13 years agonvfx: add option to dump shaders in TGSI and native code
Luca Barbieri [Sat, 21 Aug 2010 11:28:38 +0000 (13:28 +0200)]
nvfx: add option to dump shaders in TGSI and native code

13 years agonvfx: improve and correct nvfx_shader.h
Luca Barbieri [Thu, 25 Feb 2010 16:46:37 +0000 (17:46 +0100)]
nvfx: improve and correct nvfx_shader.h

13 years agonvfx: fix lodbias
Luca Barbieri [Thu, 19 Aug 2010 20:47:03 +0000 (22:47 +0200)]
nvfx: fix lodbias

13 years agonvfx: mostly fix inline corruption magically
Luca Barbieri [Thu, 19 Aug 2010 20:36:00 +0000 (22:36 +0200)]
nvfx: mostly fix inline corruption magically

Not sure why this mostly works.

13 years agonvfx: fix GPU hardlocks when depth buffer is absent
Luca Barbieri [Thu, 19 Aug 2010 10:58:14 +0000 (12:58 +0200)]
nvfx: fix GPU hardlocks when depth buffer is absent

13 years agonvfx: fire ring after transfers
Luca Barbieri [Mon, 16 Aug 2010 23:01:42 +0000 (01:01 +0200)]
nvfx: fire ring after transfers

Might reduce the risk of running out of memory

13 years agonv30: band-aid viewport issues
Luca Barbieri [Mon, 16 Aug 2010 14:55:00 +0000 (16:55 +0200)]
nv30: band-aid viewport issues

For some reason nv30 seems to like to reset the viewport, even though
attempts to isolate where exactly it does that have currently been
inconclusive.

13 years agonvfx: support flatshade_first
Luca Barbieri [Sun, 15 Aug 2010 08:15:40 +0000 (10:15 +0200)]
nvfx: support flatshade_first

13 years agonvfx: expose GLSL
Luca Barbieri [Sat, 13 Mar 2010 01:28:59 +0000 (02:28 +0100)]
nvfx: expose GLSL

Still no control flow support, but basic stuff works.

13 years agonvfx: support proper shader linkage - adds glsl support
Luca Barbieri [Tue, 10 Aug 2010 21:09:53 +0000 (23:09 +0200)]
nvfx: support proper shader linkage - adds glsl support

13 years agonvfx: rewrite draw code and buffer code
Luca Barbieri [Sat, 7 Aug 2010 03:39:18 +0000 (05:39 +0200)]
nvfx: rewrite draw code and buffer code

This is a full rewrite of the drawing and buffer management logic.

It offers a lot of improvements:
1. A copy of buffers is now always kept in system memory. This is
   necessary to allow software processing of them, which is necessary
   or improves performance in many cases.
2. Support for pushing vertices on the FIFO, with index lookup if necessary.
3. "Smart" draw code that tries to intelligently choose the cheapest
   way to draw something: whether to use inline vertices or hardware
   vertex buffer, and whether to use hardware index buffers
4. Support for all vertex formats supported by the hardware
5. Usage of translate to push vertices, supporting all formats that are
   sensible to use as vertex formats
6. Support for base vertex
7. Usage of Ben Skeggs' primitive splitter originally for nv50, allowing
   correct splitting of line loops, triangle fans, etc.
8. Support for instancing
9. Precomputation using the vertex elements CSO

Thanks to Ben Skeggs for his primitive splitter originally for nv50.

Thanks to Christoph Bumiller for his nv50 push code, which was the basis
of this work, even though I changed his code dramatically, in particular
to replace his ad-hoc vertex data emitter with translate.

The changes could go into nv50 too, but there are substantial
differences due to the additional nv50 hardware features.

13 years agonvfx: refactor sampling code, add support for swizzles and depth tex
Luca Barbieri [Sat, 7 Aug 2010 01:47:25 +0000 (03:47 +0200)]
nvfx: refactor sampling code, add support for swizzles and depth tex

This is a significant refactoring of the sampling code that:
- Moves all generic functions in nvfx_fragtex.c
- Adds a driver-specific sampler view structure and uses it to
  precompute texture setup as it should be done
- Unifies a bit more of code between nv30 and nv40
- Adds support for sampler view swizzles
- Support for specifying a sampler view format different from the
  resource one (only trivially)
- Support for sampler view specification of first and last level
- Support for depth textures on nv30, both for reading depth and
  for compare
- Support for sRGB textures
- Unifies the format table between nv30 and nv40
- Expands the format table to include essentially all supportable formats
  except mixed sign and "autonormal" formats
- Fixes the "is format supported" logic, which was quite broken, and
  makes it use the format table

Only tested on nv30 currently.

13 years agonvfx: new 2D: unify textures and buffers
Luca Barbieri [Tue, 3 Aug 2010 20:49:19 +0000 (22:49 +0200)]
nvfx: new 2D: unify textures and buffers

Stop using the vtbl, and use real transfers for buffers too.

13 years agonvfx: new 2D: use a CPU copy for up to 4 pixels, up from 0
Luca Barbieri [Sun, 18 Apr 2010 14:43:19 +0000 (16:43 +0200)]
nvfx: new 2D: use a CPU copy for up to 4 pixels, up from 0

Seems a reasonable threshold for now.

Significantly speeds up Piglit's 1x1 glReadPixels (but, you know,
reading pixels in 1x1 blocks is NOT a good idea, especially if you
might be running on a less-than-perfect driver).
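
The threshold described above amounts to a pixel-count check. The sketch below uses hypothetical names and assumes the decision depends only on width times height; the driver's real condition may consider more than that.

```c
/* Hypothetical threshold: prefer a CPU copy for tiny regions. */
#define CPU_COPY_MAX_PIXELS 4

int use_cpu_copy(unsigned width, unsigned height)
{
    /* Widen before multiplying so large regions cannot wrap around. */
    return (unsigned long long)width * height <= CPU_COPY_MAX_PIXELS;
}
```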

13 years agonvfx: new 2D: new render temporaries with resources
Luca Barbieri [Tue, 3 Aug 2010 03:47:41 +0000 (05:47 +0200)]
nvfx: new 2D: new render temporaries with resources

This patch adds support for creating temporary surfaces to allow
rendering to surfaces that cannot be rendered to.
It uses the _second_ version of the render temporary infrastructure.

This is necessary for swizzled 3D textures and small mipmaps of
swizzled 2D textures.

This version of the patch creates a resource to use as a temporary
instead of a raw BO, making the code simpler.