mesa.git
11 years agoi965: Make INTEL_DEBUG=shader_time use the RAW surface format.
Kenneth Graunke [Thu, 7 Feb 2013 07:26:36 +0000 (23:26 -0800)]
i965: Make INTEL_DEBUG=shader_time use the RAW surface format.

Untyped Atomic Operation messages are illegal for non-RAW formats.  The
IVB hardware proceeds happily (after all, who cares what the format of the
surface is if you're doing untyped ops on it?), but later hardware
apparently doesn't.  The simulator for gen7 does complain, though.

v2: Rebase against updates to previous patches. (by anholt)

NOTE: This is a candidate for the 9.1 branch.
Reviewed-by: Eric Anholt <eric@anholt.net>
Reviewed-by: Paul Berry <stereotype441@gmail.com>
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
11 years agoi965: Specialize SURFACE_STATE creation for shader time.
Kenneth Graunke [Thu, 7 Feb 2013 07:26:35 +0000 (23:26 -0800)]
i965: Specialize SURFACE_STATE creation for shader time.

This is basically a copy and paste of gen7_create_constant_surface, but
with the parameters filled in to offer a simpler interface.

It will diverge shortly.

I didn't bother adding it to the vtable for now since shader time is only
exposed on Gen7+.

v2: Replace tabs in the new code (by anholt)
    Add back dropped memset() and add a comment about HSW channel selects.

NOTE: This is a candidate for the 9.1 branch.
Reviewed-by: Eric Anholt <eric@anholt.net>
Reviewed-by: Paul Berry <stereotype441@gmail.com>
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
11 years agoi965: Fix INTEL_DEBUG=shader_time for Haswell.
Kenneth Graunke [Thu, 7 Feb 2013 07:26:34 +0000 (23:26 -0800)]
i965: Fix INTEL_DEBUG=shader_time for Haswell.

Haswell's "Data Cache" data port is a single unit, but split into two
SFIDs to allow for more message types without adding more bits in the
message descriptor.

Untyped Atomic Operations are now message 0010 in the second data cache
data port, rather than 6 in the first.

v2: Use the #defines from the previous commit. (by anholt)

NOTE: This is a candidate for the 9.1 branch.
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Eric Anholt <eric@anholt.net> (v1)
11 years agoi965: Add definitions for gen7+ data cache messages.
Eric Anholt [Mon, 11 Mar 2013 21:56:38 +0000 (14:56 -0700)]
i965: Add definitions for gen7+ data cache messages.

We were sparsely using some of these message types, but I'll just fill
them all in now.  It will be used for fixing shader_time on HSW.

v2: Add missing MEDIA_BLOCK_READ.

NOTE: This is a candidate for the 9.1 branch.
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
11 years agoi965: Split shader_time entries into separate cachelines.
Eric Anholt [Mon, 11 Mar 2013 19:59:06 +0000 (12:59 -0700)]
i965: Split shader_time entries into separate cachelines.

This avoids some snooping overhead between EUs processing separate shaders
(so VS versus FS).

Improves performance of a minecraft trace with shader_time by 28.9% +/-
18.3% (n=7), and performance of my old GLSL demo by 93.7% +/- 0.8% (n=4).

v2: Add a define for the stride with a comment explaining its units and
    why.

Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
11 years agoscons: Define _ALLOW_KEYWORD_MACROS on MSVC builds.
José Fonseca [Thu, 14 Mar 2013 17:40:14 +0000 (17:40 +0000)]
scons: Define _ALLOW_KEYWORD_MACROS on MSVC builds.

scons/llvm.py defines inline globally to workaround issues with LLVM C
binding headers, so the only way to is to avoid
aggravating xkeycheck.h errors is to set _ALLOW_KEYWORD_MACROS.

This fixes MSVC 2012 build with LLVM.

Reviewed-by: Brian Paul <brianp@vmware.com>
11 years agosoftpipe: Shrink context size.
José Fonseca [Thu, 14 Mar 2013 11:44:21 +0000 (11:44 +0000)]
softpipe: Shrink context size.

- each softpipe_tex_tile_cache 50*64*64*4*4 = 3,276,800 bytes
- each softpipe_context has 3*32 softpipe_tex_tile_cache, i.e, each softpipe
  context is 314,572,800 bytes, i.e, 300MB

That is, in a 32bits process (around 3GB virtual memory max), we can
only fit 10 contexts.

This change is a short-term hack to shrink the context size.  Longer
term we'll need to change how the texture cache works.

Reviewed-by: Brian Paul <brianp@vmware.com>
11 years agoradeon/llvm: fix LLVM dependencies
Christian König [Thu, 14 Mar 2013 11:37:02 +0000 (12:37 +0100)]
radeon/llvm: fix LLVM dependencies

Since commit 1c4f283151b191c51cbd76d7f304cc1fe7be3019 we obvious depend on this.

Signed-off-by: Christian König <christian.koenig@amd.com>
11 years agomesa: Fix FB blitting in case of zero size src or dst rect
Anuj Phogat [Thu, 7 Mar 2013 22:05:38 +0000 (14:05 -0800)]
mesa: Fix FB blitting in case of zero size src or dst rect

Framebuffer blitting operation should be skipped if any of the
dimensions (width/height) of src/dst rect is zero.

V2: Move the dimension check after error checking in _mesa_BlitFramebuffer.

Fixes: fbblit(negative.nullblit.zeroSize) in Intel oglconform
https://bugs.freedesktop.org/show_bug.cgi?id=59495

Note: Candidate for all the stable branches.

Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com>
Reviewed-by: Paul Berry <stereotype441@gmail.com>
11 years agotgsi: fix sample_d emit for arrays
Roland Scheidegger [Wed, 13 Mar 2013 21:10:18 +0000 (22:10 +0100)]
tgsi: fix sample_d emit for arrays

Those cases were apparently forgotten.

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
11 years agollvmpipe: don't assert when trying to render to surfaces with multiple layers
Roland Scheidegger [Wed, 13 Mar 2013 20:23:18 +0000 (21:23 +0100)]
llvmpipe: don't assert when trying to render to surfaces with multiple layers

instead just warn when creating the surface, rendering will simply happen
to first layer.

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
11 years agosoftpipe: don't assert when creating surfaces with multiple layers
Roland Scheidegger [Wed, 13 Mar 2013 20:19:20 +0000 (21:19 +0100)]
softpipe: don't assert when creating surfaces with multiple layers

We can't handle them yet, however we can safely just warn (we will
just render to first layer, which is fine since we can't handle
rendertarget system value neither).
Also make behavior more predictable with buffer surfaces
(it would sometimes hit bogus asserts because of the union in the surface,
instead create the surface but assert when trying to set a buffer
in the framebuffer).

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
11 years agollvmpipe: Fix geometry shader token leak.
José Fonseca [Wed, 13 Mar 2013 21:21:17 +0000 (21:21 +0000)]
llvmpipe: Fix geometry shader token leak.

Trivial. Matches softpipe's code.

11 years agoradeon/llvm: Add missing license headers
Tom Stellard [Thu, 7 Mar 2013 21:51:14 +0000 (16:51 -0500)]
radeon/llvm: Add missing license headers

Signed-off-by: Tom Stellard <thomas.stellard@amd.com>
11 years agoradeon/llvm: Make radeon_llvm_util.cpp a C file
Tom Stellard [Thu, 7 Mar 2013 21:51:13 +0000 (16:51 -0500)]
radeon/llvm: Make radeon_llvm_util.cpp a C file

All the functions in this file are now implemented in C.

11 years agoradeon/llvm: Optimize radeon_llvm_strip_unused_kernels()
Tom Stellard [Thu, 7 Mar 2013 21:51:12 +0000 (16:51 -0500)]
radeon/llvm: Optimize radeon_llvm_strip_unused_kernels()

Just delete unused kernels rather than marking them as internal and
running the GlobalDCE pass.

Also implement this function in C and inline it into
radeon_llvm_get_kernel_module()

11 years agoradeon/llvm: Implement radeon_llvm_get_kernel_module() using the C API
Tom Stellard [Thu, 7 Mar 2013 21:51:11 +0000 (16:51 -0500)]
radeon/llvm: Implement radeon_llvm_get_kernel_module() using the C API

11 years agoradeon/llvm: Implement radeon_llvm_get_num_kernels() using the C API
Tom Stellard [Thu, 7 Mar 2013 21:51:10 +0000 (16:51 -0500)]
radeon/llvm: Implement radeon_llvm_get_num_kernels() using the C API

11 years agoradeon/llvm: Implement radeon_llvm_parse_bitcode() using C API
Tom Stellard [Thu, 7 Mar 2013 21:51:09 +0000 (16:51 -0500)]
radeon/llvm: Implement radeon_llvm_parse_bitcode() using C API

Also make the function static since it is not used anywhere else.

11 years agor600g/llvm: Move llvm wrapper functions into the radeon directory
Tom Stellard [Thu, 7 Mar 2013 21:51:08 +0000 (16:51 -0500)]
r600g/llvm: Move llvm wrapper functions into the radeon directory

11 years agoProperly check GLX_INDIRECT_RENDERING in glapi/tests/check_table
Jon TURNEY [Wed, 27 Feb 2013 15:32:37 +0000 (15:32 +0000)]
Properly check GLX_INDIRECT_RENDERING in glapi/tests/check_table

Actually use $DEFINES, so we can see if GLX_INDIRECT_RENDERING is defined

If GLX_INDIRECT_RENDERING is defined,  _GLAPI_SKIP_PROTO_ENTRY_POINTS will
be defined, and libglapi won't contain the 'protocol entry points', so we
should provide stubs in check_table.cpp

11 years agoFix glapi/tests/check_table.cpp for standardized OpenGL function names
Jon TURNEY [Wed, 27 Feb 2013 12:58:17 +0000 (12:58 +0000)]
Fix glapi/tests/check_table.cpp for standardized OpenGL function names

It looks like this has been broken since commit
1a1db1746db82efc7f0643508886dfc78a15eb71 "Standardize names of OpenGL
functions."

Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk>
11 years agoFix out-of-tree build of 'make check' in src/mapi/glapi/tests/
Jon TURNEY [Tue, 26 Feb 2013 16:02:13 +0000 (16:02 +0000)]
Fix out-of-tree build of 'make check' in src/mapi/glapi/tests/

Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk>
11 years agoscons: Define PACKAGE_VERSION/BUGREPORT globally.
José Fonseca [Wed, 13 Mar 2013 13:13:08 +0000 (13:13 +0000)]
scons: Define PACKAGE_VERSION/BUGREPORT globally.

Fixes the scons build.

11 years agotests: Add $(top_srcdir)/include to AM_CPPFLAGS.
Vinson Lee [Wed, 13 Mar 2013 05:32:47 +0000 (22:32 -0700)]
tests: Add $(top_srcdir)/include to AM_CPPFLAGS.

Fixes this build error with make check.

  CC     collision.o
In file included from ../../../../../src/mesa/main/hash_table.h:34:0,
                 from collision.c:31:
../../../../../src/mesa/main/compiler.h:51:53: fatal error: c99_compat.h: No such file or directory

Signed-off-by: Vinson Lee <vlee@freedesktop.org>
11 years agoscons: Define PACKAGE_xxx
José Fonseca [Wed, 13 Mar 2013 01:25:30 +0000 (01:25 +0000)]
scons: Define PACKAGE_xxx

Should get the builds going again.

11 years agodocs: rewrite the OSMesa info / instructions
Brian Paul [Tue, 12 Mar 2013 00:31:22 +0000 (18:31 -0600)]
docs: rewrite the OSMesa info / instructions

Reviewed-by: José Fonseca <jfonseca@vmware.com>
11 years agoconfigure: wire-up new OSMesa gallium state tracker and target
Brian Paul [Tue, 12 Mar 2013 00:31:21 +0000 (18:31 -0600)]
configure: wire-up new OSMesa gallium state tracker and target

Reviewed-by: José Fonseca <jfonseca@vmware.com>
11 years agotarget/osmesa: add new Makefile.am
Brian Paul [Tue, 12 Mar 2013 00:31:21 +0000 (18:31 -0600)]
target/osmesa: add new Makefile.am

Reviewed-by: José Fonseca <jfonseca@vmware.com>
11 years agotargets/osmesa: new OSMesa gallium target
Brian Paul [Tue, 12 Mar 2013 00:31:21 +0000 (18:31 -0600)]
targets/osmesa: new OSMesa gallium target

Reviewed-by: José Fonseca <jfonseca@vmware.com>
11 years agost/osmesa: add new Makefile.am
Brian Paul [Tue, 12 Mar 2013 00:31:21 +0000 (18:31 -0600)]
st/osmesa: add new Makefile.am

Reviewed-by: José Fonseca <jfonseca@vmware.com>
11 years agost/osmesa: new OSMesa gallium state tracker
Brian Paul [Tue, 12 Mar 2013 00:31:21 +0000 (18:31 -0600)]
st/osmesa: new OSMesa gallium state tracker

Reviewed-by: José Fonseca <jfonseca@vmware.com>
11 years agost/mesa: add PIPE_FORMAT_R16G16B16A16_UNORM renderbuffer support
Brian Paul [Tue, 12 Mar 2013 00:31:21 +0000 (18:31 -0600)]
st/mesa: add PIPE_FORMAT_R16G16B16A16_UNORM renderbuffer support

To allow rendering in 16-bit/channel RGBA buffers.

Reviewed-by: José Fonseca <jfonseca@vmware.com>
11 years agoscons: Re-add ','
José Fonseca [Wed, 13 Mar 2013 00:31:03 +0000 (00:31 +0000)]
scons: Re-add ','

11 years agoautotools: Add missing top-level include dir.
José Fonseca [Wed, 13 Mar 2013 00:16:24 +0000 (00:16 +0000)]
autotools: Add missing top-level include dir.

Fixes autotools build failure.  Not sure if there are more, as I have
difficulties in building the full tree.

11 years agoconfigure.ac: Alphabetize freedreno makefiles.
Matt Turner [Wed, 13 Mar 2013 00:09:55 +0000 (17:09 -0700)]
configure.ac: Alphabetize freedreno makefiles.

11 years agobuild: Get rid of dead MESA_ASM_FILES variable
Matt Turner [Fri, 22 Feb 2013 00:51:19 +0000 (16:51 -0800)]
build: Get rid of dead MESA_ASM_FILES variable

Reviewed-by: Eric Anholt <eric@anholt.net>
11 years agomesa/build: Get rid of dead ALL_FILES variable
Matt Turner [Fri, 22 Feb 2013 00:51:03 +0000 (16:51 -0800)]
mesa/build: Get rid of dead ALL_FILES variable

Reviewed-by: Eric Anholt <eric@anholt.net>
11 years agoxmlpool/.gitignore: Remove 'Makefile'
Matt Turner [Fri, 22 Feb 2013 01:03:18 +0000 (17:03 -0800)]
xmlpool/.gitignore: Remove 'Makefile'

Handled by top level .gitignore.

Reviewed-by: Eric Anholt <eric@anholt.net>
11 years agomesa: Use PACKAGE_BUGREPORT macro.
Matt Turner [Sat, 9 Mar 2013 08:28:09 +0000 (00:28 -0800)]
mesa: Use PACKAGE_BUGREPORT macro.

Reviewed-by: Eric Anholt <eric@anholt.net>
11 years agomesa: Remove unused version #defines from version.h.
Matt Turner [Sat, 9 Mar 2013 08:23:20 +0000 (00:23 -0800)]
mesa: Remove unused version #defines from version.h.

Reviewed-by: Eric Anholt <eric@anholt.net>
11 years agomesa: Replace MESA_VERSION with PACKAGE_VERSION.
Matt Turner [Sat, 9 Mar 2013 08:25:45 +0000 (00:25 -0800)]
mesa: Replace MESA_VERSION with PACKAGE_VERSION.

One fewer place to have to update.

Reviewed-by: Eric Anholt <eric@anholt.net>
11 years agodraw/so: Fix stream output with geometry shaders
Zack Rusin [Tue, 12 Mar 2013 20:41:35 +0000 (13:41 -0700)]
draw/so: Fix stream output with geometry shaders

If geometry shader is present its stream output info should
be used instead of the vs and we shouldn't use the pre-clipped
corrdinates.

Signed-off-by: Zack Rusin <zackr@vmware.com>
Reviewed-by: José Fonseca <jfonseca@vmware.com>
11 years agoinclude: Fix build with VS 11 (i.e, 2012).
José Fonseca [Tue, 12 Mar 2013 20:37:47 +0000 (20:37 +0000)]
include: Fix build with VS 11 (i.e, 2012).

NOTE: Candidate for the stable branches.

Reviewed-by: Brian Paul <brianp@vmware.com>
11 years agomesa,gallium,egl,mapi: One definition of C99 inline/__func__ to rule them all.
José Fonseca [Tue, 12 Mar 2013 11:17:49 +0000 (11:17 +0000)]
mesa,gallium,egl,mapi: One definition of C99 inline/__func__ to rule them all.

We were in four already...

NOTE: Candidate for the stable branches.

Reviewed-by: Brian Paul <brianp@vmware.com>
11 years agoscons: Allows choosing VS 10 or 11.
José Fonseca [Tue, 12 Mar 2013 20:33:38 +0000 (20:33 +0000)]
scons: Allows choosing VS 10 or 11.

NOTE: Candidate for the stable branches.

Reviewed-by: Brian Paul <brianp@vmware.com>
11 years agoradeonsi: Fix off-by-one for maximum vertex element index in some cases
Michel Dänzer [Tue, 12 Mar 2013 11:34:37 +0000 (12:34 +0100)]
radeonsi: Fix off-by-one for maximum vertex element index in some cases

In cases where the vertex element size is smaller than the vertex buffer
stride, the previous calculation could end up 1 too low. This would result
in the GPU using index 0 instead of the maximum index for those elements,
which would be visible as intermittent distorted triangles.

NOTE: This is a candidate for the 9.1 branch.

Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
11 years agonvc0: avoid crash on updating RASTERIZE_ENABLE state
Christoph Bumiller [Mon, 11 Mar 2013 19:53:25 +0000 (20:53 +0100)]
nvc0: avoid crash on updating RASTERIZE_ENABLE state

When doing a blit with the 3D engine, the rasterizer or zsa cso may
be NULL.

11 years agogallium/tests: check format in compute tests, make selectable
Christoph Bumiller [Fri, 1 Mar 2013 15:45:47 +0000 (16:45 +0100)]
gallium/tests: check format in compute tests, make selectable

11 years agonvc0: add MP trap handler for nve4
Christoph Bumiller [Sat, 9 Mar 2013 16:17:14 +0000 (17:17 +0100)]
nvc0: add MP trap handler for nve4

11 years agonvc0: they removed the NTID,NCTAID,GRIDID registers on nve4
Christoph Bumiller [Sat, 9 Mar 2013 11:11:38 +0000 (12:11 +0100)]
nvc0: they removed the NTID,NCTAID,GRIDID registers on nve4

11 years agonvc0: implement compute support for nve4
Christoph Bumiller [Sat, 23 Feb 2013 18:40:23 +0000 (19:40 +0100)]
nvc0: implement compute support for nve4

11 years agonvc0/ir: try to fix CAS (CompareAndSwap)
Christoph Bumiller [Mon, 11 Mar 2013 16:34:43 +0000 (17:34 +0100)]
nvc0/ir: try to fix CAS (CompareAndSwap)

11 years agonv50/ir: add CCTL (cache control) op
Christoph Bumiller [Mon, 11 Mar 2013 16:34:05 +0000 (17:34 +0100)]
nv50/ir: add CCTL (cache control) op

11 years agonvc0/ir/emit: fix emission of large address offsets
Christoph Bumiller [Mon, 11 Mar 2013 16:32:52 +0000 (17:32 +0100)]
nvc0/ir/emit: fix emission of large address offsets

11 years agonvc0: add SHADER/COMPUTE_RESOURCE bind flags to format table
Christoph Bumiller [Fri, 8 Mar 2013 21:40:30 +0000 (22:40 +0100)]
nvc0: add SHADER/COMPUTE_RESOURCE bind flags to format table

11 years agonouveau: align PIPE_BIND_SHADER,COMPUTE_RESOURCEs to 256 bytes
Christoph Bumiller [Sat, 2 Mar 2013 17:27:56 +0000 (18:27 +0100)]
nouveau: align PIPE_BIND_SHADER,COMPUTE_RESOURCEs to 256 bytes

11 years agonv50,nvc0: copy writable flag on surface creation
Christoph Bumiller [Fri, 1 Mar 2013 20:37:37 +0000 (21:37 +0100)]
nv50,nvc0: copy writable flag on surface creation

11 years agonv50/ir: add support for different sampler and resource index on nve4
Christoph Bumiller [Sat, 2 Mar 2013 20:00:26 +0000 (21:00 +0100)]
nv50/ir: add support for different sampler and resource index on nve4

And remove non-working code for indirect sampler/resource selection.
Will be added back later.

Includes code from "nv50/ir/tgsi: Resource indirect indexing" by
Francisco Jerez (when mixing the R and S handles we can only specify
them via a register, i.e. indirectly, unless we upload all the used
handle combinations to c[] space, which we don't for now).

11 years agonv50/ir: implement splitting of 64 bit ops after RA
Christoph Bumiller [Sat, 2 Mar 2013 13:59:06 +0000 (14:59 +0100)]
nv50/ir: implement splitting of 64 bit ops after RA

11 years agonvc0/ir: skip back edges when determining latest sched value
Christoph Bumiller [Thu, 28 Feb 2013 21:08:36 +0000 (22:08 +0100)]
nvc0/ir: skip back edges when determining latest sched value

11 years agonvc0/ir: use large issue delay after RET, too
Christoph Bumiller [Thu, 28 Feb 2013 18:07:24 +0000 (19:07 +0100)]
nvc0/ir: use large issue delay after RET, too

11 years agonv50/ir: fix size adjustment for sched info for multiple functions
Christoph Bumiller [Thu, 28 Feb 2013 18:00:02 +0000 (19:00 +0100)]
nv50/ir: fix size adjustment for sched info for multiple functions

11 years agonv50/ir: print function inputs and outputs
Christoph Bumiller [Wed, 27 Feb 2013 20:02:29 +0000 (21:02 +0100)]
nv50/ir: print function inputs and outputs

11 years agonv50/ir/ssa: add a few comments regarding RenamePass
Christoph Bumiller [Wed, 27 Feb 2013 14:32:35 +0000 (15:32 +0100)]
nv50/ir/ssa: add a few comments regarding RenamePass

11 years agonv50/ir/tgsi: Exclude local declarations from function prototypes.
Francisco Jerez [Mon, 25 Feb 2013 20:57:32 +0000 (21:57 +0100)]
nv50/ir/tgsi: Exclude local declarations from function prototypes.

11 years agonv50/ir/opt: try to make use of SUCLAMP addend
Christoph Bumiller [Mon, 25 Feb 2013 14:52:10 +0000 (15:52 +0100)]
nv50/ir/opt: try to make use of SUCLAMP addend

11 years agonv50/ir: don't assert on type in Modifier.applyTo if it is 0
Christoph Bumiller [Sun, 24 Feb 2013 17:36:44 +0000 (18:36 +0100)]
nv50/ir: don't assert on type in Modifier.applyTo if it is 0

11 years agonv50/ir: add support for barriers
Christoph Bumiller [Sat, 23 Feb 2013 12:09:32 +0000 (13:09 +0100)]
nv50/ir: add support for barriers

nv50 part by Francisco Jerez.

11 years agonv50/ir/tgsi: add support for atomics
Christoph Bumiller [Wed, 20 Feb 2013 20:33:38 +0000 (21:33 +0100)]
nv50/ir/tgsi: add support for atomics

11 years agonv50/ir/tgsi: handle TGSI_OPCODE_LOAD,STORE
Christoph Bumiller [Fri, 22 Feb 2013 23:39:23 +0000 (00:39 +0100)]
nv50/ir/tgsi: handle TGSI_OPCODE_LOAD,STORE

Squashed and (heavily) modified original patches by Francisco Jerez:
nv50/ir/tgsi: Implement resource LOAD/STORE (wip).
nv50/ir/tgsi: Emit SUST/SULD for surface access, and add CB LOAD/STORE support
nv50/ir/tgsi: Fix/clean up the LOAD/STORE handling code.

Left out for now:
nv50/ir/tgsi: Resource indirect indexing

Treating raw, read-only surfaces as constant buffers (CBs) was removed
because CBs are limited to a size of 64 KiB which isn't desireable, and
because this decision should probably be made by the state tracker.
If we used a number of CB slots for surfaces, it might find that we
cannot accomodate the advertised limit.

11 years agonvc0/ir: don't replace load from input in COMPUTE progs with VFETCH
Christoph Bumiller [Thu, 28 Feb 2013 20:05:45 +0000 (21:05 +0100)]
nvc0/ir: don't replace load from input in COMPUTE progs with VFETCH

11 years agonvc0/ir: implement lowering of surface ops for nve4
Christoph Bumiller [Fri, 22 Feb 2013 23:00:27 +0000 (00:00 +0100)]
nvc0/ir: implement lowering of surface ops for nve4

11 years agonvc0/ir: add formatted surface load lib code, move to extra header
Christoph Bumiller [Tue, 19 Feb 2013 21:12:01 +0000 (22:12 +0100)]
nvc0/ir: add formatted surface load lib code, move to extra header

OpenGL is nice and makes the user specify a format with an image unit.
OpenCL is evil and doesn't, and what's better than adding a huge load
of functions that we call indirectly to handle the conversion ?

11 years agonv50/ir: extend moveSources for delta < 0
Christoph Bumiller [Sun, 17 Feb 2013 11:01:55 +0000 (12:01 +0100)]
nv50/ir: extend moveSources for delta < 0

11 years agonvc0/ir: lower atomics in s[]
Christoph Bumiller [Fri, 22 Feb 2013 19:46:28 +0000 (20:46 +0100)]
nvc0/ir: lower atomics in s[]

11 years agonvc0/ir/emit: implement INSBF, EXTBF, PERMT and ATOM
Christoph Bumiller [Fri, 22 Feb 2013 19:35:32 +0000 (20:35 +0100)]
nvc0/ir/emit: implement INSBF, EXTBF, PERMT and ATOM

11 years agonv50/ir/emit: handle OP_ATOM
Christoph Bumiller [Wed, 20 Feb 2013 19:54:14 +0000 (20:54 +0100)]
nv50/ir/emit: handle OP_ATOM

11 years agonvc0/ir/target: some ops can't be predicated, e.g. CALL
Christoph Bumiller [Fri, 8 Mar 2013 18:08:23 +0000 (19:08 +0100)]
nvc0/ir/target: some ops can't be predicated, e.g. CALL

11 years agonv50/ir/opt: CALLs cannot load
Christoph Bumiller [Tue, 26 Feb 2013 20:05:03 +0000 (21:05 +0100)]
nv50/ir/opt: CALLs cannot load

11 years agonv50/ir: add support for indirect BRA,CALL
Christoph Bumiller [Fri, 22 Feb 2013 19:08:57 +0000 (20:08 +0100)]
nv50/ir: add support for indirect BRA,CALL

11 years agonvc0/ir/emit: implement move to and logic ops on predicates
Christoph Bumiller [Fri, 22 Feb 2013 18:10:20 +0000 (19:10 +0100)]
nvc0/ir/emit: implement move to and logic ops on predicates

11 years agonvc0/ir/emit: implement surface related ops
Christoph Bumiller [Fri, 22 Feb 2013 18:05:16 +0000 (19:05 +0100)]
nvc0/ir/emit: implement surface related ops

11 years agonv50/ir: initialize CodeEmitters' specialized target fields
Christoph Bumiller [Mon, 25 Feb 2013 11:52:43 +0000 (12:52 +0100)]
nv50/ir: initialize CodeEmitters' specialized target fields

11 years agonv50/ir/opt: make optimization aware of atomics, barriers, surface ops
Christoph Bumiller [Wed, 20 Feb 2013 20:03:30 +0000 (21:03 +0100)]
nv50/ir/opt: make optimization aware of atomics, barriers, surface ops

11 years agonv50/ir: add various new OPs that will be needed for compute
Christoph Bumiller [Fri, 22 Feb 2013 17:45:16 +0000 (18:45 +0100)]
nv50/ir: add various new OPs that will be needed for compute

11 years agonv50/ir: Rename "mkLoad" to "mkLoadv" for consistency.
Francisco Jerez [Fri, 18 May 2012 14:17:44 +0000 (16:17 +0200)]
nv50/ir: Rename "mkLoad" to "mkLoadv" for consistency.

11 years agonv50/ir: fix comparison of system values
Christoph Bumiller [Sun, 24 Feb 2013 17:36:21 +0000 (18:36 +0100)]
nv50/ir: fix comparison of system values

11 years agonv50/ir/tgsi: Translate grid-related system parameters.
Francisco Jerez [Tue, 6 Mar 2012 19:18:12 +0000 (20:18 +0100)]
nv50/ir/tgsi: Translate grid-related system parameters.

11 years agonv50/ir/tgsi: Accept COMPUTE programs.
Francisco Jerez [Mon, 14 Nov 2011 23:12:20 +0000 (00:12 +0100)]
nv50/ir/tgsi: Accept COMPUTE programs.

11 years agonv50/ir/ra: make sure all used function inputs get assigned a reg
Christoph Bumiller [Wed, 27 Feb 2013 20:08:57 +0000 (21:08 +0100)]
nv50/ir/ra: make sure all used function inputs get assigned a reg

A live range [0, 0) counts as empty. For function inputs this can
be a problem, so insert a nop at the beginning to make it [0, 1).
This is a bit of a hack but also the most simple solution.

11 years agonv50/ir/ra: also add pre-existing MERGE,SPLIT to constraint list
Christoph Bumiller [Mon, 25 Feb 2013 13:45:52 +0000 (14:45 +0100)]
nv50/ir/ra: also add pre-existing MERGE,SPLIT to constraint list

11 years agonv50/ir/ra: fix confusion with conditional RegisterSet::occupy
Christoph Bumiller [Wed, 6 Feb 2013 16:14:55 +0000 (17:14 +0100)]
nv50/ir/ra: fix confusion with conditional RegisterSet::occupy

11 years agonv50/ir/ra: swap copyCompound args if src is compound and dst isn't
Christoph Bumiller [Thu, 28 Feb 2013 22:41:41 +0000 (23:41 +0100)]
nv50/ir/ra: swap copyCompound args if src is compound and dst isn't

11 years agonv50/ir/ra: Fix maxGPR calculation for programs with multiple functions.
Francisco Jerez [Mon, 30 Apr 2012 13:22:27 +0000 (15:22 +0200)]
nv50/ir/ra: Fix maxGPR calculation for programs with multiple functions.

11 years agonv50/ir/ra: Fix traversal before the beginning of the active list in buildRIG.
Francisco Jerez [Mon, 30 Apr 2012 13:19:40 +0000 (15:19 +0200)]
nv50/ir/ra: Fix traversal before the beginning of the active list in buildRIG.

11 years agonv50/ir/ra: Fix RegisterSet::occupy(const Value *v).
Francisco Jerez [Mon, 30 Apr 2012 13:13:07 +0000 (15:13 +0200)]
nv50/ir/ra: Fix RegisterSet::occupy(const Value *v).

11 years agonv50/ir/ra: Fix argument const-ness in RegisterSet::idToUnits and idToBytes
Francisco Jerez [Mon, 30 Apr 2012 13:12:15 +0000 (15:12 +0200)]
nv50/ir/ra: Fix argument const-ness in RegisterSet::idToUnits and idToBytes

11 years agonv50/ir/opt: Fix tryPropagateBranch for BBs with several exit branches.
Francisco Jerez [Wed, 6 Feb 2013 13:12:44 +0000 (14:12 +0100)]
nv50/ir/opt: Fix tryPropagateBranch for BBs with several exit branches.

Comments and "if (bf->cfg.incidentCount() == 1)" condition added
by Christoph Bumiller.

11 years agonv50/ir: Clean up references to function values before destroying them.
Francisco Jerez [Mon, 30 Apr 2012 13:06:52 +0000 (15:06 +0200)]
nv50/ir: Clean up references to function values before destroying them.