mesa.git
12 years agor600g: Set RADEON_FLUSH_KEEP_TILING_FLAGS when emitting compute cs
Tom Stellard [Mon, 24 Sep 2012 18:34:02 +0000 (18:34 +0000)]
r600g: Set RADEON_FLUSH_KEEP_TILING_FLAGS when emitting compute cs

12 years agobuild: substitute X11_INCLUDES variable
Robert Bragg [Wed, 19 Sep 2012 15:12:08 +0000 (16:12 +0100)]
build: substitute X11_INCLUDES variable

There are a few automake files that reference $(X11_INCLUDES) such as
src/glx/Makefile.am but configure.ac wasn't declaring the variable for
substitution. This would break builds of glx if libxcb, for example, was
installed in its own prefix since AM_CFLAGS wouldn't coincidentally
list the needed include path in that case.

Reviewed-by: Matt Turner <mattst88@gmail.com>
12 years agoUse signbit() in IS_NEGATIVE and DIFFERENT_SIGNS
Matt Turner [Fri, 14 Sep 2012 23:04:40 +0000 (16:04 -0700)]
Use signbit() in IS_NEGATIVE and DIFFERENT_SIGNS

signbit() appears to be available everywhere (even MSVC according to
MSDN), so let's use it instead of open-coding some messy and confusing
bit twiddling macros.

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=54805
Reviewed-by: Paul Berry <stereotype441@gmail.com>
Suggested-by: Ian Romanick <ian.d.romanick@intel.com>
12 years agoclover: Silence narrowing conversion warnings in resource.cpp.
Francisco Jerez [Mon, 24 Sep 2012 16:33:35 +0000 (18:33 +0200)]
clover: Silence narrowing conversion warnings in resource.cpp.

12 years agoclover: Handle NULL value for clEnqueueNDRangeKernel local_work_size
Tom Stellard [Fri, 21 Sep 2012 20:45:50 +0000 (20:45 +0000)]
clover: Handle NULL value for clEnqueueNDRangeKernel local_work_size

[ Francisco Jerez: Slight simplification. ]

12 years agoi965/blorp: Increase Y alignment for multisampled stencil blits.
Paul Berry [Wed, 12 Sep 2012 18:13:49 +0000 (11:13 -0700)]
i965/blorp: Increase Y alignment for multisampled stencil blits.

This patch is a band-aid fix for a bug in commit 5fd67fa (i965/blorp:
Reduce alignment restrictions for stencil blits), which causes
multisampled stencil blits to work incorrectly on Sandy Bridge.

When blitting to or from a normal stencil buffer, we have to use a
coordinate transformation that swizzles coordinates to account for the
fact that stencil buffers use W tiling, but the most similar tiling
format available for textures and render targets is Y tiling.  The
differences between W and Y tiling cause pixels to be scrambled within
a block of size 8x4 (width x height) as measured relative to a W tile,
or 16x2 as measured relative to a Y tile.  So in order to make sure
that pixels at the edges of the blit aren't lost, we need to align the
rendering rectangle (and the buffer sizes) to multiples of the 8x4
block size.  This alignment happens in the brw_blorp_blit_params
constructor, whereas the determination of how to swizzle the
coordinates happens during code generation, in the
brw_blorp_blit_program class.

When blitting to or from a multisampled stencil buffer, the coordinate
swizzling is more complex, because it has to account for the
interleaving pattern of samples, which uses 4x4 blocks for 4x MSAA and
8x4 blocks for 8x MSAA.  The end result is that if multisampling is in
use, the 16x2 block size (relative so a Y tile) needs to be expanded
to 16x4, and the corresponding size relative to a W tile expands to
8x8.

The problem doesn't affect Ivy Bridge severely enough to crop up in
Piglit tests because on Ivy Bridge we have to disable multisampling
when blitting *to* a multisampled stencil buffer (the blorp compiler
generates code to compensate for the fact that multisampling is
disabled).  However I suspect a bug is still present because we don't
disable multisampling when blitting *from* a multisampled stencil
buffer.

This patch fixes the problem by doubling the vertical alignment
requirement when blitting to or from a multisampled stencil buffer,
and multisampling has not been disabled.

In the long run I would like to rework the brw_blorp_blit_params
constructor--it's difficult to follow and has had several subtle bugs
like this one.  However this band-aid fix should be suitable for
cherry-picking to release branches.

Fixes Piglit tests "unaligned-blit {2,4} stencil {msaa,upsample}" on
Sandy Bridge.

NOTE: This is a candidate for stable release branches.

Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
12 years agoupgrade glext.h to version 85
Brian Paul [Mon, 24 Sep 2012 14:06:56 +0000 (08:06 -0600)]
upgrade glext.h to version 85

NOTE: This is a candidate for the stable branches.

12 years agost/mesa: check for zero-size image in st_TestProxyTexImage()
Brian Paul [Fri, 21 Sep 2012 14:09:01 +0000 (08:09 -0600)]
st/mesa: check for zero-size image in st_TestProxyTexImage()

Fixes divide by zero issue in llvmpipe driver.

Reviewed-by: José Fonseca <jfonseca@vmware.com>
12 years agomesa: Silence narrowing warnings in ff_fragment_shader's emit_texenv().
Kenneth Graunke [Mon, 24 Sep 2012 05:38:58 +0000 (22:38 -0700)]
mesa: Silence narrowing warnings in ff_fragment_shader's emit_texenv().

Recent version of GCC report a warning for the implicit conversion from
int to float:

  ff_fragment_shader.cpp:897:3: warning: narrowing conversion of '(1 << ((int)rgb_shift))' from 'int' to 'float' inside { } is ill-formed in C++11 [-Wnarrowing]

This is because floats cannot precisely represent all possible 32-bit
integer values.  However, texenv code is all expected to be floating
point, so this should not be a problem.

Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
12 years agodocs: fixup GL4.3 TODO list
Marek Olšák [Sun, 23 Sep 2012 15:14:25 +0000 (17:14 +0200)]
docs: fixup GL4.3 TODO list

From the OpenGL Registry:
  "2012/08/13: specs named GL_ARB_debug_group, GL_ARB_debug_label, and
   GL_ARB_debug_output2 were published in error during the initial OpenGL 4.3
   release. All functionality in these documents was combined into
   the extension GL_KHR_debug. They have been withdrawn from the registry,
   and a few other extensions were renumbered to avoid holes in the numbering
   scheme."

12 years agoradeon/llvm: support for interpolation intrinsics
Vincent Lejeune [Thu, 6 Sep 2012 20:45:38 +0000 (22:45 +0200)]
radeon/llvm: support for interpolation intrinsics

Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
12 years agodraw: fix non-indexed draw calls if there's an index buffer
Marek Olšák [Fri, 14 Sep 2012 15:03:25 +0000 (17:03 +0200)]
draw: fix non-indexed draw calls if there's an index buffer

pipe_draw_info::indexed determines if it should be indexed and not
the presence of an index buffer.

This fixes crashes in r300g.

NOTE: This is a candidate for the stable branches.

Tested-by: Michel Dänzer <michel.daenzer@amd.com>
Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agor600g: Fix build with LLVM compiler
Tom Stellard [Sat, 22 Sep 2012 00:07:14 +0000 (20:07 -0400)]
r600g: Fix build with LLVM compiler

12 years agor600g: set QUANT_MODE on Cayman too
Marek Olšák [Tue, 18 Sep 2012 23:29:17 +0000 (01:29 +0200)]
r600g: set QUANT_MODE on Cayman too

This fixes piglit/fbo-blit-stretched.

Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
12 years agor600g: use CS helpers to emit streamout state
Marek Olšák [Tue, 18 Sep 2012 18:21:11 +0000 (20:21 +0200)]
r600g: use CS helpers to emit streamout state

Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
12 years agor600g: remove initialization of unused loop register tables
Marek Olšák [Tue, 18 Sep 2012 18:10:15 +0000 (20:10 +0200)]
r600g: remove initialization of unused loop register tables

Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
12 years agor600g: remove now-unused SURFACE_BASE_UPDATE logic
Marek Olšák [Tue, 18 Sep 2012 17:49:41 +0000 (19:49 +0200)]
r600g: remove now-unused SURFACE_BASE_UPDATE logic

Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
12 years agor600g: remove unused CB registers from register lists
Marek Olšák [Tue, 18 Sep 2012 17:46:59 +0000 (19:46 +0200)]
r600g: remove unused CB registers from register lists

Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
12 years agor600g: atomize framebuffer state
Marek Olšák [Tue, 18 Sep 2012 17:42:29 +0000 (19:42 +0200)]
r600g: atomize framebuffer state

Tested on RS880, Evergreen and Cayman.

Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
12 years agor600g: don't snoop context state while building shaders
Marek Olšák [Mon, 17 Sep 2012 21:22:00 +0000 (23:22 +0200)]
r600g: don't snoop context state while building shaders

Let's use the shader key describing the state.

Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
12 years agometa: Add on demand compilation of per target shader programs
Anuj Phogat [Thu, 20 Sep 2012 20:17:19 +0000 (13:17 -0700)]
meta: Add on demand compilation of per target shader programs

A call to glGenerateMipmap() follows the generation of a relevant
shader program in setup_glsl_generate_mipmap().

To support all texture targets and to avoid compiling shaders
everytime, per target shader programs are compiled on demand
and saved for the next call.

Fixes float-texture(mipmap.manual):
See Comment 6: https://bugs.freedesktop.org/show_bug.cgi?id=54296

NOTE: This is a candidate for stable branches.

Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com>
Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agoclover: Initialize height and depth to 1 for transfers
Tom Stellard [Mon, 17 Sep 2012 14:31:31 +0000 (14:31 +0000)]
clover: Initialize height and depth to 1 for transfers

Reviewed-by: Francisco Jerez <currojerez@riseup.net>
12 years agopipe-loader: Remove a few debug_printfs
Tom Stellard [Thu, 13 Sep 2012 14:53:32 +0000 (14:53 +0000)]
pipe-loader: Remove a few debug_printfs

On debug builds these were always being printed.

Reviewed-by: Francisco Jerez <currojerez@riseup.net>
12 years agoradeon/llvm: Handle loads from the constants address space.
Tom Stellard [Thu, 13 Sep 2012 15:21:42 +0000 (15:21 +0000)]
radeon/llvm: Handle loads from the constants address space.

Reading from constant memory is not supported yet, so constant reads use
global memory.

12 years agoradeon/llvm: Add support for v4f32 stores on R600
Tom Stellard [Thu, 13 Sep 2012 15:20:46 +0000 (15:20 +0000)]
radeon/llvm: Add support for v4f32 stores on R600

12 years agoradeon/llvm: Add support for i8 reads on R600
Tom Stellard [Thu, 13 Sep 2012 15:19:48 +0000 (15:19 +0000)]
radeon/llvm: Add support for i8 reads on R600

12 years agoradeon/llvm: Expand vector fadd and fmul on R600
Tom Stellard [Thu, 13 Sep 2012 15:14:26 +0000 (15:14 +0000)]
radeon/llvm: Expand vector fadd and fmul on R600

12 years agoradeon/llvm: Add optimization for FP_ROUND
Tom Stellard [Thu, 13 Sep 2012 15:08:40 +0000 (15:08 +0000)]
radeon/llvm: Add optimization for FP_ROUND

12 years agoradeon/llvm: Replace AMDGPU pow intrinsic with the llvm version
Tom Stellard [Thu, 13 Sep 2012 15:04:15 +0000 (15:04 +0000)]
radeon/llvm: Replace AMDGPU pow intrinsic with the llvm version

12 years agoi965/blorp: Fix narrowing warnings.
Paul Berry [Thu, 13 Sep 2012 03:51:07 +0000 (20:51 -0700)]
i965/blorp: Fix narrowing warnings.

Blorp has to convert rectangle coordinates from integers to floats in
order to send them down the GPU pipeline.  Recent versions of GCC
issue a warning for this, since a float is not capable of precisely
representing all possible 32-bit integer values.  Suppress the warning
with an explicit type cast in the case of blorp, since rectangle
coordinates will never be large enough to cause a loss of precision.

Reviewed-by: Eric Anholt <eric@anholt.net>
12 years agoi965: Remove brw_set_predicate_inverse(p, true) from scratch offset code
Kenneth Graunke [Thu, 20 Sep 2012 23:31:15 +0000 (16:31 -0700)]
i965: Remove brw_set_predicate_inverse(p, true) from scratch offset code

Given that it exists between a push/pop of instruction state, this call
can only affect the MOV or ADD instruction generated just below it.
Neither of those instructions are predicated, so it makes no sense to
ask for the inverse predicate.

This fixes grumblings from the simulator debugger, which was
complaining about an invalid predicate.

Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Paul Berry <stereotype441@gmail.com>
Reviewed-by: Eric Anholt <eric@anholt.net>
12 years agomesa: Don't override S3TC internalFormat if data is pre-compressed.
Kenneth Graunke [Wed, 19 Sep 2012 19:01:14 +0000 (12:01 -0700)]
mesa: Don't override S3TC internalFormat if data is pre-compressed.

Commit 42723d88d intended to override an S3TC internalFormat to a
generic compressed format when the application requested online
compression of uncompressed data.  Unfortunately, it also broke
pre-compressed textures when libtxc_dxtn isn't installed but the
extensions are forced on.

Both glCompressedTexImage2D() and glTexImage2D() call teximage(), which
calls _mesa_choose_texture_format(), hitting this override code.  If we
have actual S3TC source data, we can't treat it as any other format, and
need to avoid the override.

Since glCompressedTexImage2D() passes in a format of GL_NONE (which is
illegal for glTexImage), we can use that to detect the pre-compressed
case and avoid the overrides.

Fixes a regression since 42723d88d370a7599398cc1c2349aeb951ba1c57.

NOTE: This is a candidate for the 9.0 branch.

Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
Reviewed-and-tested-by: Jordan Justen <jordan.l.justen@intel.com>
12 years agoi965/blorp: Add support for blits between SRGB and linear formats.
Kenneth Graunke [Tue, 11 Sep 2012 23:20:43 +0000 (16:20 -0700)]
i965/blorp: Add support for blits between SRGB and linear formats.

Fixes colorspace issues in L4D2 when multisampling is enabled (the
scene was far too dark, but the flashlight area was way too bright).

The nVidia and AMD binary drivers both allow this kind of blit.

NOTE: This is a candidate for the 9.0 branch.

Reviewed-by: Paul Berry <stereotype441@gmail.com>
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
12 years agomesa: Ignore SRGB when determining compatible resolve formats.
Kenneth Graunke [Tue, 4 Sep 2012 18:29:30 +0000 (11:29 -0700)]
mesa: Ignore SRGB when determining compatible resolve formats.

MSAA resolves and other blit-like operations ignore SRGB state anyway,
so we should be able to safely allow resolves between compatible
SRGB/linear formats like SRGBA8 and RGBA8888.

This matches the behavior of the nVidia and AMD binary drivers.

Fixes completely black rendering when using multisampling in L4D2.

NOTE: This is a candidate for the 9.0 branch.

Reviewed-by: Paul Berry <stereotype441@gmail.com>
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
12 years agodocs: update some more FAQs
Andreas Boll [Thu, 20 Sep 2012 14:23:15 +0000 (16:23 +0200)]
docs: update some more FAQs

v2: remove mention of XFree86

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agodocs: remove utility.html
Andreas Boll [Thu, 20 Sep 2012 14:01:37 +0000 (16:01 +0200)]
docs: remove utility.html

This page is very old and some of the links are dead.

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agodocs: remove science.html
Andreas Boll [Thu, 20 Sep 2012 14:01:35 +0000 (16:01 +0200)]
docs: remove science.html

This page is very old and some of the links are dead.

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agodocs: remove modelers.html
Andreas Boll [Thu, 20 Sep 2012 14:01:32 +0000 (16:01 +0200)]
docs: remove modelers.html

This page is very old and some of the links are dead.

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agodocs: remove libraries.html
Andreas Boll [Thu, 20 Sep 2012 14:01:24 +0000 (16:01 +0200)]
docs: remove libraries.html

This page is very old and some of the links are dead.

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agodocs: remove games.html
Andreas Boll [Thu, 20 Sep 2012 14:01:21 +0000 (16:01 +0200)]
docs: remove games.html

This page is very old and some of the links are dead.

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agodocs/contents: add autoconf.html link
Andreas Boll [Thu, 20 Sep 2012 14:01:18 +0000 (16:01 +0200)]
docs/contents: add autoconf.html link

make it easier to find the docs/autoconf.html site

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agodocs: convert last traces of progs to mesa/demos repository
Andreas Boll [Thu, 20 Sep 2012 14:01:15 +0000 (16:01 +0200)]
docs: convert last traces of progs to mesa/demos repository

v2: fix typo

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agodocs: add IRC info
Andreas Boll [Thu, 20 Sep 2012 14:01:12 +0000 (16:01 +0200)]
docs: add IRC info

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agodocs/egl: improve markup
Andreas Boll [Thu, 20 Sep 2012 14:01:08 +0000 (16:01 +0200)]
docs/egl: improve markup

replace unordered list <ul> with defined list <dl>

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agodocs/autoconf: improve markup
Andreas Boll [Thu, 20 Sep 2012 14:01:03 +0000 (16:01 +0200)]
docs/autoconf: improve markup

replace unordered list <ul> with defined list <dl>

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agodocs/autoconf: remove obsolete demo options
Andreas Boll [Thu, 20 Sep 2012 14:00:52 +0000 (16:00 +0200)]
docs/autoconf: remove obsolete demo options

removed with commit 56c3cce2a199f7f79a48d7633431e1e80fcd4ba2
two years ago

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agodocs: improve quality of gears.png
Andreas Boll [Thu, 20 Sep 2012 13:22:37 +0000 (15:22 +0200)]
docs: improve quality of gears.png

Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agogallium: mention PIPE_TIMEOUT_INFINITE in the fence_finish() comment
Brian Paul [Wed, 19 Sep 2012 18:43:38 +0000 (12:43 -0600)]
gallium: mention PIPE_TIMEOUT_INFINITE in the fence_finish() comment

12 years agollvmpipe: fix overflow bug in total texture size computation
Brian Paul [Thu, 20 Sep 2012 15:13:37 +0000 (09:13 -0600)]
llvmpipe: fix overflow bug in total texture size computation

v2: use uint64_t for the total_size variable, per Jose.

Also add two earlier checks for exceeding the max texture size.
For example a 1K^3 RGBA volume would overflow the lpr->image_stride
variable.

Use simple algebra to avoid overflow in intermediate values.
So instead of "x * y > z" use "x > z / y".

This should work if we happen to be on a platform that doesn't have
64-bit types.

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
12 years agor600g/llvm: rs780/rs880 are r600 asics
Alex Deucher [Thu, 20 Sep 2012 15:16:36 +0000 (11:16 -0400)]
r600g/llvm: rs780/rs880 are r600 asics

Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
12 years agomesa: Allow glGetTexParameter of GL_TEXTURE_SRGB_DECODE_EXT
Ian Romanick [Tue, 18 Sep 2012 13:19:18 +0000 (15:19 +0200)]
mesa: Allow glGetTexParameter of GL_TEXTURE_SRGB_DECODE_EXT

This was already (correctly) supported for glGetSamplerParameter paths.

NOTE: This is a candidate for stable branches.

Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>
Reviewed-by: Brian Paul <brianp@vmware.com>
12 years agor300/compiler: Use precomputed q values in the register allocator
Tom Stellard [Thu, 6 Sep 2012 04:20:27 +0000 (00:20 -0400)]
r300/compiler: Use precomputed q values in the register allocator

12 years agor300g: Init regalloc state during context creation
Tom Stellard [Thu, 6 Sep 2012 04:20:27 +0000 (00:20 -0400)]
r300g: Init regalloc state during context creation

Initializing the regalloc state is expensive, and since it is always
the same for every compile we only need to initialize it once per
context.  This should help improve shader compile times for the driver.

12 years agor300/compiler: Don't create register classes for inputs
Tom Stellard [Mon, 3 Sep 2012 12:25:13 +0000 (08:25 -0400)]
r300/compiler: Don't create register classes for inputs

12 years agora: Add q_values parameter to ra_set_finalize()
Tom Stellard [Mon, 3 Sep 2012 14:43:45 +0000 (10:43 -0400)]
ra: Add q_values parameter to ra_set_finalize()

This allows the user to pass precomputed q values to the allocator.

Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
12 years agora: Clarify usage of ra_set_node_reg()
Tom Stellard [Mon, 3 Sep 2012 12:23:02 +0000 (08:23 -0400)]
ra: Clarify usage of ra_set_node_reg()

Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
12 years agor600g: Invalidate texture cache when creating vertex buffers for compute v2
Tom Stellard [Wed, 1 Aug 2012 20:42:53 +0000 (20:42 +0000)]
r600g: Invalidate texture cache when creating vertex buffers for compute v2

Compute shaders fetch data from vertex buffers via the texture cache, so
we need to make sure the texture cache is flushed.

v2:
  - Fix rebase mistake
  - Fix spelling in comment

Reviewed-by: Marek Olšák <maraeo@gmail.com>
12 years agor600g: Use LOOP_START_DX10 for loops
Tom Stellard [Mon, 17 Sep 2012 14:33:56 +0000 (14:33 +0000)]
r600g: Use LOOP_START_DX10 for loops

LOOP_START_DX10 ignores the LOOP_CONFIG* registers, so it is not limited
to 4096 iterations like the other LOOP_* instructions.  Compute shaders
need to use this instruction, and since we aren't optimizing loops with
the LOOP_CONFIG* registers for pixel and vertex shaders, it seems like
we should just use it for everything.

Reviewed-by: Marek Olšák <maraeo@gmail.com>
12 years agor600g: Set the correct value of COLOR*_DIM for RATs
Tom Stellard [Thu, 13 Sep 2012 17:15:57 +0000 (17:15 +0000)]
r600g: Set the correct value of COLOR*_DIM for RATs

For buffers (which is what is being used for RATs), the
COLOR*_DIM.WIDTH_MASK field needs to be set to the low 16-bits of the
buffer size, and the COLOR*_DIM.HEIEGHT_MAX needs to be set to the
high bits.

Reviewed-by: Marek Olšák <maraeo@gmail.com>
12 years agor600g: Make sure to initialize DB_DEPTH_CONTROL register for compute
Tom Stellard [Thu, 13 Sep 2012 17:14:56 +0000 (17:14 +0000)]
r600g: Make sure to initialize DB_DEPTH_CONTROL register for compute

The kernel CS checker will fail if this register is not initialized.

Reviewed-by: Marek Olšák <maraeo@gmail.com>
12 years agor600g: Add some comments and debug printfs to compute code
Tom Stellard [Thu, 13 Sep 2012 14:37:53 +0000 (14:37 +0000)]
r600g: Add some comments and debug printfs to compute code

Reviewed-by: Marek Olšák <maraeo@gmail.com>
12 years agor600g: Add missing break to case statement
Tom Stellard [Wed, 19 Sep 2012 19:27:32 +0000 (15:27 -0400)]
r600g: Add missing break to case statement

12 years agoradeon/llvm: Emit ISA for ALU instructions in the R600 code emitter
Michal Sciubidlo [Wed, 12 Sep 2012 06:57:01 +0000 (08:57 +0200)]
radeon/llvm: Emit ISA for ALU instructions in the R600 code emitter

Signed-off-by: Tom Stellard <thomas.stellard@amd.com>
12 years agoradeon/llvm: Only support 512 constant registers on R600
Tom Stellard [Wed, 19 Sep 2012 16:23:20 +0000 (12:23 -0400)]
radeon/llvm: Only support 512 constant registers on R600

This is necessary upcoming encoding changes, since we will only be
using 9-bits for register encoding.

12 years agodocs: update faq
Andreas Boll [Wed, 19 Sep 2012 16:22:37 +0000 (18:22 +0200)]
docs: update faq

12 years agodocs: update sourcetree
Andreas Boll [Wed, 19 Sep 2012 16:22:31 +0000 (18:22 +0200)]
docs: update sourcetree

- add OpenCL state tracker Clover

- add XvMC state tracker

- remove progs
  directory got moved into its own repository mesa/demos

- remove vf
  directory removed with abda64efce73c18d49c74

12 years agodocs: remove obsolete r300c traces
Andreas Boll [Wed, 19 Sep 2012 16:22:19 +0000 (18:22 +0200)]
docs: remove obsolete r300c traces

12 years agoRevert "mesa: consolidate subtexture x/y/width/height error checking code"
Brian Paul [Wed, 19 Sep 2012 16:07:45 +0000 (10:07 -0600)]
Revert "mesa: consolidate subtexture x/y/width/height error checking code"

This reverts commit 5b807400a87d5efefc481017eb420b772933e1da.

accidentally pushed.

12 years agoRevert "more comment"
Brian Paul [Wed, 19 Sep 2012 16:07:34 +0000 (10:07 -0600)]
Revert "more comment"

This reverts commit 5205db6a7ce623a7fca72e6dc6391bd12be3f6aa.

accidentally pushed

12 years agoRevert "mesa: clean-up and fix glCompressedTexSubImage error checking"
Brian Paul [Wed, 19 Sep 2012 16:07:22 +0000 (10:07 -0600)]
Revert "mesa: clean-up and fix glCompressedTexSubImage error checking"

This reverts commit 0c67fe5d2dc6d8066fc23c39184d9614abf63992.

accidentally pushed.

12 years agodocs: fix "Cppyright" typo
Brian Paul [Wed, 19 Sep 2012 16:01:04 +0000 (10:01 -0600)]
docs: fix "Cppyright" typo

12 years agomesa: clean-up and fix glCompressedTexSubImage error checking
Brian Paul [Tue, 18 Sep 2012 21:51:33 +0000 (15:51 -0600)]
mesa: clean-up and fix glCompressedTexSubImage error checking

12 years agomore comment
Brian Paul [Tue, 18 Sep 2012 21:42:06 +0000 (15:42 -0600)]
more comment

12 years agomesa: consolidate subtexture x/y/width/height error checking code
Brian Paul [Tue, 18 Sep 2012 21:22:41 +0000 (15:22 -0600)]
mesa: consolidate subtexture x/y/width/height error checking code

This is the code that checks if a subtexure region is aligned to the
compressed format's block size.

12 years agodocs: remove obsolete target attribute
Andreas Boll [Tue, 18 Sep 2012 17:31:28 +0000 (19:31 +0200)]
docs: remove obsolete target attribute

12 years agodocs: news.html is the new index.html
Andreas Boll [Tue, 18 Sep 2012 16:59:33 +0000 (18:59 +0200)]
docs: news.html is the new index.html

12 years agodocs: remove obsolete frame layout
Andreas Boll [Tue, 18 Sep 2012 16:57:54 +0000 (18:57 +0200)]
docs: remove obsolete frame layout

12 years agodocs: add new iframe layout
Andreas Boll [Tue, 18 Sep 2012 16:57:02 +0000 (18:57 +0200)]
docs: add new iframe layout

12 years agodocs/news: linkify some active links
Andreas Boll [Wed, 19 Sep 2012 15:15:45 +0000 (17:15 +0200)]
docs/news: linkify some active links

12 years agodocs/news: deactivate dead links
Andreas Boll [Wed, 19 Sep 2012 15:15:39 +0000 (17:15 +0200)]
docs/news: deactivate dead links

I have left the links as <code> elements for the purpose of
documentation.

12 years agodocs/news: drop redundant link
Andreas Boll [Wed, 19 Sep 2012 15:15:34 +0000 (17:15 +0200)]
docs/news: drop redundant link

12 years agodocs/news: update link
Andreas Boll [Wed, 19 Sep 2012 15:15:31 +0000 (17:15 +0200)]
docs/news: update link

12 years agodocs/news: remove link to a non-existent page
Andreas Boll [Wed, 19 Sep 2012 15:15:24 +0000 (17:15 +0200)]
docs/news: remove link to a non-existent page

12 years agodocs: fix some issues in relnotes
Andreas Boll [Sat, 1 Sep 2012 09:18:19 +0000 (11:18 +0200)]
docs: fix some issues in relnotes

improve markup
fix link to relnotes-9.0
add missing relnotes links

12 years agodocs/devinfo: fix typo
Andreas Boll [Wed, 19 Sep 2012 10:10:32 +0000 (12:10 +0200)]
docs/devinfo: fix typo

12 years agowinsys/radeon: fix relocs caching
Vadim Girlin [Wed, 19 Sep 2012 00:48:16 +0000 (04:48 +0400)]
winsys/radeon: fix relocs caching

Don't cache pointers to elements of reallocatable array.
In some circumstances it caused false cache hits resulting in incorrect
command stream and gpu lockup.

Note: This is a candidate for the stable branches.

Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>
Reviewed-by: Marek Olšák <maraeo@gmail.com>
12 years agoradeon/llvm: Add a fdiv pattern.
Vincent Lejeune [Mon, 17 Sep 2012 20:20:18 +0000 (22:20 +0200)]
radeon/llvm: Add a fdiv pattern.

Reviewed-by: Tom Stellard <thomas.stellard at amd.com>
12 years agoradeon/llvm: reserve also corresponding 128bits reg
Vincent Lejeune [Tue, 11 Sep 2012 15:56:39 +0000 (17:56 +0200)]
radeon/llvm: reserve also corresponding 128bits reg

Reviewed-by: Tom Stellard <thomas.stellard at amd.com>
12 years agodocs: drop obsolete sourceforge link
Andreas Boll [Tue, 18 Sep 2012 12:39:17 +0000 (14:39 +0200)]
docs: drop obsolete sourceforge link

Signed-off-by: Brian Paul <brianp@vmware.com>
12 years agosoftpipe: implement the new can_create_resource() function
Brian Paul [Mon, 17 Sep 2012 01:44:07 +0000 (19:44 -0600)]
softpipe: implement the new can_create_resource() function

And define a SP_MAX_TEXTURE_SIZE value as we do in llvmpipe.

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
12 years agollvmpipe: implement the new can_create_resource() function
Brian Paul [Mon, 17 Sep 2012 01:43:50 +0000 (19:43 -0600)]
llvmpipe: implement the new can_create_resource() function

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
12 years agost/mesa: implement new proxy texture code
Brian Paul [Mon, 17 Sep 2012 01:42:15 +0000 (19:42 -0600)]
st/mesa: implement new proxy texture code

If the gallium driver implements the can_create_resource() function, call
it to do proxy texture size checks.

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
12 years agogallium: add new pipe_screen::can_create_resource() function
Brian Paul [Mon, 17 Sep 2012 01:40:13 +0000 (19:40 -0600)]
gallium: add new pipe_screen::can_create_resource() function

Used to implement proxy textures.  If a gallium driver doesn't implement
this function we'll just continue to use the core Mesa fallback code.

Without this hook we really have no good way to implement OpenGL proxy
textures with gallium drivers.

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
12 years agomesa: take cube faces into account in _mesa_test_proxy_teximage()
Brian Paul [Mon, 17 Sep 2012 01:15:28 +0000 (19:15 -0600)]
mesa: take cube faces into account in _mesa_test_proxy_teximage()

There will always be six cube faces so take that into consideration when
computing the texture size and comparing against the limit.

12 years agomesa: handle GL_PROXY_TEXTURE_CUBE_MAP in _mesa_num_tex_faces()
Brian Paul [Mon, 17 Sep 2012 01:14:56 +0000 (19:14 -0600)]
mesa: handle GL_PROXY_TEXTURE_CUBE_MAP in _mesa_num_tex_faces()

12 years agollvmpipe: set max cube texture size to 4K x 4K
Brian Paul [Mon, 17 Sep 2012 01:05:51 +0000 (19:05 -0600)]
llvmpipe: set max cube texture size to 4K x 4K

Before, the limit was 8K.  For 32-bit RGBA that would be require 1.5 GB
of memory (w/out mipmaps).  That's well beyond the LP_MAX_TEXTURE_SIZE
of 1GB.

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
12 years agomesa: move/fix levels check for glTexStorage()
Brian Paul [Sat, 15 Sep 2012 16:30:20 +0000 (10:30 -0600)]
mesa: move/fix levels check for glTexStorage()

Fix copy&paste error and move min levels check closer to max levels check.

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
12 years agomesa: rewrite glTexStorage() code
Brian Paul [Sat, 15 Sep 2012 16:30:20 +0000 (10:30 -0600)]
mesa: rewrite glTexStorage() code

Simplify the code and make it more like the other glTexImage commands.
Call _mesa_legal_texture_dimensions() to validate width, height, depth.
Call ctx->Driver.TestProxyTexImage() to make sure texture is not too large.

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
12 years agomesa: rework texture size error checking
Brian Paul [Sat, 15 Sep 2012 16:30:20 +0000 (10:30 -0600)]
mesa: rework texture size error checking

There are two aspects to texture image size checking:
1. Are the width, height, depth legal values (not negative, not larger
   than the max size for the mipmap level, etc)?
2. Is the texture just too large to handle?  For example, we might not be
   able to really allocate memory for a 3D texture of maxSize x maxSize x
   maxSize.

Previously, we did (1) via the ctx->Driver.TestProxyTextureImage() hook
but those tests are really device-independent.  Now we do (2) via that
hook since the max texture memory and texture shape are device-dependent.

Also, (1) is now done outside the general texture parameter error checking
functions because of the special interaction with proxy textures.  The
recently introduced PROXY_ERROR token is removed.

The teximage() and copyteximage() functions are bit simpler now (less
if-then nesting, etc.)

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
12 years agomesa: refactor _mesa_test_proxy_teximage() code
Brian Paul [Sat, 15 Sep 2012 16:30:20 +0000 (10:30 -0600)]
mesa: refactor _mesa_test_proxy_teximage() code

Basically, move the body into a new _mesa_legal_texture_dimensions() function.
More refactoring to come.

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>