i965: Use 64-byte offset alignment for shader storage buffers

author Iago Toral Quiroga <itoral@igalia.com>

Thu, 19 Mar 2015 10:27:21 +0000 (11:27 +0100)

committer Samuel Iglesias Gonsalvez <siglesias@igalia.com>

Fri, 25 Sep 2015 06:39:20 +0000 (08:39 +0200)
author Iago Toral Quiroga <itoral@igalia.com>
Thu, 19 Mar 2015 10:27:21 +0000 (11:27 +0100)
committer Samuel Iglesias Gonsalvez <siglesias@igalia.com>
Fri, 25 Sep 2015 06:39:20 +0000 (08:39 +0200)
diff --git a/src/mesa/drivers/dri/i965/brw_context.c b/src/mesa/drivers/dri/i965/brw_context.c

index 7c1c13300dc6d5811ace186a05b3174b71978ae4..0cfc8435964d07261cb1c09246d0dec375327f5a 100644 (file)
--- a/src/mesa/drivers/dri/i965/brw_context.c
+++ b/src/mesa/drivers/dri/i965/brw_context.c
@@ -567,6 +567,15 @@ brw_initialize_context_constants(struct brw_context *brw)
      * However, unaligned accesses are slower, so enforce buffer alignment.
      */
     ctx->Const.UniformBufferOffsetAlignment = 16;
      * However, unaligned accesses are slower, so enforce buffer alignment.
      */
     ctx->Const.UniformBufferOffsetAlignment = 16;
+
+   /* ShaderStorageBufferOffsetAlignment should be a cacheline (64 bytes) so
+    * that we can safely have the CPU and GPU writing the same SSBO on
+    * non-cachecoherent systems (our Atom CPUs). With UBOs, the GPU never
+    * writes, so there's no problem. For an SSBO, the GPU and the CPU can
+    * be updating disjoint regions of the buffer simultaneously and that will
+    * break if the regions overlap the same cacheline.
+    */
+   ctx->Const.ShaderStorageBufferOffsetAlignment = 64;
     ctx->Const.TextureBufferOffsetAlignment = 16;
     ctx->Const.MaxTextureBufferSize = 128 * 1024 * 1024;
  
     ctx->Const.TextureBufferOffsetAlignment = 16;
     ctx->Const.MaxTextureBufferSize = 128 * 1024 * 1024;
author	Iago Toral Quiroga <itoral@igalia.com>
	Thu, 19 Mar 2015 10:27:21 +0000 (11:27 +0100)
committer	Samuel Iglesias Gonsalvez <siglesias@igalia.com>
	Fri, 25 Sep 2015 06:39:20 +0000 (08:39 +0200)