From 03e284ec9ecc414a58e7b029144d1e3931bdc1cd Mon Sep 17 00:00:00 2001 From: Erik Faye-Lund Date: Mon, 15 Jun 2020 11:52:42 +0200 Subject: [PATCH] docs: fixup relnotes after rst-conversion Acked-by: Eric Anholt Part-of: --- docs/relnotes/19.2.1.rst | 68 +++++++ docs/relnotes/19.2.2.rst | 50 +++++ docs/relnotes/19.2.3.rst | 56 ++++++ docs/relnotes/19.2.4.rst | 2 + docs/relnotes/19.2.5.rst | 41 ++++ docs/relnotes/19.2.6.rst | 20 ++ docs/relnotes/19.2.7.rst | 26 +++ docs/relnotes/19.2.8.rst | 32 +++ docs/relnotes/19.3.0.rst | 389 ++++++++++++++++++++++++++++++++++++ docs/relnotes/19.3.1.rst | 23 +++ docs/relnotes/19.3.2.rst | 41 ++++ docs/relnotes/19.3.3.rst | 77 +++++++ docs/relnotes/19.3.4.rst | 80 ++++++++ docs/relnotes/19.3.5.rst | 74 +++++++ docs/relnotes/20.0.0.rst | 413 ++++++++++++++++++++++++++++++++++++++ docs/relnotes/20.0.1.rst | 77 +++++++ docs/relnotes/20.0.2.rst | 56 ++++++ docs/relnotes/20.0.3.rst | 65 ++++++ docs/relnotes/20.0.4.rst | 5 + docs/relnotes/20.0.5.rst | 95 +++++++++ docs/relnotes/20.0.6.rst | 35 ++++ docs/relnotes/20.0.7.rst | 62 ++++++ docs/relnotes/20.0.8.rst | 83 ++++++++ docs/relnotes/20.1.0.rst | 419 +++++++++++++++++++++++++++++++++++++++ docs/relnotes/20.1.1.rst | 71 +++++++ 25 files changed, 2360 insertions(+) diff --git a/docs/relnotes/19.2.1.rst b/docs/relnotes/19.2.1.rst index 49202c09e2a..a98a5fee769 100644 --- a/docs/relnotes/19.2.1.rst +++ b/docs/relnotes/19.2.1.rst @@ -41,12 +41,32 @@ Bug fixes Changes ------- +Andreas Gottschling (1): + - drisw: Fix shared memory leak on drawable resize + +Andres Gomez (1): + - egl: Remove the 565 pbuffer-only EGL config under X11. + +Andrii Simiklit (1): + - glsl: disallow incompatible matrices multiplication + +Bas Nieuwenhuizen (1): + - radv: Fix condition for skipping the continue CS. + +Connor Abbott (1): + - nir/opt_large_constants: Handle store writemasks + +Danylo Piliaiev (1): + - st/nine: Ignore D3DSIO_RET if it is the last instruction in a shader + +Dylan Baker (9): + - meson: fix logic for generating .pc files with old glvnd - meson: Try finding libxvmcw via pkg-config before using find_library - meson: Link xvmc with libxv @@ -57,21 +77,45 @@ Changes - scripts: Add a gen_release_notes.py script - release: Add an update_release_calendar.py script - bin: delete unused releasing scripts + +Eric Engestrom (3): + - radv: fix s/load/store/ copy-paste typo - meson: drop -Wno-foo bug workaround for Meson < 0.46 - meson: add missing idep_nir_headers in iris_gen_libs + +Erik Faye-Lund (1): + - glsl: correct bitcast-helpers + +Ian Romanick (1): + - nir/range-analysis: Bail if the types don't match + +Jason Ekstrand (1): + - intel/fs: Fix fs_inst::flags_read for ANY/ALL predicates + +Ken Mays (1): + - haiku: fix Mesa build + +Kenneth Graunke (2): + - iris: Disable CCS_E for 32-bit floating point textures. - iris: Fix iris_rebind_buffer() for VBOs with non-zero offsets. + +Lionel Landwerlin (6): + - anv: gem-stubs: return a valid fd got anv_gem_userptr() - intel: use proper label for Comet Lake skus - mesa: don't forget to clear \_Layer field on texture unit - intel: fix topology query - intel: fix subslice computation from topology data - intel/isl: Set null surface format to R32_UINT + +Marek Olšák (7): + - gallium/vl: don't set PIPE_HANDLE_USAGE_EXPLICIT_FLUSH - gallium: extend resource_get_param to be as capable as resource_get_handle @@ -80,13 +124,37 @@ Changes - ac: fix num_good_cu_per_sh for harvested chips - ac: add radeon_info::tcc_harvested - radeonsi/gfx10: fix corruption for chips with harvested TCCs + +Mauro Rossi (1): + - android: compiler/nir: build nir_divergence_analysis.c + +Michel Dänzer (1): + - radeonsi: fix VAAPI segfault due to various bugs + +Michel Zou (1): + - scons: add py3 support + +Prodea Alexandru-Liviu (1): + - scons/MSYS2-MinGW-W64: Fix build options defaults + +Rhys Perry (1): + - nir/opt_remove_phis: handle phis with no sources + +Stephen Barber (1): + - nouveau: add idep_nir_headers as dep for libnouveau + +Tapani Pälli (2): + - iris: disable aux on first get_param if not created with aux - anv/android: fix images created with external format support + +pal1000 (2): + - scons: Fix MSYS2 Mingw-w64 build. - scons/windows: Support build with LLVM 9. diff --git a/docs/relnotes/19.2.2.rst b/docs/relnotes/19.2.2.rst index 0efb8b17ee4..b580da8a2e1 100644 --- a/docs/relnotes/19.2.2.rst +++ b/docs/relnotes/19.2.2.rst @@ -48,20 +48,40 @@ Bug fixes Changes ------- +Alan Coopersmith (6): + - c99_compat.h: Don't try to use 'restrict' in C++ code - util: Make Solaris implemention of p_atomic_add work with gcc - util: Workaround lack of flock on Solaris - util: Solaris has linux-style pthread_setname_np - meson: recognize "sunos" as the system name for Solaris - intel/common: include unistd.h for ioctl() prototype on Solaris + +Alejandro Piñeiro (1): + - v3d: take into account prim_counts_offset + +Bas Nieuwenhuizen (3): + - radv: Disallow sparse shared images. - nir/dead_cf: Remove dead control flow after infinite loops. - radv: Fix single stage constant flush with merged shaders. + +Clément Guérin (1): + - radeonsi: enable zerovram for Rocket League + +Connor Abbott (2): + - nir/sink: Rewrite loop handling logic - nir/sink: Don't sink load_ubo to outside of its defining loop + +Dylan Baker (1): + - docs: Add SHA256 sum for 19.2.1 + +Eric Engestrom (7): + - GL: drop symbols mangling support - meson: rename \`glvnd_missing_pc_files\` to \`not glvnd_has_headers_and_pc_files\` @@ -71,24 +91,54 @@ Changes - meson: skip installation of GLVND-provided headers - util/u_atomic: fix return type of p_atomic_{inc,dec}_return() and p_atomic_{cmp,}xchg() + +Ian Romanick (2): + - nir/search: Fix possible NULL dereference in is_fsign - intel/vec4: Don't try both sources as immediates for DPH + +James Xiong (1): + - iris: finish aux import on get_param + +Kenneth Graunke (2): + - iris: Properly unreference extra VBOs for draw parameters - iris: Implement the Gen < 9 tessellation quads workaround + +Lepton Wu (1): + - egl/android: Remove our own reference to buffers. + +Lionel Landwerlin (3): + - etnaviv: remove variable from global namespace - anv: fix vkUpdateDescriptorSets with inline uniform blocks - anv: fix memory leak on device destroy + +Lucas Stach (3): + - etnaviv: fix vertex buffer state emission for single stream GPUs - rbug: fix transmitted texture sizes - rbug: unwrap index buffer resource + +Pierre-Eric Pelloux-Prayer (1): + - mesa: fix invalid target error handling for teximage + +Roland Scheidegger (1): + - gallivm: Fix saturated signed psub/padd intrinsics on llvm 8 + +Samuel Pitoiset (6): + - drirc: enable vk_x11_override_min_image_count for DOOM - radv: bump minTexelBufferOffsetAlignment to 4 - radv: fix DCC fast clear code for intensity formats - Revert "radv: do not emit PKT3_CONTEXT_CONTROL with AMDGPU 3.6.0+" - radv: fix DCC fast clear code for intensity formats (correctly) - radv: fix updating bound fast ds clear values with different aspects + +Timothy Arceri (1): + - glsl: fix crash compiling bindless samplers inside unnamed UBOs diff --git a/docs/relnotes/19.2.3.rst b/docs/relnotes/19.2.3.rst index 9ef1aa17981..a3ce5c47cd0 100644 --- a/docs/relnotes/19.2.3.rst +++ b/docs/relnotes/19.2.3.rst @@ -40,12 +40,23 @@ Bug fixes Changes ------- +Bas Nieuwenhuizen (4): + - radv: Fix timeout handling in syncobj wait. - radv: Remove \_mesa_locale_init/fini calls. - turnip: Remove \_mesa_locale_init/fini calls. - anv: Remove \_mesa_locale_init/fini calls. + +Caio Marcelo de Oliveira Filho (1): + - anv: Fix output of INTEL_DEBUG=bat for chained batches + +Danylo Piliaiev (1): + - glsl: Initialize all fields of ir_variable in constructor + +Dylan Baker (11): + - bin/gen_release_notes.py: fix conditional of bugfix - bin/gen_release_notes.py: strip '#' from gitlab bugs - bin/gen_release_notes.py: Return "None" if there are no new features @@ -58,30 +69,75 @@ Changes - cherry-ignore: update for 19.2.3 cycle - nir: correct use of identity check in python - meson: Add dep_glvnd to egl deps when building with glvnd + +Ilia Mirkin (1): + - nv50/ir: mark STORE destination inputs as used + +Illia Iorin (1): + - Revert "mesa/main: Fix multisample texture initialize" + +Jason Ekstrand (2): + - anv: Fix a potential BO handle leak - anv/tests: Zero-initialize instances + +Jon Turney (2): + - rbug: Fix use of alloca() without #include "c99_alloca.h" - Fix timespec_from_nsec test for 32-bit time_t + +Jonathan Marek (1): + - etnaviv: fix depth bias + +Kenneth Graunke (1): + - iris: Fix "Force Zero RTA Index Enable" setting again + +Lionel Landwerlin (2): + - anv: fix unwind of vkCreateDevice fail - mesa: check draw buffer completeness on glClearBufferfi/glClearBufferiv + +Marek Olšák (1): + - util/u_queue: skip util_queue_finish if num_threads is 0 + +Nanley Chery (5): + - anv: Properly allocate aux-tracking space for CCS_E - intel/blorp: Disable depth testing for slow depth clears - iris: Clear ::has_hiz when disabling aux - iris: Don't leak the resource for unsupported modifier - iris: Disallow incomplete resource creation + +Paulo Zanoni (1): + - intel/compiler: remove the operand restriction for src1 on GLK + +Pierre-Eric Pelloux-Prayer (1): + - mesa: enable msaa in clear_with_quad if needed + +Sagar Ghuge (1): + - intel/blorp: Assign correct view while clearing depth stencil + +Samuel Pitoiset (4): + - radv: do not create meta pipelines with 16 samples - radv: do not emit rbplus if attachments are undefined - radv/gfx10: fix 3D images - radv: fix vkUpdateDescriptorSets with inline uniform blocks + +Tapani Pälli (1): + - i965: setup sized internalformat for MESA_FORMAT_R10G10B10A2_UNORM + +Thomas Hellstrom (2): + - svga: Fix banded DMA upload unmap - winsys/svga: Limit the maximum DMA hardware buffer size diff --git a/docs/relnotes/19.2.4.rst b/docs/relnotes/19.2.4.rst index ba52b532050..36a3f552091 100644 --- a/docs/relnotes/19.2.4.rst +++ b/docs/relnotes/19.2.4.rst @@ -36,4 +36,6 @@ Bug fixes Changes ------- +Lionel Landwerlin (1): + - mesa: check framebuffer completeness only after state update diff --git a/docs/relnotes/19.2.5.rst b/docs/relnotes/19.2.5.rst index dcb9d040664..374b6157c0e 100644 --- a/docs/relnotes/19.2.5.rst +++ b/docs/relnotes/19.2.5.rst @@ -38,26 +38,67 @@ Bug fixes Changes ------- +Ben Crocker (1): + - llvmpipe: use ppc64le/ppc64 Large code model for JIT-compiled shaders + +Brian Paul (1): + - Call shmget() with permission 0600 instead of 0777 + +Caio Marcelo de Oliveira Filho (1): + - spirv: Don't leak GS initialization to other stages + +Danylo Piliaiev (1): + - i965: Unify CC_STATE and BLEND_STATE atoms on Haswell as a workaround + +Dylan Baker (2): + - docs: Add SHA256 sum for for 19.2.4 - cherry-ignore: Update for 19.2.4 cycle + +Eric Engestrom (1): + - egl: fix \_EGL_NATIVE_PLATFORM fallback + +Ian Romanick (2): + - nir/algebraic: Add the ability to mark a replacement as exact - nir/algebraic: Mark other comparison exact when removing a == a + +Illia Iorin (1): + - mesa/main: Ignore filter state for MS texture completeness + +Jason Ekstrand (1): + - anv: Stop bounds-checking pushed UBOs + +Lepton Wu (1): + - gallium: dri2: Use index as plane number. + +Lionel Landwerlin (3): + - anv: invalidate file descriptor of semaphore sync fd at vkQueueSubmit - anv: remove list items on batch fini - anv/wsi: signal the semaphore in the acquireNextImage + +Marek Olšák (3): + - st/mesa: fix Sanctuary and Tropics by disabling ARB_gpu_shader5 for them - tgsi_to_nir: fix masked out image loads - tgsi_to_nir: handle PIPE_FORMAT_NONE in image opcodes + +Paulo Zanoni (1): + - intel/compiler: fix nir_op_{i,u}*32 on ICL + +Pierre-Eric Pelloux-Prayer (3): + - radeonsi: disable sdma for gfx10 - radeonsi: tell the shader disk cache what IR is used - radeonsi: fix shader disk cache key diff --git a/docs/relnotes/19.2.6.rst b/docs/relnotes/19.2.6.rst index 944efee630a..9e195a9de2a 100644 --- a/docs/relnotes/19.2.6.rst +++ b/docs/relnotes/19.2.6.rst @@ -39,11 +39,31 @@ Bug fixes Changes ------- +Alejandro Piñeiro (1): + - v3d: adds an extra MOV for any sig.ld\* + +Dave Airlie (1): + - llvmpipe/ppc: fix if/ifdef confusion in backport. + +Dylan Baker (2): + - docs/relnotes/19.2.5: Add SHA256 sum - meson: generate .pc files for gles and gles2 with old glvnd + +Eric Engestrom (1): + - vulkan: delete typo'd header + +Hyunjun Ko (1): + - freedreno/ir3: fix printing output registers of FS. + +Jose Maria Casanova Crespo (1): + - v3d: Fix predication with atomic image operations + +Yevhenii Kolesnikov (1): + - glsl: Enable textureSize for samplerExternalOES diff --git a/docs/relnotes/19.2.7.rst b/docs/relnotes/19.2.7.rst index 0610e8b4815..f0e9f9dd7b1 100644 --- a/docs/relnotes/19.2.7.rst +++ b/docs/relnotes/19.2.7.rst @@ -40,16 +40,42 @@ Bug fixes Changes ------- +Bas Nieuwenhuizen (2): + - radv: Allocate cmdbuffer space for buffer marker write. - radv: Unify max_descriptor_set_size. + +Boris Brezillon (1): + - gallium: Fix the ->set_damage_region() implementation + +Ian Romanick (1): + - intel/fs: Disable conditional discard optimization on Gen4 and Gen5 + +Jason Ekstrand (1): + - anv: Set up SBE_SWIZ properly for gl_Viewport + +Jonathan Gray (2): + - winsys/amdgpu: avoid double simple_mtx_unlock() - i965: update Makefile.sources for perf changes + +Rhys Perry (1): + - radv: set writes_memory for global memory stores/atomics + +Samuel Pitoiset (3): + - radv: fix enabling sample shading with SampleID/SamplePosition - radv/gfx10: fix implementation of exclusive scans - radv: fix compute pipeline keys when optimizations are disabled + +Yevhenii Kolesnikov (1): + - meson: Fix linkage of libgallium_nine with libgalliumvl + +Zebediah Figura (1): + - Revert "draw: revert using correct order for prim decomposition." diff --git a/docs/relnotes/19.2.8.rst b/docs/relnotes/19.2.8.rst index 06547b720ab..019f414f656 100644 --- a/docs/relnotes/19.2.8.rst +++ b/docs/relnotes/19.2.8.rst @@ -36,18 +36,41 @@ Bug fixes Changes ------- +Alyssa Rosenzweig (1): + - gallium/util: Support POLYGON in u_stream_outputs_for_vertices + +Bas Nieuwenhuizen (2): + - amd/common: Always use addrlib for HTILE tc-compat. - amd/common: Fix tcCompatible degradation on Stoney. + +Dylan Baker (4): + - docs: Add SHA256 sums for 19.2.7 - meson/broadcom: libbroadcom_cle needs expat headers - meson/broadcom: libbroadcom_cle also needs zlib - cherry-ignore: Update for 19.2.8 + +Gert Wollny (1): + - virgl: Increase the shader transfer buffer by doubling the size + +Iván Briano (1): + - anv: Export filter_minmax support only when it's really supported + +Jason Ekstrand (2): + - anv: Re-emit all compute state on pipeline switch - anv: Don't leak when set_tiling fails + +Kenneth Graunke (1): + - iris: Default to X-tiling for scanout buffers without modifiers + +Lionel Landwerlin (7): + - intel/perf: fix invalid hw_id in query results - intel/perf: set read buffer len to 0 to identify empty buffer - intel/perf: take into account that reports read can be fairly old @@ -55,7 +78,16 @@ Changes - intel/perf: fix improper pointer access - anv: fix fence underlying primitive checks - mesa: avoid triggering assert in implementation + +Nanley Chery (2): + - gallium/dri2: Fix creation of multi-planar modifier images - gallium: Store the image format in winsys_handle + +Rob Clark (1): + - nir/lower_clip: Fix incorrect driver loc for clipdist outputs + +Timothy Arceri (1): + - glsl/nir: iterate the system values list when adding varyings diff --git a/docs/relnotes/19.3.0.rst b/docs/relnotes/19.3.0.rst index da620b563b5..770386989f6 100644 --- a/docs/relnotes/19.3.0.rst +++ b/docs/relnotes/19.3.0.rst @@ -120,6 +120,8 @@ Bug fixes Changes ------- +Adam Jackson (44): + - glx: Whitespace cleanups - glx: Sync with Khronos - glx: Make \__glXGetDrawableAttribute return true sometimes @@ -164,6 +166,9 @@ Changes - surfaceless: Support EGL_WL_bind_wayland_display - egl: Make native display detection work more than once - gallium/xlib: Fix xmesa drawable creation + +Alan Coopersmith (6): + - gallium: Fix a bunch of undefined left-shifts in u_format\_\* - c99_compat.h: Don't try to use 'restrict' in C++ code - util: Make Solaris implemention of p_atomic_add work with gcc @@ -171,14 +176,26 @@ Changes - util: Solaris has linux-style pthread_setname_np - meson: recognize "sunos" as the system name for Solaris - intel/common: include unistd.h for ioctl() prototype on Solaris + +Alejandro Piñeiro (5): + - i965: enable ARB_gl_spirv extension and ARB_spirv_extensions for gen7+ - mesa/version: uncomment SPIR-V extensions - i965: Enable OpenGL 4.6 for Gen8+ - v3d: take into account prim_counts_offset - v3d: adds an extra MOV for any sig.ld\* + +Alex Smith (1): + - radv: Change memory type order for GPUs without dedicated VRAM + +Alexandros Frantzis (1): + - gitlab-ci: Update required libdrm version + +Alyssa Rosenzweig (220): + - pan/decode: Eliminate DYN_MEMORY_PROP - pan/decode: Don't print MALI_DRAW_NONE - panfrost: Move pan_invocation to shared panfrost/ @@ -399,12 +416,21 @@ Changes - panfrost: Remove unused definitions in mali-job.h - pipe-loader: Build kmsro loader for with all kmsro targets - gallium/util: Support POLYGON in u_stream_outputs_for_vertices + +Andreas Baierl (5): + - lima/ppir: Rename ppir_op_dummy to ppir_op_undef - lima/ppir: Add undef handling - lima/ppir: Add various varying fetch sources to disassembler - lima: Fix compiler warning in standalone compiler - lima: Fix crash when there are no vertex shader attributes + +Andreas Gottschling (1): + - drisw: Fix shared memory leak on drawable resize + +Andres Gomez (12): + - nir/algebraic: mark float optimizations returning one parameter as inexact - docs: Update to OpenGL 4.6 in the release notes @@ -422,15 +448,30 @@ Changes - docs/relnotes: add support for GL_ARB_gl_spirv, GL_ARB_spirv_extensions and OpenGL 4.6 on i965 and iris - egl: Remove the 565 pbuffer-only EGL config under X11. + +Andres Rodriguez (2): + - radv: add RADV_DEBUG=allentrypoints - radv: additional query fixes + +Andrii Simiklit (1): + - glsl: disallow incompatible matrices multiplication + +Anuj Phogat (5): + - intel/gen12: Add L3 configurations - intel: Add few Ice Lake brand strings - genxml/gen11+: Add COMMON_SLICE_CHICKEN4 register - intel/gen11+: Enable Hardware filtering of Semi-Pipelined State in WM - intel/isl/icl: Use halign 8 instead of 4 hw workaround + +Arcady Goldmints-Orlov (1): + - anv: fix descriptor limits on gen8 + +Bas Nieuwenhuizen (63): + - radv: Use correct vgpr_comp_cnt for VS if both prim_id and instance_id are needed. - radv: Emit VGT_GS_ONCHIP_CNTL for tess on GFX10. @@ -496,7 +537,13 @@ Changes - radv: Unify max_descriptor_set_size. - radv: Fix timeline semaphore refcounting. - radv: Fix RGBX Android<->Vulkan format correspondence. + +Ben Crocker (1): + - llvmpipe: use ppc64le/ppc64 Large code model for JIT-compiled shaders + +Boris Brezillon (73): + - panfrost: Free the instruction object in mir_remove_instruction() - panfrost: Free all block/instruction objects before leaving midgard_compile_shader_nir() @@ -579,9 +626,15 @@ Changes - panfrost: Fix the DISCARD_WHOLE_RES case in transfer_map() - gallium: Fix the ->set_damage_region() implementation - panfrost: Make sure we reset the damage region of RTs at flush time + +Brian Paul (3): + - st/nir: fix illegal designated initializer in st_glsl_to_nir.cpp - REVIEWERS: add VMware reviewers - Call shmget() with permission 0600 instead of 0777 + +Caio Marcelo de Oliveira Filho (66): + - intel/compiler: Silence maybe-uninitialized warning in GCC 9.1.1 - anv: Drop unused local variable - compiler/glsl: Fix warning about unused function @@ -649,8 +702,14 @@ Changes - spirv: Don't fail if multiple ordering semantics bits are set - spirv: Don't leak GS initialization to other stages - anv: Initialize depth_bounds_test_enable when not explicitly set + +Chris Wilson (2): + - iris: Allow packed RGB pbo uploads - st/mesa: Map MESA_FORMAT_RGB_UNORM8 <-> PIPE_FORMAT_R8G8B8_UNORM + +Christian Gmeiner (13): + - gallium: util_set_vertex_buffers_mask(..): make use of u_bit_consecutive(..) - etnaviv: a bit of micro-optimization @@ -665,7 +724,13 @@ Changes - etnaviv: store updated usage in pipe_transfer object - etnaviv: keep track of buffer valid ranges for PIPE_BUFFER - etnaviv: remove dead code + +Clément Guérin (1): + - radeonsi: enable zerovram for Rocket League + +Connor Abbott (40): + - st/nir: Fix num_inputs for VS inputs - radeonsi/nir: Don't recompute num_inputs and num_outputs - ac/nir: Handle const array offsets in get_deref_offset() @@ -706,7 +771,13 @@ Changes - nir: Fix overlapping vars in nir_assign_io_var_locations() - nir/sink: Rewrite loop handling logic - nir/sink: Don't sink load_ubo to outside of its defining loop + +Daniel Kolesa (1): + - util: add auxv based PowerPC AltiVec/VSX detection + +Daniel Schürmann (44): + - nir/algebraic: some subtraction optimizations - aco: Initial commit of independent AMD compiler - radv/aco: Setup alternate path in RADV to support the experimental @@ -757,8 +828,17 @@ Changes - aco: preserve kill flag on moved operands during RA - aco: don't split live-ranges of linear VGPRs - aco: fix a couple of value numbering issues + +Daniel Stone (1): + - panfrost: Respect offset for imported resources + +Danilo Spinella (1): + - egl: Include stddef.h in generated source + +Danylo Piliaiev (10): + - nir/loop_unroll: Update the comments for loop_prepare_for_unroll - nir/loop_unroll: Prepare loop for unrolling in wrapper_unroll - nir/loop_analyze: Treat do{}while(false) loops as 0 iterations @@ -769,6 +849,9 @@ Changes - intel/compiler: Fix C++ one definition rule violations - glsl: Initialize all fields of ir_variable in constructor - i965: Unify CC_STATE and BLEND_STATE atoms on Haswell as a workaround + +Dave Airlie (75): + - virgl: drop unused format field - virgl: fix format conversion for recent gallium changes. - gallivm: fix atomic compare-and-swap @@ -845,8 +928,14 @@ Changes - zink: add samples to rasterizer - zink: attempt to get multisample resource creation right - llvmpipe/ppc: fix if/ifdef confusion in backport. + +Dave Stevenson (1): + - broadcom/v3d: Allow importing linear BOs with arbitrary offset/stride. + +Duncan Hopkins (7): + - zink: clamped limits to INT_MAX when stored as uint32_t. - zink: fix line-width calculation - zink: respect ubo buffer alignment requirement @@ -855,6 +944,9 @@ Changes - zink: Use optimal layout instead of general. Reduces valid layer warnings. Fixes RADV image noise. - zink: make sure src image is transfer-src-optimal + +Dylan Baker (120): + - docs: Mark 19.2.0-rc2 as done and push back rc3 and rc4/final - glsl/tests: Handle windows \\r\n new lines - meson: don't try to generate i18n translations on windows @@ -983,14 +1075,23 @@ Changes - Revert "egl: avoid local modifications for eglext.h Khronos standard header file" - Revert "egl: move #include of local headers out of Khronos headers" + +Eduardo Lima Mitev (4): + - nir: Add new texop nir_texop_tex_prefetch - freedreno/ir3: Add a NIR pass to select tex instructions eligible for pre-fetch - nir: Add a new ALU nir_op_imad24_ir3 - freedreno/ir3: Handle newly added opcode nir_op_imad24_ir3 + +Emil Velikov (3): + - mesa: bump version to 19.3.0-devel - docs: add 19.3.0-devel release notes template - docs: update calendar for 19.2.x + +Eric Anholt (57): + - gallium: Add a block depth field to the u_formats table. - gallium: Add block depth to the format utils. - gallium: Add the ASTC 3D formats. @@ -1058,6 +1159,9 @@ Changes - gallium: Drop the unused PIPE_FORMAT_A*L\* formats. - mesa: Redefine the RG formats as array formats. - ci: Disable lima until its farm can get fixed. + +Eric Engestrom (104): + - scons: define MESA_LLVM_VERSION_STRING like the other build systems do - llvmpipe: use LLVM version string instead of re-computing it @@ -1174,6 +1278,9 @@ Changes - egl: fix \_EGL_NATIVE_PLATFORM fallback - egl: move #include of local headers out of Khronos headers - vulkan: delete typo'd header + +Erico Nunes (7): + - lima: fix ppir spill stack allocation - lima/ppir: lower selects to scalars - lima/ppir: enable vectorize optimization @@ -1181,6 +1288,9 @@ Changes - lima/ppir: optimizations in regalloc spilling code - lima/ppir: improve regalloc spill cost calculation - lima: remove partial clear support from pipe->clear() + +Erik Faye-Lund (210): + - gallium/auxiliary/indices: consistently apply start only to input - mesa/main: remove unused include - util: fix SSE-version needed for double opcodes @@ -1393,6 +1503,9 @@ Changes - zink: do not advertize coherent mapping - zink: disable fragment-shader texture-lod - zink: correct depth-stencil format + +Francisco Jerez (56): + - intel/fs: Teach fs_inst::is_send_from_grf() about some missing send-like instructions. - intel/fs: Define is_payload() method of the IR instruction class. @@ -1465,22 +1578,40 @@ Changes message. - intel/fs/gen12: Fix barrier codegen. - intel/fs/gen11+: Fix CS_OPCODE_CS_TERMINATE codegen. + +Fritz Koenig (5): + - include/GLES2: Sync GLES2 headers with Khronos - mesa: GetFramebufferParameteriv spelling - mesa: Allow MESA_framebuffer_flip_y for GLES 3 - gallium: Enable MESA_framebuffer_flip_y - freedreno: reorder format check + +Gert Wollny (4): + - radeonsi: Release storage for smda_uploads when the context is destroyed - etnaviv: enable triangle strips only when the hardware supports it - r600: Fix interpolateAtCentroid - r600: Disable eight bit three channel formats + +Greg V (1): + - clover: use iterator_range in get_kernel_nodes + +Gurchetan Singh (4): + - virgl: remove stride from virgl_hw_res - virgl: modify resource_create_from_handle(..) callback - virgl: modify internal structures to track winsys-supplied data - virgl: honor winsys supplied metadata + +Haihao Xiang (1): + - i965: support AYUV/XYUV for external import only + +Hal Gentz (11): + - glx: Fix SEGV due to dereferencing a NULL ptr from XCB-GLX. - clover: Fix build after clang r370122. - gallium/osmesa: Fix the inability to set no context as current. @@ -1493,6 +1624,9 @@ Changes - Revert "egl: Puts RGBA visuals in the second config selection group." - Revert "egl: Fixes transparency with EGL and X11." - Revert "egl: Add EGL_CONFIG_SELECT_GROUP_MESA ext." + +Heinrich Fink (8): + - include: sync GL headers with registry - specs: Sync framebuffer_flip_y text with GL registry - headers: remove redundant GL token from GL wrapper @@ -1501,10 +1635,19 @@ Changes - headers: Add GL_MESA_EGL_sync token to GL - egl: Add GL_MESA_EGL_sync support - mesa/gl: Sync with Khronos registry + +Hyunjun Ko (3): + - freedreno/ir3: Add data structures to support texture pre-fetch - freedreno/ir3: Add support for texture sampling pre-dispatch - freedreno/ir3: fix printing output registers of FS. + +Iago Toral (1): + - v3d: drop unused shader_rec_count member from context + +Iago Toral Quiroga (13): + - prog_to_nir: VARYING_SLOT_PSIZ is a scalar - gallium/ttn: VARYING_SLOT_PSIZ and VARYING_SLOT_FOGC are scalar - nir/lower_point_size: assume scalar PSIZ @@ -1520,6 +1663,9 @@ Changes - broadcom: document known hardware issues for L2T flush command - v3d: request the kernel to flush caches when TMU is dirty - st/mesa: only require ESSL 3.1 for geometry shaders + +Ian Romanick (22): + - nir/algrbraic: Don't optimize open-coded bitfield reverse when lowering is enabled - intel/compiler: Request bitfield_reverse lowering on pre-Gen7 @@ -1553,33 +1699,57 @@ Changes - nir/algebraic: Mark other comparison exact when removing a == a - intel/fs: Disable conditional discard optimization on Gen4 and Gen5 - intel/compiler: Fix 'comparison is always true' warning + +Icenowy Zheng (4): + - lima: reset scissor state if scissor test is disabled - lima: fix PLBU viewport configuration - lima: support rectangle texture - lima: do not set the PP uniforms address lowest bits + +Ilia Mirkin (6): + - gallium/vl: use compute preference for all multimedia, not just blit - teximage: ensure that Tex*SubImage\* checks format - gallium/tgsi: add support for DEMOTE and READ_HELPER opcodes - nvc0: add support for GL_EXT_demote_to_helper_invocation - gm107/ir: fix loading z offset for layered 3d image bindings - nv50/ir: mark STORE destination inputs as used + +Illia Iorin (2): + - Revert "mesa/main: Fix multisample texture initialize" - mesa/main: Ignore filter state for MS texture completeness + +Indrajit Das (1): + - radeon/vcn: exclude raven2 from vcn 2.0 encode initialization + +James Xiong (5): + - gallium: simplify throttle implementation - gallium: rename PIPE_CAP_MAX_FRAMES_IN_FLIGHT to PIPE_CAP_THROTTLE - iris: finish aux import on get_param - gallium: do not increase ref count of the new throttle fence - iris: try to set the specified tiling when importing a dmabuf + +Jan Beich (6): + - gallium/hud: add CPU usage support for DragonFly/NetBSD/OpenBSD - util: skip NEON detection if built with -mfpu=neon - util: detect NEON at runtime on FreeBSD - util: skip AltiVec detection if built with -maltivec - util: detect AltiVec at runtime on BSDs - util: simplify BSD includes + +Jan Zielinski (3): + - swr/rasterizer: Enable ARB_fragment_layer_viewport - swr/rasterizer: Fix GS attributes processing - gallium/swr: Fix depth values for blit scenario + +Jason Ekstrand (57): + - nir: Add explicit signs to image min/max intrinsics - intel/nir: Add a helper for getting BRW_AOP from an intrinsic - v3d: Use the correct opcodes for signed image min/max @@ -1638,15 +1808,36 @@ Changes - anv: Set up SBE_SWIZ properly for gl_Viewport - anv: Re-emit all compute state on pipeline switch - anv: Don't leak when set_tiling fails + +Jean Hertel (1): + - Fix missing dri2_load_driver on platform_drm + +Jiadong Zhu (1): + - mesa: fix texStore for FORMAT_Z32_FLOAT_S8X24_UINT + +Jiang, Sonny (1): + - loader: always map the "amdgpu" kernel driver name to radeonsi (v2) + +John Stultz (1): + - Android.mk: Fix missing \\ from recent llvm change + +Jon Turney (2): + - Fix timespec_from_nsec test for 32-bit time_t - rbug: Fix use of alloca() without #include "c99_alloca.h" + +Jonathan Gray (3): + - mapi: Adapted libglvnd x86 tsd changes - winsys/amdgpu: avoid double simple_mtx_unlock() - i965: update Makefile.sources for perf changes + +Jonathan Marek (90): + - freedreno/a2xx: ir2: fix lowering of instructions after float lowering - freedreno/a2xx: ir2: remove pointcoord y invert @@ -1740,6 +1931,9 @@ Changes - etnaviv: fix non-pointsprite points on GC7000L - freedreno/ir3: disable texture prefetch for 1d array textures - freedreno/registers: fix a6xx_2d_blit_cntl ROTATE + +Jordan Justen (42): + - intel/genxml: Handle field names with different spacing/hyphen - intel/genxml/gen11: Add spaces in EnableUnormPathInColorPipe - intel/genxml: Run sort_xml.sh to tidy gen9.xml and gen11.xml @@ -1782,16 +1976,25 @@ Changes - iris: Add IRIS_DIRTY_RENDER_BUFFER state flag - iris/gen11+: Move flush for render target change - iris: Allow max dynamic pool size of 2GB for gen12 + +Jose Maria Casanova Crespo (5): + - mesa: recover target_check before get_current_tex_objects - v3d: writes to magic registers aren't RF writes after THREND - v3d: flag dirty state when binding compute states - v3d: Explicitly expose OpenGL ES Shading Language 3.1 - v3d: Fix predication with atomic image operations + +José Fonseca (5): + - glx: Fix incompatible function pointer types. - util: Prevent implicit declaration of function getenv. - util: Prevent strcasecmp macro redefinion. - scons: Make GCC builds stricter. - scons: Fix force_scons parsing. + +Juan A. Suarez Romero (14): + - docs: add release notes for 19.1.5 - docs: add sha256 checksums for 19.1.5 - docs: update calendar, add news item and link release notes for @@ -1810,6 +2013,9 @@ Changes - docs: add release notes for 19.1.8 - docs: update calendar, add news item and link release notes for 19.1.8 + +Karol Herbst (15): + - gallium: add blob field to pipe_llvm_program_header - rename pipe_llvm_program_header to pipe_binary_program_header - clover/functional: add id_equals helper @@ -1826,7 +2032,13 @@ Changes - clover: eliminate "ignoring attributes on template argument" warning - st/mesa: fix crash for drivers supporting nir defaulting to tgsi - nv50/ir: remove DUMMY edge type + +Ken Mays (1): + - haiku: fix Mesa build + +Kenneth Graunke (86): + - gallium/ddebug: Wrap resource_get_param if available - gallium/trace: Wrap resource_get_param if available - gallium/rbug: Wrap resource_get_param if available @@ -1919,6 +2131,9 @@ Changes - iris: Fix "Force Zero RTA Index Enable" setting again - driconf, glsl: Add a vs_position_always_invariant option - drirc: Set vs_position_always_invariant for Shadow of Mordor on Intel + +Kevin Strasser (14): + - drm-uapi: Update headers for fp16 formats - i965: Add helper function for allowed config formats - gallium: Use consistent approach for config format filtering @@ -1933,8 +2148,14 @@ Changes - i965: Add handling for fp16 configs - gallium: Add buffer and configs handling or fp16 formats - egl: Fix implicit declaration of ffs + +Khaled Emara (2): + - freedreno/a3xx: fix texture tiling parameters - freedreno/a3xx: fix sysmem <-> gmem tiles transfer + +Kristian Høgsberg (40): + - freedreno/a6xx: Let the GPU track streamout offsets - freedreno/a6xx: Implement primitive count queries on GPU - freedreno/a6xx: Track location of gl_Position out as we link it @@ -1976,13 +2197,25 @@ Changes - freedreno/a6xx: Turn on geometry shaders - nir: Use BITSET for tracking varyings in lower_io_arrays - freedreno/a6xx: Disable geometry shaders for release + +Krzysztof Raszkowski (2): + - util: Add unreachable() definition for clang compiler. - gallium/swr: Enable GL_ARB_gpu_shader5: multiple streams + +Laurent Carlier (1): + - egl: avoid local modifications for eglext.h Khronos standard header file + +Leo Liu (3): + - radeon/vcn: add RENOIR VCN decode support - radeon/vcn: Add VP9 8K decode support - radeonsi: enable 8K video decode support for HEVC and VP9 + +Lepton Wu (14): + - st/mesa: Allow zero as [level|layer]_override - virgl: Fix pipe_resource leaks under multi-sample. - egl/android: Only keep BGRA EGL configs as fallback @@ -1997,6 +2230,9 @@ Changes - mapi: Inline call x86_current_tls. - mapi: Improve the x86 tsd stubs performance. - gallium: dri2: Use index as plane number. + +Lionel Landwerlin (59): + - glsl/tests: take refs on glsl types - nir/tests: take reference on glsl types - compiler: ensure glsl types are not created without a reference @@ -2057,6 +2293,9 @@ Changes - intel/perf: fix improper pointer access - anv: fix missing gen12 handling - anv: fix incorrect VMA alignment for CCS main surfaces + +Lucas Stach (17): + - etnaviv: fix vertex buffer state emission for single stream GPUs - gallium/util: don't depend on implementation defined behavior in listen() @@ -2076,6 +2315,9 @@ Changes - etnaviv: check for softpin availability on Halti5 devices - etnaviv: add linear texture support on GC7000 - etnaviv: GC7000: flush TX descriptor and instruction cache + +Marek Olšák (161): + - radeonsi/gfx10: fix the legacy pipeline by storing as_ngg in the shader cache - radeonsi: move some global shader cache flags to per-binary flags @@ -2266,12 +2508,21 @@ Changes - winsys/amdgpu: use the new GPU reset query - st/mesa: fix Sanctuary and Tropics by disabling ARB_gpu_shader5 for them + +Marek Vasut (4): + - etnaviv: Make contexts track resources - etnaviv: Rework resource status tracking - etnaviv: Command buffer realloc - etnaviv: Rework locking + +Marijn Suijten (2): + - freedreno/a5xx: enable a510 - freedreno/ir3: Add missing ir3_nir_lower_tex_prefetch.c to Android.mk + +Matt Turner (6): + - clover: Remove unused code - intel/compiler: Remove unreachable() from brw_reg_type.c - intel/compiler: Restructure instruction compaction in preparation for @@ -2279,6 +2530,9 @@ Changes - intel/compiler: Inline get_src_index() - intel/compiler: Make separate src0/src1 index tables - intel/compiler: Add instruction compaction support on Gen12 + +Mauro Rossi (8): + - android: mesa: revert "Enable asm unconditionally" - android: anv: libmesa_vulkan_common: add libmesa_util static dependency @@ -2288,10 +2542,19 @@ Changes - android: amd/common: export amd/llvm headers - android: aco: fix Lower to CSSA - android: radeonsi: fix build after vl refactoring (v2) + +Maya Rashish (3): + - intel/compiler: avoid truncating int64_t to int - meson: Test for -Wl,--build-id=sha1 - llvmpipe: avoid left-shifting a negative number. + +Michael Schellenberger Costa (1): + - aco: Cleanup insert_before_logical_end + +Michel Dänzer (48): + - gitlab-ci: Move up meson-main job definition - gitlab-ci: Use new needs: keyword - gitlab-ci: Explicitly install linux-libc-dev for foreign @@ -2345,7 +2608,13 @@ Changes - gitlab-ci: Only run the pipeline if any files affecting it have changed - gitlab-ci: Disable meson-windows job for the time being + +Michel Zou (1): + - scons: add py3 support + +Nanley Chery (47): + - anv/blorp: Use BLORP_BATCH_NO_UPDATE_CLEAR_COLOR - anv: Properly allocate aux-tracking space for CCS_E - anv/formats: Disable I915_FORMAT_MOD_Y_TILED_CCS on TGL+ @@ -2393,15 +2662,30 @@ Changes - gallium/dri2: Fix creation of multi-planar modifier images - gallium: Store the image format in winsys_handle - iris: Fix import of multi-planar surfaces with modifiers + +Nataraj Deshpande (1): + - egl/android: Enable HAL_PIXEL_FORMAT_RGBA_FP16 format + +Neil Armstrong (1): + - Revert "ci: Disable lima until its farm can get fixed." + +Neil Roberts (6): + - glsl: Store the precision for a function return type - nir/builder: Move nir_atan and nir_atan2 from SPIR-V translator - nir/builtin: Add #include u_math.h to the header - nir/builtin: Add extern "C" guards to nir_builtin_builder.h - glsl: Add opcodes for atan and atan2 - glsl/builtin: Add alternate versions of atan using new ops + +OBATA Akio (1): + - util: fix to detect NetBSD properly + +Paulo Zanoni (8): + - intel/fs: grab fail_msg from v32 instead of v16 when v32->run_cs fails - intel/fs: make scan/reduce work with SIMD32 when it fits 2 registers @@ -2411,13 +2695,22 @@ Changes - intel/fs: don't forget the stride at generate_shuffle - intel/compiler: remove the operand restriction for src1 on GLK - intel/compiler: fix nir_op_{i,u}*32 on ICL + +Pierre Moreau (5): + - meson: Check for SPIRV-Tools and llvm-spirv - clover/spirv: Add functions for validating SPIR-V binaries - clover/spirv: Add functions for parsing arguments, linking programs, etc. - clover/llvm: Add options for dumping SPIR-V binaries - clover/llvm: Add functions for compiling from source to SPIR-V + +Pierre-Eric Pelloux Prayer (1): + - mesa: implement glTextureStorageNDEXT functions + +Pierre-Eric Pelloux-Prayer (23): + - glsl: replace 'x + (-x)' with constant 0 - mesa: fix invalid target error handling for teximage - mesa: add EXT_dsa glNamedRenderbufferStorageEXT and @@ -2443,6 +2736,9 @@ Changes - radeonsi: fix shader disk cache key - radeonsi: fix multi plane buffers creation - radeonsi: use gfx9.surf_offset to compute texture offset + +Plamena Manolova (8): + - genxml: Add 3DSTATE_DEPTH_BOUNDS instruction. - iris: Add support for depth bounds testing. - anv: Add support for depth bounds testing. @@ -2451,6 +2747,9 @@ Changes - genxml: Add 3DSTATE_SO_BUFFER_INDEX\_\* instructions - iris: Implement new way for setting streamout buffers. - anv: Implement new way for setting streamout buffers. + +Prodea Alexandru-Liviu (4): + - scons/windows: Fix build with LLVM>=8 - scons/MSYS2-MinGW-W64: Fix build options defaults Signed-off-by: Prodea Alexandru-Liviu Reviewed-by: Jose @@ -2461,10 +2760,16 @@ Changes - Meson: Remove lib prefix from graw and osmesa when building with Mingw. Also remove version sufix from osmesa swrast on Windows. + +Qiang Yu (4): + - lima: move format handling to unified place - lima: implement EGL_KHR_partial_update - lima: don't use damage system when full damage - lima: move damage bound build to resource + +Rafael Antognolli (13): + - anv: Only re-emit non-dynamic state that has changed. - intel/tools: Fix aubinator usage of rb_tree. - anv/block_pool: Align anv_block_pool state to 64 bits. @@ -2478,6 +2783,9 @@ Changes - iris: Add Tile Cache Flush for Unified Cache. - blorp: Add Tile Cache Flush for Unified Cache. - anv: Add Tile Cache Flush for Unified Cache. + +Rhys Perry (84): + - nir/lower_io_to_vector: allow FS outputs to be vectorized - nir/lower_io_to_vector: add flat mode - util: include u_endian.h in u_math.h @@ -2565,6 +2873,9 @@ Changes - aco: add v_nop inbetween exec write and VMEM/DS/FLAT - radv: set writes_memory for global memory stores/atomics - nir/lower_io_to_vector: don't create arrays when not needed + +Rob Clark (60): + - freedreno/ir3: convert block->predecessors to set - freedreno/ir3: maintain predecessors/successors - freedreno/ir3: do better job of marking convergence points @@ -2626,15 +2937,30 @@ Changes - freedreno/ir3: fix gpu hang with pre-fs-tex-fetch - Revert "freedreno/ir3: enable pre-fs texture fetch for a6xx" - nir/lower_clip: Fix incorrect driver loc for clipdist outputs + +Robin Murphy (1): + - egl/gbm: Fix config validation + +Rohan Garg (3): + - panfrost: Remove unused argument from panfrost_drm_submit_vs_fs_job() - panfrost: Jobs must be per context, not per screen - panfrost: protect access to shared bo cache and transient pool + +Roland Scheidegger (4): + - gallivm: use fallback code for mul_hi with llvm >= 7.0 - llvmpipe: fix CALLOC vs. free mismatches - llvmpipe: increase max texture size to 2GB - gallivm: Fix saturated signed psub/padd intrinsics on llvm 8 + +Roman Stratiienko (1): + - lima: Return fence unconditionally + +Sagar Ghuge (26): + - intel/eu/gen12: Implement immediate 64 bit constant encoding. - nir: Add alpha_to_coverage lowering pass - intel/compiler: Remove emit_alpha_to_coverage workaround from backend @@ -2663,6 +2989,9 @@ Changes - iris: Prepare resources before stencil blit operation - iris: Resolve stencil resource prior to copy or used by CPU - intel/isl: Allow stencil buffer to support compression on Gen12+ + +Samuel Iglesias Gonsálvez (26): + - spirv: check support for SPV_KHR_float_controls capabilities - spirv/nir: keep track of SPV_KHR_float_controls execution modes - nir: add auxiliary functions to detect if a mode is enabled @@ -2699,6 +3028,9 @@ Changes - anv: enable VK_KHR_shader_float_controls and SPV_KHR_float_controls - docs/relnotes: add support for VK_KHR_shader_float_controls on Intel - nir/algebraic: refactor inexact opcode restrictions + +Samuel Pitoiset (136): + - radv/gfx10: tidy up gfx10_format_table.py - radv/gfx10: hardcode some depth+stencil formats in the format table - radv: allow to enable VK_AMD_shader_ballot only on GFX8+ @@ -2841,11 +3173,20 @@ Changes - radv: fix enabling sample shading with SampleID/SamplePosition - radv/gfx10: fix implementation of exclusive scans - ac/nir: fix out-of-bound access when loading constants from global + +Sergii Romantsov (4): + - intel/dri: finish proper glthread - nir/large_constants: more careful data copying - nir/large_constants: pass after lowering copy_deref - meta: leak of shader program when decompressing tex-images + +Stephen Barber (1): + - nouveau: add idep_nir_headers as dep for libnouveau + +Tapani Pälli (23): + - util: fix os_create_anonymous_file on android - iris/android: fix build and link with libmesa_intel_perf - egl: reset blob cache set/get functions on terminate @@ -2872,10 +3213,19 @@ Changes to the buffers" - Revert "dri_interface: add interface for EGL_EXT_image_flush_external" + +Thomas Hellstrom (2): + - svga: Fix banded DMA upload unmap - winsys/svga: Limit the maximum DMA hardware buffer size + +Thong Thai (2): + - Revert "radeonsi: don't emit PKT3_CONTEXT_CONTROL on amdgpu" - radeonsi: add JPEG decode support for VCN 2.0 devices + +Timothy Arceri (35): + - radeonsi/nir: fix number of used samplers - util/disk_cache: bump thread count assigned to disk cache queue - util/u_queue: track job size and limit the size of queue growth @@ -2913,6 +3263,9 @@ Changes - radv: add a secure_compile_open_fifo_fds() helper - radv: create a fresh fork for each pipeline compile - glsl/nir: iterate the system values list when adding varyings + +Timur Kristóf (48): + - st/nine: Properly initialize GLSL types for NIR shaders. - nir: Carve out nir_lower_samplers from GLSL code. - tgsi_to_nir: Remove dependency on libglsl. @@ -2961,6 +3314,9 @@ Changes - aco: Introduce vgpr_limit to keep track of available VGPRs. - radv: Enable ACO on Navi. - ac: Handle invalid GFX10 format correctly in ac_get_tbuffer_format. + +Tomeu Vizoso (19): + - panfrost/ci: Use Volt-based runner for dEQP tests - panfrost/ci: Print bootstrap log - panfrost/ci: Build kernel with CONFIG_DETECT_HUNG_TASK @@ -2980,7 +3336,13 @@ Changes - gitlab-ci/lava: Test Lima driver with dEQP - panfrost: Keep track of active BOs - gitlab-ci: Update kernel for LAVA jobs to 5.4-rc4 + +Urja Rannikko (1): + - panfrost: allocate bo for occlusion query results + +Vasily Khoruzhick (35): + - lima/ppir: refactor const lowering - lima/ppir: clone ld_{uni,tex,var} into each block - lima/ppir: add support for unconditional branches and condition @@ -3019,6 +3381,9 @@ Changes - lima: set uniforms_address lower bits properly - lima/ppir: don't clone texture loads - lima: fix PP stack size + +Vinson Lee (7): + - glx: Fix up glXQueryGLXPbufferSGIX on macOS. - swr: Fix build with llvm-9.0 again. - travis: Fail build if any command in if statement fails. @@ -3026,14 +3391,38 @@ Changes - swr: Fix make_unique build error. - scons: Add coroutines component to build. - meson: Add coroutines component to llvmpipe build. + +Wladimir J. van der Laan (1): + - etnaviv: GC7000: Texture descriptors + +Yevhenii Kolesnikov (2): + - glsl: Enable textureSize for samplerExternalOES - meson: Fix linkage of libgallium_nine with libgalliumvl + +Zebediah Figura (1): + - Revert "draw: revert using correct order for prim decomposition." + +Zhaowei Yuan (1): + - broadcom/vc4: Expand width of dst surface + +Zhu, James (1): + - radeon: Fix mjpeg issue for ARCTURUS + +nia (1): + - loader: include limits.h for PATH_MAX + +pal1000 (3): + - scons/windows: Support build with LLVM 9. - scons: Fix MSYS2 Mingw-w64 build. - scons/windows: Enable compute shaders when possible. + +renchenglei (1): + - egl/android: Enable HAL_PIXEL_FORMAT_RGBA_1010102 format diff --git a/docs/relnotes/19.3.1.rst b/docs/relnotes/19.3.1.rst index fe1f946ca27..83c2e474de0 100644 --- a/docs/relnotes/19.3.1.rst +++ b/docs/relnotes/19.3.1.rst @@ -39,17 +39,40 @@ Bug fixes Changes ------- +Bas Nieuwenhuizen (2): + - amd/common: Fix tcCompatible degradation on Stoney. - amd/common: Always use addrlib for HTILE tc-compat. + +Dylan Baker (3): + - docs/19.3.0: Add SHA256 sums - cherry-ignore: update for the 19.3.1 cycle - docs: remove new_features.txt from stable branch + +Gert Wollny (1): + - virgl: Increase the shader transfer buffer by doubling the size + +Iván Briano (1): + - anv: Export filter_minmax support only when it's really supported + +Kenneth Graunke (1): + - iris: Default to X-tiling for scanout buffers without modifiers + +Lionel Landwerlin (2): + - anv: fix fence underlying primitive checks - mesa: avoid triggering assert in implementation + +Luis Mendes (1): + - radv: fix radv secure compile feature breaks compilation on armhf EABI and aarch64 + +Tapani Pälli (2): + - dri: add \__DRI_IMAGE_FORMAT_SXRGB8 - i965: expose MESA_FORMAT_B8G8R8X8_SRGB visual diff --git a/docs/relnotes/19.3.2.rst b/docs/relnotes/19.3.2.rst index 3957e44fe7e..1f7b1f03b13 100644 --- a/docs/relnotes/19.3.2.rst +++ b/docs/relnotes/19.3.2.rst @@ -54,10 +54,15 @@ Bug fixes Changes ------- +Andrii Simiklit (3): + - glsl: fix an incorrect max_array_access after optimization of ssbo/ubo - glsl: fix a binding points assignment for ssbo/ubo arrays - glsl/nir: do not change an element index to have correct block name + +Bas Nieuwenhuizen (7): + - radv: Limit workgroup size to 1024. - radv: Expose all sample counts for integer formats as well. - amd/common: Handle alignment of 96-bit formats. @@ -67,29 +72,65 @@ Changes - radv: Only use the gfx mipmap level offset/pitch for linear textures. - radv: Emit a BATCH_BREAK when changing pixel shaders or CB_TARGET_MASK. + +Caio Marcelo de Oliveira Filho (4): + - intel/fs: Lower 64-bit MOVs after lower_load_payload() - intel/fs: Fix lowering of dword multiplication by 16-bit constant - intel/vec4: Fix lowering of multiplication by 16-bit constant - anv: Ignore some CreateInfo structs when rasterization is disabled + +Christian Gmeiner (1): + - etnaviv: update resource status after flushing + +Dylan Baker (2): + - dcos: add releanse notes for 19.3.1 - cherry-ignore: update for 19.3.2 + +Eric Engestrom (4): + - util/format: remove left-over util_format_description_table declaration - amd: fix empty-body issues - nine: fix empty-body-issues - mesa: avoid returning a value in a void function + +Gert Wollny (1): + - r600: Fix maximum line width + +Jason Ekstrand (2): + - anv: Properly advertise sampledImageIntegerSampleCounts - intel/nir: Add a memory barrier before barrier() + +Lionel Landwerlin (2): + - loader: fix close on uninitialized file descriptor value - anv: don't close invalid syncfd semaphore + +Marek Olšák (2): + - winsys/radeon: initialize pte_fragment_size - radeonsi: disable SDMA on gfx8 to fix corruption on RX 580 + +Pierre-Eric Pelloux-Prayer (2): + - radeon/vcn2: enable rate control for hevc encoding - radeonsi: check ctx->sdma_cs before using it + +Samuel Pitoiset (2): + - radv/gfx10: fix the out-of-bounds check for vertex descriptors - radv: return the correct pitch for linear mipmaps on GFX10 + +Timur Kristóf (1): + - aco: Fix uniform i2i64. + +Yevhenii Kolesnikov (2): + - meta: Cleanup function for DrawTex - main: allow external textures for BindImageTexture diff --git a/docs/relnotes/19.3.3.rst b/docs/relnotes/19.3.3.rst index e8cb47daf18..9c5e7f7b459 100644 --- a/docs/relnotes/19.3.3.rst +++ b/docs/relnotes/19.3.3.rst @@ -50,19 +50,45 @@ Bug fixes Changes ------- +Adam Jackson (1): + - drisw: Cache the depth of the X drawable + +Andrii Simiklit (1): + - mesa/st: fix a memory leak in get_version + +Bas Nieuwenhuizen (2): + - radv: Disable VK_EXT_sample_locations on GFX10. - radv: Remove syncobj_handle variable in header. + +Caio Marcelo de Oliveira Filho (1): + - intel/fs: Only use SLM fence in compute shaders + +Daniel Schürmann (2): + - aco: fix unconditional demote_to_helper - aco: rework lower_to_cssa() + +Dylan Baker (3): + - docs: add SHA256 sums for 19.3.2 - cherry-ignore: Update for 19.3.3 - .pick_status.json: Update to c787b8d2a16d5e2950f209b1fcbec6e6c0388845 + +Eric Anholt (1): + - mesa: Fix detection of invalidating both depth and stencil. + +Eric Engestrom (1): + - meson: use github URL for wraps instead of completely unreliable wrapdb + +Erik Faye-Lund (8): + - docs: fix typo in html tag name - docs: fix paragraphs - docs: open paragraph before closing it @@ -71,12 +97,24 @@ Changes - docs: use code-tags instead of pre-tags - docs: move paragraph closing tag - docs: remove double-closed definition-list + +Francisco Jerez (3): + - glsl: Fix software 64-bit integer to 32-bit float conversions. - intel/fs/gen11+: Handle ROR/ROL in lower_simd_width(). - intel/fs/gen8+: Fix r127 dst/src overlap RA workaround for EOT message payload. + +Hyunjun Ko (1): + - turnip: fix invalid VK_ERROR_OUT_OF_POOL_MEMORY + +Jan Vesely (1): + - clover: Initialize Asm Parsers + +Jason Ekstrand (8): + - anv: Flag descriptors dirty when gl_NumWorkgroups is used - intel/vec4: Support scoped_memory_barrier - intel/blorp: Fill out all the dwords of MI_ATOMIC @@ -85,21 +123,42 @@ Changes - anv/blorp: Rename buffer image stride parameters - anv: Canonicalize buffer formats for image/buffer copies - anv: Stop allocating WSI event fences off the instance + +Jonathan Marek (1): + - st/mesa: don't lower YUV when driver supports it natively + +Kenneth Graunke (2): + - intel/compiler: Fix illegal mutation in get_nir_image_intrinsic_image - intel: Fix aux map alignments on 32-bit builds. + +Lasse Lopperi (1): + - freedreno/drm: Fix memory leak in softpin implementation + +Lionel Landwerlin (4): + - anv: fix intel perf queries availability writes - anv: only use VkSamplerCreateInfo::compareOp if enabled - intel/perf: expose timestamp begin for mdapi - intel/perf: report query split for mdapi + +Marek Olšák (4): + - ac/gpu_info: always use distributed tessellation on gfx10 - radeonsi: work around an LLVM crash when using llvm.amdgcn.icmp.i64.i1 - radeonsi: clean up how internal compute dispatches are handled - radeonsi: don't invoke decompression inside internal launch_grid + +Nataraj Deshpande (1): + - egl/android: Restrict minimum triple buffering for android color_buffers + +Pierre-Eric Pelloux-Prayer (8): + - radeonsi: release saved resources in si_retile_dcc - radeonsi: release saved resources in si_compute_expand_fmask - radeonsi: release saved resources in si_compute_clear_render_target @@ -108,6 +167,9 @@ Changes - radeonsi: fix fmask expand compute shader - radeonsi: make sure fmask expand is done if needed - util: call bind_sampler_states before setting sampler_views + +Rhys Perry (8): + - aco: set vm for pos0 exports on GFX10 - aco: fix imageSize()/textureSize() with large buffers on GFX8 - aco: fix uninitialized data in the binary @@ -117,9 +179,24 @@ Changes - aco: check if multiplication/clamp is live when applying output modifier - aco: fix off-by-one error when initializing sgpr_live_in + +Samuel Pitoiset (2): + - radv: only use VkSamplerCreateInfo::compareOp if enabled - radv: fix double free corruption in radv_alloc_memory() + +Samuel Thibault (1): + - meson: Do not require libdrm for DRI2 on hurd + +Tapani Pälli (1): + - egl/android: fix buffer_count for applications setting max count + +Thong Thai (1): + - mesa: Prevent \_MaxLevel from being less than zero + +Timur Kristóf (1): + - aco/gfx10: Fix VcmpxExecWARHazard mitigation. diff --git a/docs/relnotes/19.3.4.rst b/docs/relnotes/19.3.4.rst index 6e7d86f4f04..634d2773ca0 100644 --- a/docs/relnotes/19.3.4.rst +++ b/docs/relnotes/19.3.4.rst @@ -46,13 +46,30 @@ Bug fixes Changes ------- +Bas Nieuwenhuizen (1): + - radv: Do not set SX DISABLE bits for RB+ with unused surfaces. + +Boris Brezillon (1): + - panfrost: Fix the damage box clamping logic + +Brian Ho (2): + - anv: Properly fetch partial results in vkGetQueryPoolResults - anv: Handle unavailable queries in vkCmdCopyQueryPoolResults + +Danylo Piliaiev (2): + - i965: Do not set front_buffer_dirty if there is no front buffer - st/mesa: Handle the rest renderbuffer formats from OSMesa + +Drew Davenport (1): + - radeonsi: Clear uninitialized variable + +Dylan Baker (15): + - docs: Add SHA 256 sums for 19.3.3 - .pick_status.json: Mark 58c929be0ddbbd9291d0dadbf11538170178e791 as backported @@ -72,37 +89,85 @@ Changes - .pick_status.json: Update to d8bae10bfe0f487dcaec721743cd51441bcc12f5 - .pick_status.json: Update to 689817c9dfde9a0852f2b2489cb0fa93ffbcb215 - .pick_status.json: Update to 23037627359e739c42b194dec54875aefbb9d00b + +Eric Anholt (1): + - Revert "gallium: Fix big-endian addressing of non-bitmask array formats." + +Florian Will (1): + - radv/winsys: set IB flags prior to submit in the sysmem path + +Georg Lehmann (3): + - Correctly wait in the fragment stage until all semaphores are signaled - Vulkan Overlay: Don't try to change the image layout to present twice - Vulkan overlay: use the corresponding image index for each swapchain + +Hyunjun Ko (1): + - freedreno/ir3: put the conversion back for half const to the right place. + +Ian Romanick (1): + - intel/fs: Don't count integer instructions as being possibly coissue + +Jan Vesely (1): + - clover: Use explicit conversion from llvm::StringRef to std::string + +Jason Ekstrand (6): + - anv: Insert holes for non-existant XFB varyings - anv: Improve BTI change cache flushing - anv,iris: Set 3DSTATE_SF::DerefBlockSize to per-poly on Gen12+ - genxml: Add a new 3DSTATE_SF field on gen12 - intel/fs: Write the address register with NoMask for MOV_INDIRECT - anv/blorp: Use the correct size for vkCmdCopyBufferToImage + +Kenneth Graunke (1): + - i965: Use brw_batch_references in tex_busy check + +Lionel Landwerlin (1): + - isl: drop CCS row pitch requirement for linear surfaces + +Marek Olšák (1): + - radeonsi: fix the DCC MSAA bug workaround + +Marek Vasut (1): + - etnaviv: Destroy rsc->pending_ctx set in etna_resource_destroy() + +Michel Dänzer (6): + - winsys/amdgpu: Keep a list of amdgpu_screen_winsyses in amdgpu_winsys - winsys/amdgpu: Keep track of retrieved KMS handles using hash tables - winsys/amdgpu: Only re-export KMS handles for different DRM FDs - util: Add os_same_file_description helper - winsys/amdgpu: Re-use amdgpu_screen_winsys when possible - winsys/amdgpu: Close KMS handles for other DRM file descriptions + +Neha Bhende (1): + - svga: fix size of format_conversion_table[] + +Pierre-Eric Pelloux-Prayer (2): + - radeonsi: disable display DCC - radeonsi: stop using the VM_ALWAYS_VALID flag + +Rafael Antognolli (1): + - intel: Load the driver even if I915_PARAM_REVISION is not found. + +Rhys Perry (6): + - aco: fix operand to scc when selecting SGPR ufind_msb/ifind_msb - aco: ensure predecessors' p_logical_end is in WQM when a p_phi is in WQM @@ -113,11 +178,26 @@ Changes headers - aco: fix target calculation when vgpr spilling introduces sgpr spilling + +Samuel Pitoiset (2): + - radv: do not allow sparse resources with multi-planar formats - nir: do not use De Morgan's Law rules for flt and fge + +Tapani Pälli (2): + - mapi: add GetInteger64vEXT with EXT_disjoint_timer_query - mesa: allow bit queries for EXT_disjoint_timer_query + +Thomas Hellstrom (1): + - svga: Fix banded DMA upload + +Vasily Khoruzhick (1): + - lima: ppir: don't delete root ld_tex nodes without successors in current block + +Vinson Lee (1): + - swr: Fix GCC 4.9 checks. diff --git a/docs/relnotes/19.3.5.rst b/docs/relnotes/19.3.5.rst index 0110491325f..749f30029f5 100644 --- a/docs/relnotes/19.3.5.rst +++ b/docs/relnotes/19.3.5.rst @@ -40,11 +40,28 @@ Bug fixes Changes ------- +Andrii Simiklit (1): + - Revert "glx: convert glx_config_create_list to one big calloc" + +Arcady Goldmints-Orlov (1): + - spirv: Remove outdated SPIR-V decoration warnings + +Caio Marcelo de Oliveira Filho (1): + - intel/gen12: Take into account opcode when decoding SWSB + +Danylo Piliaiev (1): + - i965: Do not generate D16 B5G6R5_UNORM configs on gen < 8 + +Dave Airlie (1): + - gallivm/tgsi: fix stream id regression + +Dylan Baker (7): + - docs: Add SHA256 sum for 19.3.4 - .pick_status.json: Update to 2a98cf3b2ecea43cea148df7f77d2abadfd1c9db - .pick_status.json: Update to 946eacbafb47c8b94d47e7c9d2a8b02fff5a22fa @@ -52,6 +69,9 @@ Changes - .pick_status.json: Update to 8291d728dc997e87b4d2e4e451692643a1dba881 - .pick_status.json: Update to e4baff90812d799d586296fcad992ddcc553c359 - .pick_status.json: Update to 01496e3d1ea0370af03e6645dbd2b864c2ace94c + +Eric Engestrom (10): + - .pick_status.json: Update to 74e4cda64b9d114321216eefe536f80644b0f0fd - .pick_status.json: Mark dba71de5c63617677fe44558f995d35fad643413 as denominated @@ -68,39 +88,93 @@ Changes - bin/gen_release_notes.py: fix commit list command - .pick_status.json: Update to 0103f02acb10dcdea23461ba214307a6827a7772 - gitlab-ci: update template to fix container build issues + +Erik Faye-Lund (2): + - util: promote u_debug_memory.c to src/util - .pick_status.json: Update to 74e4cda64b9d114321216eefe536f80644b0f0fd + +Francisco Jerez (1): + - intel/fs/gen12: Fixup/simplify SWSB annotations of SIMD32 scratch writes. + +Ian Romanick (1): + - intel/fs: Correctly handle multiply of fsign with a source modifier + +Jason Ekstrand (3): + - isl: Set 3DSTATE_DEPTH_BUFFER::Depth correctly for 3D surfaces - iris: Don't skip fast depth clears if the color changed - anv: Parse VkPhysicalDeviceFeatures2 in CreateDevice + +Jordan Justen (1): + - intel/compiler: Restrict cs_threads to 64 + +Jose Maria Casanova Crespo (1): + - v3d: Sync on last CS when non-compute stage uses resource written by CS + +Kristian Høgsberg (2): + - Revert "glsl: Use a simpler formula for tanh" - Revert "spirv: Use a simpler and more correct implementaiton of tanh()" + +Krzysztof Raszkowski (1): + - gallium/swr: simplify environmental variabled expansion code + +Marek Olšák (3): + - radeonsi: don't wait for shader compilation to finish when destroying a context - mesa: fix immediate mode with tessellation and varying patch vertices - Revert "mesa: check for z=0 in \_mesa_Vertex3dv()" + +Mathias Fröhlich (3): + - egl: Implement getImage/putImage on pbuffer swrast. - egl: Fix A2RGB10 platform_{device,surfaceless} PBuffer configs. - mesa: Flush vertices before changing the OpenGL state. + +Michel Dänzer (1): + - st/vdpau: Only call is_video_format_supported hook if needed + +Paulo Zanoni (1): + - intel/device: bdw_gt1 actually has 6 eus per subslice + +Peng Huang (1): + - radeonsi: make si_fence_server_signal flush pipe without work + +Rafael Antognolli (1): + - intel/gen12+: Disable mid thread preemption. + +Samuel Pitoiset (3): + - ac/llvm: fix 64-bit fmed3 - ac/llvm: fix 16-bit fmed3 on GFX8 and older gens - ac/llvm: flush denorms for nir_op_fmed3 on GFX8 and older gens + +Tapani Pälli (5): + - iris: fix aux buf map failure in 32bits app on Android - mesa: introduce boolean toggle for EXT_texture_norm16 - i965: toggle on EXT_texture_norm16 - mesa/st: toggle EXT_texture_norm16 based on format support - mesa/st: fix formats required for EXT_texture_norm16 + +Timothy Arceri (1): + - glsl: fix gl_nir_set_uniform_initializers() for image arrays + +luc (1): + - zink: confused compilation macro usage for zink in target helpers. diff --git a/docs/relnotes/20.0.0.rst b/docs/relnotes/20.0.0.rst index 4827c393c5e..830f2f04698 100644 --- a/docs/relnotes/20.0.0.rst +++ b/docs/relnotes/20.0.0.rst @@ -204,20 +204,43 @@ Bug fixes Changes ------- +Aaron Watry (1): + - clover/llvm: fix build after llvm 10 commit 1dfede3122ee + +Adam Jackson (1): + - drisw: Cache the depth of the X drawable + +Afonso Bordado (4): + - pan/midgard: Optimize comparisions with similar operations - pan/midgard: Move midgard_is_branch_unit to helpers - pan/midgard: Optimize branches with inverted arguments - pan/midgard: Fix midgard_compile.h includes + +Alan Coopersmith (1): + - intel/perf: adapt to platforms like Solaris without d_type in struct dirent + +Alejandro Piñeiro (4): + - v3d: adds an extra MOV for any sig.ld\* - mesa/main/util: moving gallium u_mm to util, remove main/mm - nir/opt_peephole_select: remove unused variables - turnip: remove unused descriptor state dirty + +Alexander van der Grinten (1): + - egl: Fix \_eglPointerIsDereferencable w/o mincore() + +Alexander von Gluck IV (1): + - haiku/hgl: Fix build via header reordering + +Alyssa Rosenzweig (223): + - pipe-loader: Build kmsro loader for with all kmsro targets - pan/midgard: Remove OP_IS_STORE_VARY - pan/midgard: Add a dummy source for loads @@ -441,6 +464,9 @@ Changes - pan/midgard: Fix missing prefixes - pan/midgard: Don't crash with constants on unknown ops - pan/midgard: Use fprintf instead of printf for constants + +Andreas Baierl (14): + - lima: Beautify stream dumps - lima: Parse VS and PLBU command stream while making a dump - lima/streamparser: Fix typo in vs semaphore parser @@ -455,13 +481,22 @@ Changes - lima/parser: Make rsw alpha blend parsing more readable - lima: Add stencil support - lima: Fix alpha blending + +Andres Rodriguez (1): + - vulkan/wsi: disable the hardware cursor + +Andrii Simiklit (5): + - main: fix several 'may be used uninitialized' warnings - glsl: fix an incorrect max_array_access after optimization of ssbo/ubo - glsl: fix a binding points assignment for ssbo/ubo arrays - glsl/nir: do not change an element index to have correct block name - mesa/st: fix a memory leak in get_version + +Anthony Pesch (5): + - util: import xxhash - util: move fnv1a hash implementation into its own header - util/hash_table: replace \_mesa_hash_data's fnv1a hash function with @@ -469,8 +504,14 @@ Changes - util/hash_table: added hash functions for integer types - util/hash_table: update users to use new optimal integer hash functions + +Anuj Phogat (2): + - intel: Add device info for 1x4x6 Jasper Lake - intel: Add pci-ids for Jasper Lake + +Arno Messiaen (5): + - lima: fix stride in texture descriptor - lima: add layer_stride field to lima_resource struct - lima: introduce ppir_op_load_coords_reg to differentiate between @@ -478,6 +519,9 @@ Changes from a register - lima: add cubemap support - lima/ppir: add lod-bias support + +Bas Nieuwenhuizen (33): + - radv: Fix timeout handling in syncobj wait. - radv: Remove \_mesa_locale_init/fini calls. - turnip: Remove \_mesa_locale_init/fini calls. @@ -514,8 +558,17 @@ Changes - radv: Allow DCC & TC-compat HTILE with VK_IMAGE_CREATE_EXTENDED_USAGE_BIT. - radv: Do not set SX DISABLE bits for RB+ with unused surfaces. + +Ben Crocker (1): + - llvmpipe: use ppc64le/ppc64 Large code model for JIT-compiled shaders + +Bernd Kuhls (1): + - util/os_socket: Include unistd.h to fix build error + +Boris Brezillon (21): + - panfrost: MALI_DEPTH_TEST is actually MALI_DEPTH_WRITEMASK - panfrost: Destroy the upload manager allocated in panfrost_create_context() @@ -541,6 +594,9 @@ Changes - panfrost/midgard: Add a condense_writemask() helper - panfrost/midgard: Prettify embedded constant prints - panfrost: Fix the damage box clamping logic + +Brian Ho (14): + - turnip: Update tu_query_pool with turnip-specific fields - turnip: Implement vkCreateQueryPool for occlusion queries - turnip: Implement vkCmdBeginQuery for occlusion queries @@ -555,12 +611,21 @@ Changes - turnip: Free event->bo on vkDestroyEvent - turnip: Fix vkGetQueryPoolResults with available flag - turnip: Fix vkCmdCopyQueryPoolResults with available flag + +Brian Paul (4): + - s/APIENTRY/GLAPIENTRY/ in teximage.c - nir: fix a couple signed/unsigned comparison warnings in nir_builder.h - Call shmget() with permission 0600 instead of 0777 - nir: no-op C99 \_Pragma() with MSVC + +C Stout (1): + - util/vector: Fix u_vector_foreach when head rolls over + +Caio Marcelo de Oliveira Filho (24): + - spirv: Don't leak GS initialization to other stages - glsl: Check earlier for MaxShaderStorageBlocks and MaxUniformBlocks - glsl: Check earlier for MaxTextureImageUnits and MaxImageUniforms @@ -586,7 +651,13 @@ Changes - anv: Always initialize target_stencil_layout - intel/compiler: Add names for SHADER_OPCODE_[IU]SUB_SAT - nir: Make nir_deref_path_init skip trivial casts + +Chris Wilson (1): + - egl: Mention if swrast is being forced + +Christian Gmeiner (24): + - drm-shim: fix EOF case - etnaviv: rs: upsampling is not supported - etnaviv: add drm-shim @@ -611,6 +682,9 @@ Changes - etnaviv: add deqp debug option - etnaviv: drop default state for PE_STENCIL_CONFIG_EXT2 - etnaviv: drm-shim: add GC400 + +Connor Abbott (19): + - nir: Fix non-determinism in lower_global_vars_to_local - radv: Rename ac_arg_regfile - ac: Add a shared interface between radv, radeonsi, LLVM and ACO @@ -630,8 +704,14 @@ Changes - freedreno: Document CP_COND_REG_EXEC more - freedreno: Document CP_UNK_A6XX_55 - freedreno: Document CP_INDIRECT_BUFFER_CHAIN + +Daniel Ogorchock (2): + - panfrost: Fix panfrost_bo_access memory leak - panfrost: Fix headers and gpu_headers memory leak + +Daniel Schürmann (58): + - aco: fix immediate offset for spills if scratch is used - aco: only use single-dword loads/stores for spilling - aco: fix accidential reordering of instructions when scheduling @@ -693,9 +773,15 @@ Changes - aco: simplify gathering of MIMG address components - docs: add new features for RADV/ACO. - aco: fix image_atomic_cmp_swap + +Daniel Stone (2): + - Revert "st/dri: do FLUSH_VERTICES before calling flush_resource" - Revert "gallium: add st_context_iface::flush_resource to call FLUSH_VERTICES" + +Danylo Piliaiev (12): + - intel/blorp: Fix usage of uninitialized memory in key hashing - i965/program_cache: Lift restriction on shader key size - intel/blorp: Fix usage of uninitialized memory in key hashing @@ -708,6 +794,9 @@ Changes - i965: Do not set front_buffer_dirty if there is no front buffer - st/mesa: Handle the rest renderbuffer formats from OSMesa - st/nir: Unify inputs_read/outputs_written before serializing NIR + +Dave Airlie (74): + - nir/serialize: pack function has name and entry point into flags. - nir/serialize: fix serializing functions with no implementations. - spirv: don't store 0 to cs.ptr_size for non kernel stages. @@ -783,17 +872,41 @@ Changes - galllivm: fix gather offset casting - gallivm: fix find lsb - gallivm/nir: add missing break for isub. + +David Heidelberg (1): + - .mailmap: use correct email address + +David Stevens (1): + - virgl: support emulating planar image sampling + +Denis Pauk (2): + - gallium/swr: Enable support bptc format. - docs/features: mark GL_ARB_texture_compression_bptc as done for llvmpipe, softpipe, swr + +Dongwon Kim (3): + - gallium: enable INTEL_PERFORMANCE_QUERY - iris: INTEL performance query implementation - gallium: check all planes' pipe formats in case of multi-samplers + +Drew Davenport (1): + - radeonsi: Clear uninitialized variable + +Drew DeVault (1): + - st_get_external_sampler_key: improve error message + +Duncan Hopkins (1): + - zink: make sure src image is transfer-src-optimal + +Dylan Baker (69): + - Bump VERSION to 20.0.0-devel - docs/new_features: Empty the feature list for the 20.0 cycle - nir: correct use of identity check in python @@ -870,14 +983,26 @@ Changes - .pick_status.json: Update to 2a98cf3b2ecea43cea148df7f77d2abadfd1c9db - .pick_status.json: Update to 946eacbafb47c8b94d47e7c9d2a8b02fff5a22fa - .pick_status.json: Update to bee5c9b0dc13dbae0ccf124124eaccebf7f2a435 + +Eduardo Lima Mitev (2): + - turnip: Remove failed command buffer from pool - turnip: Fix issues in tu_compute_pipeline_create() that may lead to crash + +Elie Tournier (4): + - Docs: remove duplicate meson docs for windows - docs: fix ascii html representation - nir/algebraic: i2f(f2i()) -> trunc() - nir/algebraic: sqrt(x)*sqrt(x) -> fabs(x) + +Emmanuel Gil Peyrot (1): + - intel/compiler: Return early if read() failed + +Eric Anholt (102): + - ci: Make lava inherit the ccache setup of the .build script. - ci: Switch over to an autoscaling GKE cluster for builds. - Revert "ci: Switch over to an autoscaling GKE cluster for builds." @@ -984,6 +1109,9 @@ Changes - tu: Fix binning address setup after pack macros change. - Revert "gallium: Fix big-endian addressing of non-bitmask array formats." + +Eric Engestrom (58): + - meson: split out idep_xmlconfig_headers from idep_xmlconfig - anv: add missing xmlconfig headers dependency - radv: drop unnecessary xmlpool_options_h @@ -1047,6 +1175,9 @@ Changes - util/os_socket: fix header unavailable on windows - freedreno/perfcntrs: fix fd leak - util/disk_cache: check for write() failure in the zstd path + +Erico Nunes (17): + - lima: fix nir shader memory leak - lima: fix bo submit memory leak - lima/ppir: enable lower_fdph @@ -1064,6 +1195,9 @@ Changes - lima/ppir: fix ssa undef emit - lima/ppir: split ppir_op_undef into undef and dummy again - lima/ppir: fix src read mask swizzling + +Erik Faye-Lund (82): + - zink: heap-allocate samplers objects - zink: emit line-width when using polygon line-mode - anv: remove incorrect polygonMode=point early-out @@ -1146,7 +1280,13 @@ Changes - zink: set compareEnable when setting compareOp - st/mesa: use uint-result for sampling stencil buffers - Revert "nir: Add a couple trivial abs optimizations" + +Florian Will (1): + - radv/winsys: set IB flags prior to submit in the sysmem path + +Francisco Jerez (26): + - glsl: Fix software 64-bit integer to 32-bit float conversions. - intel/fs/gen11+: Handle ROR/ROL in lower_simd_width(). - intel/fs/gen8+: Fix r127 dst/src overlap RA workaround for EOT @@ -1191,11 +1331,20 @@ Changes writes. - intel/fs/gen12: Workaround data coherency issues due to broken NoMask control flow. + +Fritz Koenig (1): + - freedreno: reorder format check + +Georg Lehmann (3): + - Correctly wait in the fragment stage until all semaphores are signaled - Vulkan Overlay: Don't try to change the image layout to present twice - Vulkan overlay: use the corresponding image index for each swapchain + +Gert Wollny (12): + - r600: Disable eight bit three channel formats - virgl: Increase the shader transfer buffer by doubling the size - gallium/tgsi_from_mesa: Add 'extern "C"' to be able to include from @@ -1210,8 +1359,14 @@ Changes - r600: Delete vertex buffer only if there is actually a shader state - mesa/st: glsl_to_nir: don't lower atomics to SSBOs if driver supports HW atomics + +Guido Günther (2): + - etnaviv: drm: Don't miscalculate timeout - freedreno/drm: Don't miscalculate timeout + +Gurchetan Singh (11): + - drirc: set allow_higher_compat_version for Faster Than Light - virgl/drm: update UAPI - teximage: split out helper from EGLImageTargetTexture2DOES @@ -1223,6 +1378,9 @@ Changes - st/dri: track if image is created by a dmabuf - st/mesa: refactor egl image binding a bit - st/mesa: implement EGLImageTargetTexStorage + +Hyunjun Ko (7): + - freedreno/ir3: cleanup by removing repeated code - freedreno: support 16b for the sampler opcode - freedreno/ir3: fix printing output registers of FS. @@ -1231,6 +1389,9 @@ Changes - turnip: fix invalid VK_ERROR_OUT_OF_POOL_MEMORY - freedreno/ir3: put the conversion back for half const to the right place. + +Iago Toral Quiroga (32): + - v3d: rename vertex shader key (num)_fs_inputs fields - mesa/st: make sure we remove dead IO variables before handing NIR to backends @@ -1266,6 +1427,9 @@ Changes - v3d: expose OES_geometry_shader - u_vbuf: don't try to delete NULL driver CSO - v3d: fix bug when checking result of syncobj fence import + +Ian Romanick (39): + - intel/compiler: Report the number of non-spill/fill SEND messages on vec4 too - nir/algebraic: Add the ability to mark a replacement as exact @@ -1316,6 +1480,9 @@ Changes - relnotes: Add GL_INTEL_shader_integer_functions2 and VK_INTEL_shader_integer_functions2 - intel/fs: Don't count integer instructions as being possibly coissue + +Icecream95 (16): + - gallium/auxiliary: Reduce conversions in u_vbuf_get_minmax_index_mapped - gallium/auxiliary: Handle count == 0 in @@ -1334,8 +1501,14 @@ Changes - pan/decode: Rotate trace files - panfrost: Don't copy uniforms when the size is zero - pan/midgard: Fix a liveness info leak + +Icenowy Zheng (2): + - lima: support indexed draw with bias - lima: fix lima_set_vertex_buffers() + +Ilia Mirkin (7): + - gm107/ir: fix loading z offset for layered 3d image bindings - nv50/ir: mark STORE destination inputs as used - nv50,nvc0: fix destination coordinates of blit @@ -1343,17 +1516,32 @@ Changes - gm107/ir: avoid combining geometry shader stores at 0x60 - nvc0: treat all draws without color0 broadcast as MRT - nvc0: disable xfb's which don't have a stride + +Italo Nicola (1): + - intel/compiler: remove old comment + +Iván Briano (4): + - intel/compiler: Don't change hstride if not needed - anv: Export filter_minmax support only when it's really supported - anv: Export VK_KHR_buffer_device_address only when really supported - anv: Enable Vulkan 1.2 support + +James Xiong (3): + - iris: try to set the specified tiling when importing a dmabuf - gallium: dmabuf support for yuv formats that are not natively supported - gallium: let the pipe drivers decide the supported modifiers + +Jan Vesely (2): + - clover: Initialize Asm Parsers - clover: Use explicit conversion from llvm::StringRef to std::string + +Jan Zielinski (8): + - gallium/swr: Fix depth values for blit scenario - swr/rasterizer: Add tessellator implementation to the rasterizer - gallium/swr: Fix Windows build @@ -1362,6 +1550,9 @@ Changes - gallium/swr: implementation of tessellation shaders compilation - gallium/swr: fix tessellation state save/restore - docs: Update SWR tessellation support + +Jason Ekstrand (212): + - util: Add a util_sparse_array data structure - anv: Move refcount to anv_bo - anv: Use a util_sparse_array for the GEM handle -> BO map @@ -1575,10 +1766,16 @@ Changes - anv: Always fill out the AUX table even if CCS is disabled - intel/fs: Write the address register with NoMask for MOV_INDIRECT - anv/blorp: Use the correct size for vkCmdCopyBufferToImage + +Jonathan Gray (4): + - winsys/amdgpu: avoid double simple_mtx_unlock() - i965: update Makefile.sources for perf changes - util/futex: use futex syscall on OpenBSD - util/u_thread: don't restrict u_thread_get_time_nano() to \__linux_\_ + +Jonathan Marek (98): + - freedreno: add Adreno 640 ID - freedreno/ir3: disable texture prefetch for 1d array textures - freedreno/registers: fix a6xx_2d_blit_cntl ROTATE @@ -1678,6 +1875,9 @@ Changes - turnip: simplify tu_physical_device_get_format_properties - etnaviv: implement UBOs - turnip: hook up cmdbuffer event set/wait + +Jordan Justen (7): + - iris: Add IRIS_DIRTY_RENDER_BUFFER state flag - iris/gen11+: Move flush for render target change - iris: Allow max dynamic pool size of 2GB for gen12 @@ -1685,15 +1885,27 @@ Changes - iris: Fix some indentation in iris_init_render_context - iris: Emit CS Stall before Instruction Cache flush for gen12 WA - anv: Emit CS Stall before Instruction Cache flush for gen12 WA + +Jose Maria Casanova Crespo (1): + - v3d: Fix predication with atomic image operations + +Juan A. Suarez Romero (3): + - nir/lower_double_ops: relax lower mod() - Revert "nir/lower_double_ops: relax lower mod()" - nir/spirv: skip unreachable blocks in Phi second pass + +Kai Wasserbäch (4): + - nir: fix unused variable warning in nir_lower_vars_to_explicit_types - nir: fix unused variable warning in find_and_update_previous_uniform_storage - nir: fix unused function warning in src/compiler/nir/nir.c - intel/gen_decoder: Fix unused-but-set-variable warning + +Karol Herbst (14): + - nv50/ir: fix crash in isUniform for undefined values - nir/validate: validate num_components on registers and intrinsics - nir/serialize: fix vec8 and vec16 @@ -1709,6 +1921,9 @@ Changes - nir/serialize: cast swizzle before shifting - aco: use NIR_MAX_VEC_COMPONENTS instead of 4 - nv50ir/nir: support vec8 and vec16 + +Kenneth Graunke (57): + - iris: Fix "Force Zero RTA Index Enable" setting again - nir: Handle image arrays when setting variable data - Revert "intel/blorp: Fix usage of uninitialized memory in key @@ -1771,6 +1986,9 @@ Changes - iris: Support multiple chained batches. - i965: Use brw_batch_references in tex_busy check - loader: Fix leak of kernel driver name + +Kristian Høgsberg (62): + - freedreno/registers: Fix typo - freedreno/registers: Move SP_PRIMITIVE_CNTL and SP_VS_VPC_DST - freedreno/registers: Add comments about primitive counters @@ -1834,6 +2052,9 @@ Changes - ir3: Set up full/half register conflicts correctly - iris: Advertise PIPE_CAP_NATIVE_FENCE_FD - iris: Print warning and return \*out = NULL when fd to syncobj fails + +Krzysztof Raszkowski (10): + - gallium/swr: Fix GS invocation issues - Fixed proper setting gl_InvocationID. - Fixed GS vertices output memory overflow. - gallium/swr: Enable some ARB_gpu_shader5 extensions Enable / add to @@ -1848,13 +2069,28 @@ Changes - gallium/swr: add option for static link - gallium/swr: Fix gcc 4.8.5 compile error - gallium/swr: simplify environmental variabled expansion code + +Lasse Lopperi (1): + - freedreno/drm: Fix memory leak in softpin implementation + +Laurent Carlier (1): + - egl: avoid local modifications for eglext.h Khronos standard header file + +Leo Liu (1): + - ac: add missing Arcturus to the info of pc lines + +Lepton Wu (2): + - gallium: dri2: Use index as plane number. - android: mesa: Revert "android: mesa: revert "Enable asm unconditionally"" + +Lionel Landwerlin (60): + - intel/dev: set default num_eu_per_subslice on gen12 - intel/perf: add TGL support - intel/perf: fix Android build @@ -1916,9 +2152,18 @@ Changes - isl: add gen12 comment about CCS for linear tiling - anv: implement gen9 post sync pipe control workaround - anv: set MOCS on push constants + +Luis Mendes (1): + - radv: fix radv secure compile feature breaks compilation on armhf EABI and aarch64 + +Marco Felsch (1): + - etnaviv: Fix assert when try to accumulate an invalid fd + +Marek Olšák (245): + - glsl: encode/decode types using a union with bitfields for readability - glsl: encode vector_elements and matrix_columns better @@ -2221,21 +2466,39 @@ Changes - radeonsi: fix the DCC MSAA bug workaround - radeonsi: don't wait for shader compilation to finish when destroying a context + +Marek Vasut (5): + - etnaviv: Replace bitwise OR with logical OR - etnaviv: tgsi: Fix gl_FrontFacing support - etnaviv: Report correct number of vertex buffers - etnaviv: Do not filter out PIPE_FORMAT_S8_UINT_Z24_UNORM on pre-HALTI2 - etnaviv: Destroy rsc->pending_ctx set in etna_resource_destroy() + +Mark Janes (3): + - Revert "st/mesa: call nir_serialize only once per shader" - Revert "st/mesa: keep serialized NIR instead of nir_shader in st_program" - iris: separating out common perf code + +Markus Wick (3): + - mapi/glapi: Generate sizeof() helpers instead of fixed sizes. - mesa/glthread: Implement ARB_multi_bind. - drirc: Enable glthread for dolphin/citra/yuzu. + +Martin Fuzzey (1): + - etnaviv: update Android build files + +Mathias Fröhlich (1): + - egl: Implement getImage/putImage on pbuffer swrast. + +Matt Turner (19): + - intel/compiler: Use ARRAY_SIZE() - intel/compiler: Extract GEN\_\* macros into separate file - intel/compiler: Split has_64bit_types into float/int @@ -2255,6 +2518,9 @@ Changes - intel/compiler: Move Gen4/5 rounding to visitor - util: Explain BITSET_FOREACH_SET params - util: Remove tmp argument from BITSET_FOREACH_SET macro + +Mauro Rossi (9): + - android: aco: fix Lower to CSSA - android: radeonsi: fix build error due to wrong u_format.csv file path @@ -2265,6 +2531,9 @@ Changes - android: radv: fix vk_format_table.c generated source build - android: radeonsi,ac: fix building error due to ac changes - android: radv: build radv_shader_args.c + +Michel Dänzer (36): + - gitlab-ci: Set arm job CCACHE_DIR properly - gitlab-ci: Use separate arm64 build/test docker images - gitlab-ci: Don't build libdrm for ARM @@ -2303,23 +2572,47 @@ Changes - winsys/amdgpu: Close KMS handles for other DRM file descriptions - winsys/amdgpu: Re-use amdgpu_screen_winsys when possible - winsys/amdgpu: Close KMS handles for other DRM file descriptions + +Michel Zou (3): + - Meson: Check for dladdr with MinGW - disk_cache_get_function_timestamp: check for dladdr - Meson: Add llvm>=9 modules + +Miguel Casas-Sanchez (1): + - i965: Ensure that all 2101010 image imports can pass framebuffer completeness. + +Nanley Chery (3): + - gallium/dri2: Fix creation of multi-planar modifier images - gallium: Store the image format in winsys_handle - iris: Fix import of multi-planar surfaces with modifiers + +Nataraj Deshpande (1): + - egl/android: Restrict minimum triple buffering for android color_buffers + +Nathan Kidd (1): + - llvmpipe: Check thread creation errors + +Neha Bhende (3): + - st/mesa: release tgsi tokens for shader states - svga: fix size of format_conversion_table[] - svga: Use pipe_shader_state_from_tgsi to set shader state + +Neil Armstrong (3): + - Add support for T820 CI Jobs - ci: Remove T820 from CI temporarily - gitlab-ci/lava: add pipeline information in the lava job name + +Neil Roberts (9): + - nir/opcodes: Add a helper function to generate the comparison binops - nir/opcodes: Add a helper function to generate reduce opcodes - nir: Add a 16-bit bool type @@ -2329,6 +2622,9 @@ Changes - freedreno/ir3: Add implementation of nir_op_b16csel - freedreno/ir3: Implement f2b16 and i2b16 - freedreno/ir3: Enabling lowering 16-bit flrp + +Paul Cercueil (5): + - kmsro: Extend to include ingenic-drm - u_vbuf: Mark vbufs incompatible if more were requested than HW supports @@ -2336,15 +2632,33 @@ Changes - u_vbuf: Regard non-constant vbufs with non-instance elements as free - u_vbuf: Return true in u_vbuf_get_caps if nb of vbufs is below minimum + +Paul Gofman (1): + - state_tracker: Handle texture view min level in st_generate_mipmap() + +Paulo Zanoni (2): + - intel/compiler: remove the operand restriction for src1 on GLK - intel/compiler: fix nir_op_{i,u}*32 on ICL + +Peng Huang (1): + - radeonsi: make si_fence_server_signal flush pipe without work + +Philipp Sieweck (1): + - svga: check return value of define_query_vgpu{9,10} + +Pierre Moreau (4): + - compiler/spirv: Fix uses of gnu struct = {} extension - include/CL: Update OpenCL headers to latest - clover: Use the dispatch table type from the OpenCL headers - clover/meson: Define OpenCL header macros + +Pierre-Eric Pelloux-Prayer (54): + - radeonsi: tell the shader disk cache what IR is used - mesa: enable msaa in clear_with_quad if needed - mesa: pass vao as a function paramter @@ -2400,13 +2714,22 @@ Changes - docs: document AMD_DEBUG variable - radeonsi: stop using the VM_ALWAYS_VALID flag - radeonsi/ngg: add VGT_FLUSH when enabling fast launch + +Prodea Alexandru-Liviu (2): + - Meson: Remove lib prefix from graw and osmesa when building with Mingw. Also remove version sufix from osmesa swrast on Windows. - Appveyor: Quickly fix meson build. As this required use of Python 3.8, mako module also had to be updated. + +Qiang Yu (3): + - lima: sync lima_drm.h with kernel - lima: create heap buffer with new interface if available - lima: add noheap debug option + +Rafael Antognolli (23): + - intel/isl: Add MOCS settings to isl_device. - anv: Use mocs settings from isl_dev. - iris: Use mocs from isl_dev. @@ -2430,6 +2753,9 @@ Changes - iris: Implement WA for push constants. - utils/os_socket: Define ssize_t on windows. - intel: Load the driver even if I915_PARAM_REVISION is not found. + +Rhys Perry (131): + - radv: adjust loop unrolling heuristics for int64 - aco: add Instruction::usesModifiers() and add more checks in the optimizer @@ -2574,7 +2900,13 @@ Changes - aco: parallelcopy exec mask before s_wqm - aco: fix exec mask consistency issues - aco: fix gfx10_wave64_bpermute + +Ricardo Garcia (1): + - anv: Unify GetDeviceQueue and GetDeviceQueue2 + +Rob Clark (89): + - freedreno/ir3: split pre-coloring to it's own function - freedreno/ir3: use SSA flag on dest register too - freedreno/ir3: ir3_print tweaks @@ -2664,20 +2996,44 @@ Changes - freedreno: add gmem_lock - freedreno: remove flush-queue - freedreno: allow ctx->batch to be NULL + +Robert Foss (5): + - nir: Build nir_lower_point_size.c in libmesa_nir - android: Add panfrost support to build scripts - android: Fix u_format_table.c being generated twice - panfrost: Prefix schedule_program to prevent collision - android: Fix whitespace issue + +Rohan Garg (1): + - gitlab-ci: Use lavacli from packages + +Roland Scheidegger (3): + - gallium/scons: fix graw_gdi build - util/atomic: Fix p_atomic_add for unlocked and msvc paths - winsys/svga: use new ioctl for logging + +Roman Stratiienko (2): + - Android: Fix build issue without LLVM - panfrost: Fix Android build + +Ross Zwisler (1): + - intel: limit shader geometry on BDW GT1 + +Sagar Ghuge (1): + - intel/compiler: Clear accumulator register before EOT + +Samuel Iglesias Gonsálvez (1): + - main: fix coverity error in \_mesa_program_resource_find_name() + +Samuel Pitoiset (202): + - radv: declare NGG scratch for VS or TES and only on GFX10 - radv: fix compute pipeline keys when optimizations are disabled - docs: document all RADV environment variables @@ -2890,12 +3246,27 @@ Changes GFX8-GFX9 - aco: fix creating v_madak if v_mad_f32 has two sgpr literals - nir: do not use De Morgan's Law rules for flt and fge + +Samuel Thibault (3): + - loader: #define PATH_MAX when undefined (eg. Hurd) - util: Do not fail to build on unknown pthread_setname_np - meson: Do not require libdrm for DRI2 on hurd + +Satyajit Sahu (1): + - radeon/vcn: Handle crop parameters for encoder + +Sonny Jiang (1): + - radeonsi: use compute shader for clear 12-byte buffer + +Stephan Gerhold (1): + - kmsro: Add "mcde" entry point + +Tapani Pälli (33): + - nir: fix couple of compile warnings - util/android: fix android build errors - Revert "egl: implement new functions from @@ -2932,10 +3303,16 @@ Changes - intel/vec4: fix valgrind errors with vf_values array - glsl: fix a memory leak with resource_set - iris: fix aux buf map failure in 32bits app on Android + +Thomas Hellstrom (4): + - winsys/svga: Enable transhuge pages for buffer objects - svga: Avoid discard DMA uploads - gallium/util: Increase the debug_flush map depth - svga: Fix banded DMA upload + +Thong Thai (8): + - st/va: Convert interlaced NV12 to progressive - util/format: Add the P010 format used for 10-bit videos - gallium: Add PIPE_FORMAT_P010 support @@ -2944,6 +3321,9 @@ Changes - r600: Remove HEVC related code since HEVC is not supported - mesa: Prevent \_MaxLevel from being less than zero - Revert "st/va: Convert interlaced NV12 to progressive" + +Timothy Arceri (66): + - glsl: just use NIR to lower outputs when driver can't read outputs - glsl: disable lower_fragdata_array() for NIR drivers - mesa: add ARB_shading_language_include stubs @@ -3012,6 +3392,9 @@ Changes - glsl: count uniform components and storage better in nir linking - glsl_to_nir: update interface type properly - glsl: fix gl_nir_set_uniform_initializers() for image arrays + +Timur Kristóf (39): + - ac: Handle invalid GFX10 format correctly in ac_get_tbuffer_format. - aco: Make sure not to mistakenly propagate 64-bit constants. - aco: Treat all booleans as per-lane. @@ -3053,7 +3436,13 @@ Changes - aco/gfx10: Fix VcmpxExecWARHazard mitigation. - aco: Fix the meaning of is_atomic. - aco/optimizer: Don't combine uniform bool s_and to s_andn2. + +Tomasz Pyra (1): + - gallium/swr: Fix arb_transform_feedback2 + +Tomeu Vizoso (38): + - gitlab-ci: Disable lima jobs - gitlab-ci: Run only LAVA jobs in special-named branches - panfrost: Add checksum fields to SFBD descriptor @@ -3093,10 +3482,16 @@ Changes - gitlab-ci: Switch LAVA jobs to use shared dEQP runner - gitlab-ci: Upgrade kernel for LAVA jobs to v5.5-rc5 - gitlab-ci: Consolidate container and build stages for LAVA + +Urja Rannikko (4): + - panfrost: free last_read/write tables in mir_create_dependency_graph - panfrost: free allocations in schedule_block - panfrost: add lcra_free() to free lcra state - panfrost: free spill cost table in mir_spill_register + +Vasily Khoruzhick (31): + - lima: add debug prints for BO cache - lima: align size before trying to fetch BO from cache - lima: ignore flags while looking for BO in cache @@ -3132,6 +3527,9 @@ Changes - lima: ppir: always create move and update ld_tex successors for all blocks - lima: disable early-z if fragment shader uses discard + +Vinson Lee (9): + - swr: Fix build with llvm-10.0. - panfrost: Fix gnu-empty-initializer build errors. - scons: Bump C standard to gnu11 on macOS 10.15. @@ -3141,14 +3539,29 @@ Changes - lima: Fix build with GCC 10. - swr: Fix GCC 4.9 checks. - panfrost: Remove unused anonymous enum variables. + +Wladimir J. van der Laan (2): + - u_vbuf: add logic to use a limited number of vbufs - u_vbuf: use single vertex buffer if it's not possible to have multiple + +X512 (1): + - util/u_thread: Fix build under Haiku + +Yevhenii Kolesnikov (5): + - glsl: Enable textureSize for samplerExternalOES - meson: Fix linkage of libgallium_nine with libgalliumvl - meta: Cleanup function for DrawTex - main: allow external textures for BindImageTexture - meta: Add cleanup function for Bitmap + +Zebediah Figura (1): + - Revert "draw: revert using correct order for prim decomposition." + +luc (1): + - zink: confused compilation macro usage for zink in target helpers. diff --git a/docs/relnotes/20.0.1.rst b/docs/relnotes/20.0.1.rst index 20398053f7a..0008d8b9c5d 100644 --- a/docs/relnotes/20.0.1.rst +++ b/docs/relnotes/20.0.1.rst @@ -37,13 +37,36 @@ Bug fixes Changes ------- +Andreas Baierl (1): + - gitlab-ci: lima: Add flaky tests to the skips list + +Andrii Simiklit (1): + - Revert "glx: convert glx_config_create_list to one big calloc" + +Arcady Goldmints-Orlov (1): + - spirv: Remove outdated SPIR-V decoration warnings + +Bas Nieuwenhuizen (1): + - radeonsi: Fix compute copies for subsampled formats. + +Caio Marcelo de Oliveira Filho (1): + - intel/gen12: Take into account opcode when decoding SWSB + +Chris Wilson (1): + - iris: Fix import sync-file into syncobj + +Danylo Piliaiev (1): + - i965: Do not generate D16 B5G6R5_UNORM configs on gen < 8 + +Dave Airlie (7): + - dri: add another get shm variant. - glx/drisw: add getImageShm2 path - glx/drisw: return false if shmid == -1 @@ -51,6 +74,9 @@ Changes - gallivm/tgsi: fix stream id regression - gallivm/nir: fix integer divide SIGFPE - gallivm/nir: handle mod 0 better. + +Dylan Baker (7): + - docs: Add release notes for 20.0.0 - .pick_status.json: Update to 8291d728dc997e87b4d2e4e451692643a1dba881 - .pick_status.json: Update to e4baff90812d799d586296fcad992ddcc553c359 @@ -58,38 +84,89 @@ Changes - .pick_status.json: Update to 09323634898ab3efc0150dc7d756bf36b1b89b76 - .pick_status.json: Update to 3503cb4c28e01b34f3a25546c058150709c22348 - .pick_status.json: Update to 0ac731b1ff96de46998948aa06081efa5140d50e + +Eric Anholt (3): + - llvmpipe: Fix real uninitialized use of "atype" for SEMANTIC_FACE - turnip: Fix compiler warning about casting a nondispatchable handle. - aco: Fix signed-vs-unsigned warning. + +Erik Faye-Lund (1): + - util: promote u_debug_memory.c to src/util + +Ian Romanick (2): + - nir/search: Use larger type to hold linearized index - intel/fs: Correctly handle multiply of fsign with a source modifier + +James Xiong (1): + - iris: handle the failure of converting unsupported yuv formats to isl + +Jason Ekstrand (1): + - anv: Always enable the data cache + +Jonathan Marek (1): + - turnip: fix srgb MRT + +Jordan Justen (1): + - intel/compiler: Restrict cs_threads to 64 + +Jose Maria Casanova Crespo (1): + - v3d: Sync on last CS when non-compute stage uses resource written by CS + +Kenneth Graunke (2): + - iris: Make mocs an inline helper in iris_resource.h - iris: Fix BLORP vertex buffers to respect ISL MOCS settings + +Marek Olšák (5): + - mesa: fix immediate mode with tessellation and varying patch vertices - util: remove the dependency on kcmp.h - tgsi_to_nir: set num_images and num_samplers with holes correctly - mesa: call FLUSH_VERTICES before updating CoordReplace - mesa: fix incorrect prim.begin/end for glMultiDrawElements + +Mathias Fröhlich (2): + - egl: Fix A2RGB10 platform_{device,surfaceless} PBuffer configs. - mesa: Flush vertices before changing the OpenGL state. + +Michel Dänzer (1): + - st/vdpau: Only call is_video_format_supported hook if needed + +Paulo Zanoni (3): + - intel: fix the gen 11 compute shader scratch IDs - intel: fix the gen 12 compute shader scratch IDs - intel/device: bdw_gt1 actually has 6 eus per subslice + +Rafael Antognolli (2): + - iris: Apply the flushes when switching pipelines. - intel/gen12+: Disable mid thread preemption. + +Rhys Perry (2): + - aco: keep track of which events are used in a barrier - aco: fix carry-out size for wave32 v_add_co_u32_e64 + +Samuel Pitoiset (3): + - ac/llvm: fix 64-bit fmed3 - ac/llvm: fix 16-bit fmed3 on GFX8 and older gens - ac/llvm: flush denorms for nir_op_fmed3 on GFX8 and older gens + +Tapani Pälli (4): + - mesa: introduce boolean toggle for EXT_texture_norm16 - i965: toggle on EXT_texture_norm16 - mesa/st: toggle EXT_texture_norm16 based on format support diff --git a/docs/relnotes/20.0.2.rst b/docs/relnotes/20.0.2.rst index 03567bb1dd1..6e925b8ff6d 100644 --- a/docs/relnotes/20.0.2.rst +++ b/docs/relnotes/20.0.2.rst @@ -40,14 +40,28 @@ Bug fixes Changes ------- +Andreas Baierl (1): + - gitlab-ci: Add add a set of lima flakes + +Bas Nieuwenhuizen (2): + - amd/llvm: Fix divergent descriptor indexing. (v3) - amd/llvm: Fix divergent descriptor regressions with radeonsi. + +Danylo Piliaiev (2): + - glsl: do not crash if string literal is used outside of #include/#line - st/mesa: Fix signed integer overflow when using util_throttle_memory_usage + +Dave Airlie (1): + - gallium: fix build with latest meson and gcc10 + +Dylan Baker (8): + - docs: Add sha256sums for 20.0.1 - .pick_status.json: Update to 07f1ef5656e0721282d01a8421eaca056348137d - .pick_status.json: Update to 70341d7746c177a4cd7377ef633e9f85afd11d54 @@ -57,7 +71,13 @@ Changes - .pick_status.json: Update to ee9e0d1ecae307fa48200d2604d3114070253299 - .pick_status.json: Update to 3dd0d12aa5fefa94123269a541c94cdf57599e34 - .pick_status.json: Update to 94e37859a96cc56cf0c5418a5af00a3e9f5a1bf5 + +Eric Anholt (1): + - glsl/tests: Fix waiting for disk_cache_put() to finish. + +Eric Engestrom (7): + - bin/gen_release_notes.py: fix commit list command - .pick_status.json: Update to 24db276d11976905b2e8a44965c684bb48c3d49f - gen_release_notes: fix vulkan version reported @@ -65,8 +85,14 @@ Changes - .pick_status.json: Update to ba03e308b66b0b88f60b99d9d47851a5e1522e6e - vulkan/wsi: fix cleanup when dup() fails - gen_release_notes: fix version in "you should wait" message + +Francisco Jerez (1): + - intel/fs: Fix workaround for VxH indirect addressing bug under control flow. + +Jason Ekstrand (9): + - isl: Set 3DSTATE_DEPTH_BUFFER::Depth correctly for 3D surfaces - iris: Don't skip fast depth clears if the color changed - anv: Parse VkPhysicalDeviceFeatures2 in CreateDevice @@ -77,24 +103,48 @@ Changes - anv: Use a proper end-of-pipe sync instead of just CS stall - anv: Do end-of-pipe sync around MCS/CCS ops instead of CS stall - anv: Do an end-of-pipe sync before updating AUX table entries + +José Fonseca (1): + - meson: Avoid duplicate symbols. + +Kristian Høgsberg (2): + - Revert "glsl: Use a simpler formula for tanh" - Revert "spirv: Use a simpler and more correct implementaiton of tanh()" + +Marek Olšák (4): + - Revert "mesa: check for z=0 in \_mesa_Vertex3dv()" - radeonsi: add a bug workaround for NGG - LATE_ALLOC_GS - ac: add a bug workaround for the 100% NGG culling case - gallium/cso_context: remove cso_delete_xxx_shader helpers to fix the live cache + +Martin Fuzzey (3): + - freedreno: android: fix build failure on android due to python version - freedreno: android: add a6xx-pack.xml.h generation to android build - freedreno: android: fix build of perfcounters. + +Michel Dänzer (1): + - llvmpipe: Use uintptr_t for pointer values + +Rafael Antognolli (3): + - anv: Wait for the GPU to be idle before invalidating the aux table. - iris: Split aux map initialization from invalidation. - iris: Wait for the GPU to be idle before invalidating the aux table. + +Rob Clark (1): + - freedreno: fix FD_MESA_DEBUG=inorder + +Samuel Pitoiset (5): + - aco: fix image load/store with lod and 1D images - nir/lower_input_attachments: remove bogus assert in try_lower_input_texop() @@ -102,6 +152,12 @@ Changes - radv: only inject implicit subpass dependencies if necessary - radv: fix random depth range unrestricted failures due to a cache issue + +Timur Kristóf (2): + - nir: Add ability to lower non-const quad broadcasts to const ones. - radv: Enable lowering dynamic quad broadcasts. + +Vinson Lee (1): + - st/nine: Fix incompatible-pointer-types-discards-qualifiers errors. diff --git a/docs/relnotes/20.0.3.rst b/docs/relnotes/20.0.3.rst index 4dc165ff839..9113841dfd3 100644 --- a/docs/relnotes/20.0.3.rst +++ b/docs/relnotes/20.0.3.rst @@ -43,7 +43,12 @@ Bug fixes Changes ------- +Caio Marcelo de Oliveira Filho (1): + - mesa/main: Fix overflow in validation of DispatchComputeGroupSizeARB + +Dylan Baker (6): + - docs/relnotes: Add sha256 sums for 20.0.2 - .pick_status.json: Update to cf62c2b2ac69637785f55b790fdd601c17e7e9d5 - .pick_status.json: Mark 672d10619980687acec329742f055f7f3796c1b8 as @@ -53,41 +58,83 @@ Changes - .pick_status.json: Mark 56de6f698e3f164d97f132203e8159ef0b8e9bb8 as denominated - .pick_status.json: Update to aee004a7c8900938d1c17f0ac299d40001b383b0 + +Eric Engestrom (6): + - .pick_status.json: Update to 3252041a7872c49e53bb02ffe8b079b5fc43f15e - .pick_status.json: Update to 12711939320e4fcd3a0d86af22da1042ad92035f - .pick_status.json: Update to 05069e1f0794aadd40ce9269f858e50c64254388 - .pick_status.json: Update to 8970b7839aebefa7207c9535ac34ab4e8cc0ae25 - .pick_status.json: Update to 5f4d9b419a1c931ad468b8b22b8a95b1216891e4 - .pick_status.json: Update to 70ac7f5b0c46370075a35067c9f7dfe78e84b16d + +Erik Faye-Lund (3): + - rbug: do not return void-value - pipebuffer: clean up cast-warnings - vtn/opencl: fully enable OpenCLstd_Clz + +Francisco Jerez (1): + - intel/fs/gen12: Fix interaction of SWSB dependency combination with EU fusion workaround. + +Greg V (1): + - amd/addrlib: fix build on non-x86 platforms + +Ian Romanick (2): + - soft-fp64/fsat: Correctly handle NaN - soft-fp64: Split a block that was missing a cast on a comparison + +Jason Ekstrand (5): + - intel/blorp: Add support for swizzling fast-clear colors - anv: Swizzle fast-clear values - nir/lower_int64: Lower 8 and 16-bit downcasts with nir_lower_mov64 - anv: Account for the header in anv_state_stream_alloc - spirv: Implement OpCopyObject and OpCopyLogical as blind copies + +John Stultz (2): + - gallium: hud_context: Fix scalar initializer warning. - vc4_bufmgr: Remove duplicative VC definition + +Jordan Justen (2): + - intel: Update TGL PCI strings - intel: Add TGL PCI ID + +Lionel Landwerlin (5): + - isl: implement linear tiling row pitch requirement for display - isl: properly filter supported display modifiers on Gen9+ - isl: only apply main surface ccs pitch constraint with CCS - isl: drop min row pitch alignment when set by the driver - intel: add new TGL pci ids + +Marek Olšák (3): + - nir: fix clip/cull_distance_array_size in nir_lower_clip_cull_distance_arrays - ac: fix fast division - st/mesa: fix use of uninitialized memory due to st_nir_lower_builtin + +Marek Vasut (1): + - etnaviv: Emit PE.ALPHA_COLOR_EXT\* on GPUs with half-float support + +Neil Armstrong (1): + - Revert "ci: Remove T820 from CI temporarily" + +Pierre-Eric Pelloux-Prayer (1): + - st/mesa: disallow deferred flush if there are multiple contexts + +Rhys Perry (11): + - nir/gather_info: handle emit_vertex_with_counter - aco: set has_divergent_branch for discards in loops - aco: handle missing second predecessors at merge block phis @@ -100,8 +147,14 @@ Changes - aco: implement 64-bit VGPR constant copies in handle_operands() - glsl: fix race in instance getters - util/u_queue: fix race in total_jobs_size access + +Rob Clark (2): + - freedreno/ir3/ra: fix array liveranges - util: fix u_fifo_pop() + +Samuel Pitoiset (7): + - radv/gfx10: fix required subgroup size with VK_EXT_subgroup_size_control - radv/gfx10: fix required ballot size with @@ -111,9 +164,21 @@ Changes - ac/nir: use llvm.amdgcn.rcp for nir_op_frcp - ac/nir: use llvm.amdgcn.rsq for nir_op_frsq - ac/nir: use llvm.amdgcn.rcp in ac_build_fdiv() + +Tapani Pälli (1): + - glsl: set error_emitted true if type not ok for assignment + +Thomas Hellstrom (1): + - svga, winsys/svga: Fix persistent memory discard maps + +Timothy Arceri (3): + - glsl: fix varying packing for 64bit integers - nir: fix packing of TCS varyings not read by the TES - nir: fix crash in varying packing on interface mismatch + +Timur Kristóf (1): + - radv/llvm: fix subgroup shuffle for chips without bpermute diff --git a/docs/relnotes/20.0.4.rst b/docs/relnotes/20.0.4.rst index 05aa3f3c5ed..e661bef1d3e 100644 --- a/docs/relnotes/20.0.4.rst +++ b/docs/relnotes/20.0.4.rst @@ -32,7 +32,12 @@ Bug fixes Changes ------- +Eric Engestrom (2): + - docs/relnotes: add sha256sum for 20.0.3 - .pick_status.json: Update to c71c1f44b055c680f073a2608a3bf560b55f8974 + +Jason Ekstrand (1): + - Revert "spirv: Implement OpCopyObject and OpCopyLogical as blind copies" diff --git a/docs/relnotes/20.0.5.rst b/docs/relnotes/20.0.5.rst index 5991bb42797..fc9e550a279 100644 --- a/docs/relnotes/20.0.5.rst +++ b/docs/relnotes/20.0.5.rst @@ -59,40 +59,90 @@ Bug fixes Changes ------- +Arcady Goldmints-Orlov (1): + - nir: Lower returns correctly inside nested loops + +Bas Nieuwenhuizen (3): + - radv: Store 64-bit availability bools if requested. - radv: Consider maximum sample distances for entire grid. - radv: Use correct buffer count with variable descriptor set sizes. + +D Scott Phillips (1): + - util/sparse_array: don't stomp head's counter on pop operations + +Daniel Stone (1): + - EGL: Add eglSetDamageRegionKHR to GLVND dispatch list + +Danylo Piliaiev (1): + - st/mesa: Update shader info of ffvp/ARB_vp after translation to NIR + +Dave Airlie (2): + - draw: free the NIR IR. - llvmpipe/nir: free the nir shader + +Dylan Baker (6): + - .pick_status.json: Update to 089e1fb287eb9b70c191091128ed5ba7edd2960a - .pick_status.json: Update to 65e2eaa4d3a7095ac438fafb09d1e36a4210966e - .pick_status.json: Update to 28d36d26c2212276e1238fad8f0b12caab97fee8 - .pick_status.json: Update to acf7e73be54c7f1cc52fcc9be38a9df26849200e - .pick_status.json: Update to 13ce637f1b28381e72470763ff5e39dd3c562476 - .pick_status.json: Update to c3c1f4d6bcc210408f8b180727d269838b38193b + +Emil Velikov (4): + - glx: set the loader_logger early and for everyone - egl/drm: reinstate (kms\_)swrast support - Revert "egl/dri2: Don't dlclose() the driver on dri2_load_driver_common failure" - glx: omit loader_loader() for macOS + +Eric Anholt (1): + - ci: Remove LLVM from ARM test drivers. + +Eric Engestrom (1): + - docs/relnotes: add sha256sum for 20.0.4 + +Hyunjun Ko (1): + - nir: fix wrong assignment to buffer in xfb_varyings_info + +Ilia Mirkin (1): + - nv50: don't try to upload MSAA settings for BUFFER textures + +Jason Ekstrand (5): + - anv/image: Use align_u64 for image offsets - nir/load_store_vectorize: Fix shared atomic info - spirv: Handle OOB vector extract operations - intel: Add \_const versions of prog_data cast helpers - anv: Report correct SLM size + +Jose Maria Casanova Crespo (1): + - v3d: Primitive Counts Feedback needs an extra 32-bit padding. + +Juan A. Suarez Romero (2): + - intel/compiler: store the FS inputs in WM prog data - anv/pipeline: allow more than 16 FS inputs + +Karol Herbst (2): + - clover: fix build with single library clang build - Revert "nvc0: fix line width on GM20x+" + +Lionel Landwerlin (7): + - iris: properly free resources on BO allocation failure - iris: share buffer managers accross screens - iris: make resources take a ref on the screen object @@ -101,16 +151,40 @@ Changes - iris: drop cache coherent cpu mapping for external BO - util/sparse_free_list: manipulate node pointers using atomic primitives + +Marek Olšák (1): + - st/mesa: fix a crash due to passing a draw vertex shader into the driver + +Mathias Fröhlich (1): + - i965: Move down genX_upload_sbe in profiles. + +Matt Turner (1): + - meson: Specify the maximum required libdrm in dri.pc + +Neil Armstrong (3): + - gitlab-ci/lava: fix handling of lava tags - gitlab-ci: add FILES_HOST_URL and move FILES_HOST_NAME into jobs - gitlab-ci: re-enable mali400/450 and t820 jobs + +Rhys Perry (1): + - aco: fix 1D textureGrad() on GFX9 + +Rob Clark (1): + - nir: fix definition of imadsh_mix16 for vectors + +Rohan Garg (1): + - ci: Split out radv build-testing on arm64 + +Samuel Pitoiset (9): + - ac/nir: split 8-bit load/store to global memory on GFX6 - ac/nir: split 8-bit SSBO stores on GFX6 - radv/llvm: enable 8-bit storage features on GFX6-GFX7 @@ -121,16 +195,37 @@ Changes - radv/llvm: fix exporting the viewport index if the fragment shader needs it - aco: fix exporting the viewport index if the fragment shader needs it + +Tapani Pälli (4): + - mesa/st: unbind shader state before deleting it - mesa/st: release variants for active programs before unref - glsl: stop processing function parameters if error happened - mesa/st: initialize all winsys_handle fields for memory objects + +Thong Thai (1): + - gallium/auxiliary/vl: fix bob compute shaders for deint yuv + +Timothy Arceri (1): + - radeonsi: don't lower constant arrays to uniforms in GLSL IR + +Tobias Jakobi (1): + - meson: Link Gallium Nine with ld_args_build_id + +Tomeu Vizoso (2): + - gitlab-ci: Place files from the Mesa repo into the build tarball - gitlab-ci: Serve files for LAVA via separate service + +Vinson Lee (2): + - swr/rasterizer: Use private functions for min/max to avoid namespace issues. - swr: Remove Byte Order Mark. + +pal1000 (1): + - scons/windows: Support build with LLVM 10. diff --git a/docs/relnotes/20.0.6.rst b/docs/relnotes/20.0.6.rst index a2a102432ce..58f2944bdc8 100644 --- a/docs/relnotes/20.0.6.rst +++ b/docs/relnotes/20.0.6.rst @@ -40,13 +40,24 @@ Bug fixes Changes ------- +Abhishek Kumar (1): + - anv/android: fix assert in anv_import_ahw_memory + +Bas Nieuwenhuizen (1): + - radv: Use actual memory type count for setting app-visible bitset. + +Danylo Piliaiev (3): + - st/mesa: Re-assign vs in locations after updating nir info for ffvp/ARB_vp - spirv: Expand workaround for OpControlBarrier on old GLSLang - st/mesa: Treat vertex inputs absent in inputMapping as zero in mesa_to_tgsi + +Dylan Baker (9): + - docs: Add sha256 sums for 20.0.5 - .pick_status.json: Update to c552b5fd1d106adc04f62fcbe71d650a9a17f7e0 - meson: update llvm dependency logic for meson 0.54.0 @@ -57,9 +68,18 @@ Changes - .pick_status.json: Update to efdb7fa9a83b0a216b1837a5912b71669bf3f984 - .pick_status.json: Update to 42b1696ef627a5bfee29911a780fa0a4dbf04610 - .pick_status.json: Update to 6b551d9f360e45ba4e74867dbe79ae212e4766c5 + +Eric Anholt (1): + - freedreno: Fix calculation of the const buffer cmdstream size. + +Erik Faye-Lund (2): + - mesa/gallium: do not use enum for bit-allocated member - meson: correct windows-version define + +Jason Ekstrand (12): + - anv: Move vb_emit setup closer to where it's used in flush_state - anv: Apply any needed PIPE_CONTROLs before emitting state - spirv: Allow constants and NULLs in SpvOpConvertUToPtr @@ -72,12 +92,27 @@ Changes - nir/lower_subgroups: Mask off unused bits in ballot ops - intel/devinfo: Compute the correct L3$ size for Gen12 - anv: Expose CS workgroup sizes based on a maximum of 64 threads + +Joshua Ashton (1): + - radv: Use TRUNC_COORD on samplers + +Lionel Landwerlin (5): + - iris: fail screen creation when kernel support is not there - intel/perf: move register definition to special file - intel/perf: break GL query stuff away - intel/perf: move mdapi query definitions to their own file - intel/perf: Enable MDAPI queries for Gen12 + +Pierre-Eric Pelloux-Prayer (1): + - radeonsi: skip vs output optimizations for some outputs + +Quentin Glidic (1): + - meson: Use dependency.partial_dependency() + +Samuel Pitoiset (1): + - radv: make sure to export the viewport index if FS needs it diff --git a/docs/relnotes/20.0.7.rst b/docs/relnotes/20.0.7.rst index 924f46515df..e34acd45b6f 100644 --- a/docs/relnotes/20.0.7.rst +++ b/docs/relnotes/20.0.7.rst @@ -44,15 +44,38 @@ Bug fixes Changes ------- +Axel Davy (1): + - gallium/util: Fix leak in the live shader cache + +Bas Nieuwenhuizen (2): + - radv: Extend tiling flags to 64-bit. - winsys/amdgpu: Retrieve WC flags from imported buffers. + +Blaž Tomažič (1): + - radeonsi: Fix omitted flush when moving suballocated texture + +Christopher James Halse Rogers (1): + - egl/wayland: Fix zwp_linux_dmabuf usage + +D Scott Phillips (2): + - intel/fs: Update location of Render Target Array Index for gen12 - anv,iris: Fix input vertex max for tcs on gen12 + +Danylo Piliaiev (1): + - i965: Fix out-of-bounds access to brw_stage_state::surf_offset + +Dave Airlie (1): + - llvmpipo/nir: free compute shader NIR + +Dylan Baker (16): + - docs: Add SHA256 sums for 20.0.6 - .pick_status.json: Update to 2efa76f795cb2b2bf00b317c580aeeeddd1e9bc2 - .pick_status.json: Update to 3fac55ce0d066d767d6c6c8308f79d0c3e566ec0 @@ -73,28 +96,67 @@ Changes - radeonsi: retab - .pick_status.json: Update to 0bea2a13212be10982e14617002a3ff851b84717 - .pick_status.json: Update to d76e722ed63607ecead2c66ef9f3a37a12b62bab + +Ian Romanick (1): + - nir/algebraic: Optimize ushr of pack_half, not ishr + +Ivan Molodetskikh (1): + - egl: allow INVALID format for linux_dmabuf + +Jason Ekstrand (3): + - nir/copy_prop_vars: Report progress when deleting self-copies - intel/fs: Don't delete coalesced MOVs if they have a cmod - vulkan: Allow destroying NULL debug report callbacks + +Jose Maria Casanova Crespo (2): + - v3d: Fix swizzle in DXT3 and DXT5 formats - v3d: Include supported DXT formats to enable s3tc/dxt extensions + +Lionel Landwerlin (3): + - iris: don't assert on unfinished aux import in copy paths - intel/perf: store the probed i915-perf version - anv: don't expose VK_INTEL_performance_query without kernel support + +Marek Olšák (3): + - mesa: report GL_INVALID_OPERATION for invalid glTextureBuffer target - radeonsi: unify and align down the max SSBO/TBO/UBO buffer binding size - radeonsi: fix compilation of monolithic PS + +Neil Armstrong (1): + - ci: disable t820/mali4xx tests + +Pierre Moreau (1): + - clover/nir: Check the result of spirv_to_nir + +Pierre-Eric Pelloux-Prayer (1): + - radeonsi: fix export count + +Qiang Yu (1): + - panfrost: don't always build bifrost_compiler + +Rhys Perry (2): + - nir: add missing group_memory_barrier handling - aco: consider blocks unreachable if they are in the logical cfg + +Samuel Pitoiset (4): + - radv: report INITIALIZATION_FAILED when the amdgpu winsys init failed - radv: don't report error with other vendor DRM devices - aco: fix 64-bit trunc with negative exponents on GFX6 - radv: limit the Vulkan version to 1.1 for Android + +Tapani Pälli (1): + - st/mesa: destroy only own program variants when program is released diff --git a/docs/relnotes/20.0.8.rst b/docs/relnotes/20.0.8.rst index 05b9b680597..98b8aab426a 100644 --- a/docs/relnotes/20.0.8.rst +++ b/docs/relnotes/20.0.8.rst @@ -64,10 +64,18 @@ Bug fixes Changes ------- +Bas Nieuwenhuizen (3): + - radv/winsys: Remove extra sizeof multiply. - radv: Handle failing to create .cache dir. - radv: Provide a better error for permission issues with priorities. + +D Scott Phillips (1): + - anv/gen11+: Disable object level preemption + +Danylo Piliaiev (6): + - anv: Translate relative timeout to absolute when calling anv_timelines_wait - anv: Fix deadlock in anv_timelines_wait @@ -79,7 +87,13 @@ Changes SIMD16 - glsl: inline functions with unsupported return type before converting to nir + +Dave Airlie (1): + - llvmpipe: compute shaders work better with all the threads. + +Dylan Baker (10): + - docs/relnotes Add sha256 sums to 20.0.7 - .pick_status.json: Update to ceae09da156309327d7ba6f4a59d3a2e9b8837d9 - .pick_status.json: Update to a887ad7c84e14fdad7907037a39e9fee9d504bf3 @@ -90,54 +104,123 @@ Changes - radonsi/si_state.c: retab - .pick_status.json: Update to 0795241dde1507e0c6a3f9ef07c281ad4f2acf7b - vulkan-overlay/meson: use install_data instead of configure_file + +Eric Engestrom (3): + - tree-wide: fix deprecated GitLab URLs - glapi: remove deprecated .getchildren() that has been replace with an iterator - intel: fix gen_sort_tags.py + +Erik Faye-Lund (2): + - zink: use general-layout when blitting to/from same resource - nir: reuse existing psiz-variable + +Gert Wollny (1): + - nir: lower_tex: Don't normalize coordinates for TXF with RECT + +Ian Romanick (1): + - anv/tests: Don't rely on assert or changing NDEBUG in tests + +Ilia Mirkin (1): + - nouveau: allow invalidating coherent/persistent buffer backings + +Jan Palus (1): + - targets/opencl: fix build against LLVM>=10 with Polly support + +Jason Ekstrand (6): + - anv:gpu_memcpy: Emit 3DSTATE_VF_INDEXING on Gen8+ - nir/lower_double_ops: Rework the if (progress) tree - nir/opt_deref: Report progress if we remove a deref - nir/copy_prop_vars: Record progress in more places - intel/vec4: Stomp the return type of RESINFO to UINT32 - intel/fs: Fix unused texture coordinate zeroing on Gen4-5 + +Jonathan Marek (1): + - freedreno/a6xx: use nonbinning VS when GS is used + +Joshua Ashton (1): + - radeonsi: Use TRUNC_COORD on samplers + +Lionel Landwerlin (4): + - iris: fix BO destruction in error path - i965: don't forget to set screen on duped image - i965: fix export of GEM handles - iris: fix export of GEM handles + +Lucas Stach (1): + - etnaviv: retarget transfer to render resource when necessary + +Marek Olšák (2): + - radeonsi: don't expose 16xAA on chips with 1 RB due to an occlusion query issue - radeonsi: add a hack to disable TRUNC_COORD for shadow samplers + +Marek Vasut (1): + - etnaviv: Disable seamless cube map on GC880 + +Michel Dänzer (1): + - util: Change os_same_file_description return type from bool to int + +Nataraj Deshpande (1): + - dri_util: Update internal_format to GL_RGB8 for MESA_FORMAT_R8G8B8X8_UNORM + +Neha Bhende (1): + - util: Initialize pipe_shader_state for passthrough and transform shaders + +Pierre-Eric Pelloux-Prayer (1): + - omx: fix build with gcc 10 + +Rhys Perry (4): + - nir: fix lowering to scratch with boolean access - aco: fix interaction with 3f branch workaround and p_constaddr - aco: check instruction format before waiting for a previous SMEM store - aco: preserve more fields when combining additions into SMEM + +Rob Clark (1): + - freedreno: clear last_fence after resource tracking + +Samuel Pitoiset (4): + - spirv,radv,anv: implement no-op VK_GOOGLE_user_type - nir/lower_explicit_io: fix NON_UNIFORM access for UBO loads - radv: enable zero VRAM for Doom Eternal - radv: enable zero VRAM for all VKD3D (DX12->VK) games + +Timothy Arceri (3): + - glsl: stop cascading errors if process_parameters() fails - radv: fix regression with builtin cache - glsl: fix potential slow compile times for GLSLOptimizeConservatively + +Vinson Lee (4): + - zink: Check fopen result. - r300g: Remove extra printf format specifiers. - vdpau: Fix wrong calloc sizeof argument. - mesa: Fix NetBSD compiler macro. + +Yevhenii Kolesnikov (1): + - intel/compiler: fix cmod propagation optimisations diff --git a/docs/relnotes/20.1.0.rst b/docs/relnotes/20.1.0.rst index 9b46ecf9d3a..9646a4b4f4b 100644 --- a/docs/relnotes/20.1.0.rst +++ b/docs/relnotes/20.1.0.rst @@ -249,14 +249,25 @@ Bug fixes Changes ------- +Abhishek Kumar (1): + - anv/android: fix assert in anv_import_ahw_memory + +Adam Jackson (1): + - gallium: enable EGL_EXT_image_dma_buf_import_modifiers unconditionally + +Albert Astals Cid (5): + - cube_face_coord: Use fabsf instead of fabs since we know it's floats - cube_face_index: Use fabsf instead of fabs since we know it's floats - aco: Minor optimization in spill_ctx constructor - aco: pass vars by const & - Fix promotion of floats to doubles + +Alejandro Piñeiro (7): + - docs/features: add v3d driver - nir/linker: remove reference to just SPIR-V linking - v3d/tex: don't configure tmu config 1 if not needed @@ -265,9 +276,15 @@ Changes - v3d/packet: fixing TMU_Config_Parameter_2 definition - nir: add nir_tex_instr_need_sampler helper - v3d: support for textureQueryLOD + +Alexandros Frantzis (3): + - gitlab-ci: Automated testing with OpenGL traces - gitlab-ci: Fix traces caching in tracie - gitlab-ci: Check the Mesa version used for tracie tests + +Alyssa Rosenzweig (505): + - pan/midgard: Break out one-src read_components - pan/midgard: Implement mixed-type constant packing - panfrost: Avoid overlapping copy @@ -773,6 +790,9 @@ Changes - pan/mdg: Move condense_writemask to disasm - pan/mdg: Ensure fdot is scalar out in disasm - pan/mdg: Replicate 16-bit swizzles + +Andreas Baierl (8): + - lima/parser: Fix RSW depth test parsing - lima/parser: Extend AUX0 findings - lima/parser: Change value name in RSW parser @@ -781,6 +801,9 @@ Changes - gitlab-ci: Enable the lima job again - gitlab-ci: Add add a set of lima flakes - lima: Add etc1 support + +Andres Gomez (27): + - tracie: correct typo - gitlab-ci: add missing popd to the build-deqp-vk.sh script - gitlab-ci: build gfxreconstruct into the Vulkan testing container @@ -811,10 +834,19 @@ Changes - gitlab-ci: update tracie README after changes in main script - gitlab-ci: create always the "results" directory with tracie - gitlab-ci: correct tracie behavior with replay errors + +Andrii Simiklit (2): + - Revert "glx: convert glx_config_create_list to one big calloc" - i965/vec4: Ignore swizzle of VGRF for use by var_range_end() + +Anuj Phogat (2): + - intel/gen12+: Reserve 4KB of URB space per bank for Compute Engine - intel/gen12+: Set way_size_per_bank to 4 + +Arcady Goldmints-Orlov (7): + - compiler/nir: Add support for variable initialization from a pointer - compiler/spirv: Add support for non-constant initializers - Rename nir_lower_constant_initializers to @@ -823,7 +855,13 @@ Changes - nir: Lower returns correctly inside nested loops - anv: increase minUniformBufferOffsetAlignment to 64 - intel/compiler: fix alignment assert in nir_emit_intrinsic + +Axel Davy (1): + - gallium/util: Fix leak in the live shader cache + +Bas Nieuwenhuizen (29): + - radv: Allow non-dedicated linear images and buffer. - radv: Do not set SX DISABLE bits for RB+ with unused surfaces. - radv: Optimize emitting index buffer changes. @@ -853,8 +891,17 @@ Changes - radv: Handle failing to create .cache dir. - radv: Do not close fd -1 when NULL-winsys creation fails. - radv: Implement vkGetSwapchainGrallocUsage2ANDROID. + +Bernd Kuhls (1): + - util/os_socket: Include unistd.h to fix build error + +Blaž Tomažič (1): + - radeonsi: Fix omitted flush when moving suballocated texture + +Boris Brezillon (45): + - pan/midgard: Add an enum to describe the render targets - pan/midgard: Make sure we pass the right RT id to emit_fragment_store() @@ -908,6 +955,9 @@ Changes - panfrost: Use ctx->active_prim in panfrost_writes_point_size() - panfrost: Get rid of ctx->payloads[] - vtn/opencl: add rint-support + +Brian Ho (17): + - turnip: Promote tu_cs_get_size/is_empty to header - turnip: Execute main cs for secondary command buffers - turnip: Advertise 8 bit subpixel precision @@ -925,6 +975,9 @@ Changes - turnip: Correctly set layer stride for 3D images - turnip: Emit geometry shader descriptor consts - freedreno/turnip: Update GRAS_LAYER_CNTL to GRAS_MAX_LAYER_INDEX + +Caio Marcelo de Oliveira Filho (46): + - anv: Advertise VK_KHR_shader_non_semantic_info - radv: Advertise VK_KHR_shader_non_semantic_info - intel/gen12: Take into account opcode when decoding SWSB @@ -971,6 +1024,9 @@ Changes - intel/fs,vec4: Pull stall logic for memory fences up into the IR - intel/fs: Only stall after sending all memory fence messages - i965: Use correct constant for max_variable_local_size + +Chad Versace (12): + - anv: Drop unused anv_image_get_surface_for_aspect_mask() - anv: Rename param make_surface::dev to device - anv: Delete anv_image::ccs_e_compatible @@ -983,11 +1039,23 @@ Changes - anv: Refactor creation of aux surfaces (v2) - anv: Flatten the logic add_aux_surface_if_supported (v3) - anv: Use isl_drm_modifier_get_default_aux_state() + +Chia-I Wu (2): + - egl/android: require ANDROID_native_fence_sync for buffer age - egl/android: enable/disable KHR_partial_update correctly + +Chris Lord (2): + - vc4: fix vc4_yuv_blit overwriting fragment constant buffer slot 0 - vc4: Fix query_dmabuf_modifiers mis-reporting external_only property + +Chris Wilson (1): + - iris: Fix import sync-file into syncobj + +Christian Gmeiner (44): + - etnaviv: enable texture upload memory throttling - etnaviv: update headers from rnndb - etnaviv: fix alpha test on GC3000 @@ -1034,8 +1102,17 @@ Changes - ci: bare-metal: power down device after tests - etnaviv: fix SAMP_ANISOTROPY register value - etnaviv: do not use int filter when anisotropic filtering is used + +Christopher Egert (1): + - radv: use util_float_to_half_rtz + +Christopher James Halse Rogers (1): + - egl/wayland: Fix zwp_linux_dmabuf usage + +Connor Abbott (55): + - freedreno: Fix CP_COND_REG_EXEC bit positions - freedreno: Add CP_REG_WRITE documentation - freedreno: Fix CP_COND_EXEC @@ -1091,6 +1168,9 @@ Changes - st/nir: Fix assigning PointCoord location with !PIPE_CAP_TEXCOORD - ir3: Remove VARYING_SLOT_PNTC remapping hack - tu: Don't invert point coords + +D Scott Phillips (6): + - intel/tools/aubinator_error_decode: read HW Context before other batches - intel/tools/aubinator_error_decode: Decode ring buffers from HEAD to @@ -1099,6 +1179,9 @@ Changes - intel/fs: Update location of Render Target Array Index for gen12 - anv,iris: Fix input vertex max for tcs on gen12 - anv/gen11+: Disable object level preemption + +Daniel Schürmann (73): + - aco: fix image_atomic_cmp_swap - nir: gather info whether a shader uses demote_to_helper - nir: add pass to lower discard() to demote() @@ -1185,6 +1268,9 @@ Changes striding - aco: coalesce v_mad's accumulator with definition's affinities - aco: either copy-propagate or inline create_vector operands + +Daniel Stone (15): + - Revert "gitlab-ci: disable panfrost runners" - egl/wayland: Don't invalidate buffers on no-op resize - util/test: Use MAX_PATH on Windows @@ -1200,6 +1286,9 @@ Changes - Revert "CI: Disable Windows/VS2019 builds" - ci/windows: Make Chocolatey installs more reliable - CI: Disable Lima jobs due to lab unhealthiness + +Danylo Piliaiev (29): + - i965: Do not set front_buffer_dirty if there is no front buffer - st/mesa: Handle the rest renderbuffer formats from OSMesa - osmesa/tests: Cover OSMESA_RGB GL_UNSIGNED_BYTE case @@ -1238,6 +1327,9 @@ Changes mutex - intel/fs: Work around dual-source blending hangs in combination with SIMD16 + +Dave Airlie (69): + - llvmpipe/query: add support for indexed queries - gallivm/swr: add stream_id to geom epilogue emit - gallivm/nir: add support for multiple vertex streams @@ -1308,15 +1400,33 @@ Changes - draw/tess: free tessellation control shader i/o memory. - llvmpipo/nir: free compute shader NIR - llvmpipe: compute shaders work better with all the threads. + +David Stevens (1): + - egl/android: set window usage flags + +Denys (1): + - gitlab: add bug report template + +Dominik Behr (1): + - meson: fix debug build on Android + +Drew Davenport (1): + - radv: Filter extensions not whitelisted for Android + +Duncan Hopkins (2): + - zink. Added storage CISto descriptor pool. Added storage in descriptor pool for combined image samplers as well as uniform buffers. Stops some shaders from running through a pools storage faster than zinks internal tracking. - zink: zero out zink_render_pass_state + +Dylan Baker (48): + - docs/release-calendar: 20.0.0-rc1 has been released - docs: Mark 20.0-rc2 as done - docs: Add release notes for 19.3.4 @@ -1368,17 +1478,32 @@ Changes - mesa: Follow OpenGL conversion rules for values that exceed storage size - tests: Make tests aware of meson test wrapper + +Edmondo Tommasina (1): + - radv/sqtt: fix RADV_THREAD_TRACE_BUFFER_SIZE spelling + +Eduardo Lima Mitev (3): + - turnip/pipeline: Don't assume tu_shader is a valid object - turnip: Instance can be NULL resolving 'GetInstanceProcAddr' entry point - anv/radv: Resolving 'GetInstanceProcAddr' should not require a valid instance + +Eli Schwartz (1): + - docs: fix typo in v20 release notes + +Elie Tournier (3): + - spirv2nir: print nir shader if translation succed - spirv2nir: Add kernel spirv support - docs/features: Update virgl OpenGL 4.5 features GL_ARB_clip_control and GL_KHR_robustness are now expose in the guest. + +Emil Velikov (11): + - meson: glx: drop with_glx == dri check - glx: set the loader_logger early and for everyone - egl/drm: reinstate (kms\_)swrast support @@ -1392,7 +1517,13 @@ Changes - loader: fallback to kernel name, if PCI fails - glx: omit loader_loader() for macOS - egl: simplify client/platform extension handling + +Emmanuel Gil Peyrot (1): + - Expose EGL_KHR_platform\_\* when EXT is supported + +Eric Anholt (144): + - gallium/osmesa: Fix a typo in the unit test's test names. - gallium/osmesa: Fix MakeCurrent of non-8888 contexts. - gallium/osmesa: Fill out other format tests. @@ -1557,6 +1688,9 @@ Changes - util/ra: Use util_dynarray for the adjacency list. - util/ra: Use util_dynarray for handling the conflict lists. - util/ra: Improve ra_set_finalize() performance. + +Eric Engestrom (58): + - VERSION: bump after 20.0 branch point - egl: put full path to libEGL_mesa.so in GLVND json - gitlab-ci: disable a630 tests as mesa-cheza is down @@ -1625,6 +1759,9 @@ Changes - VERSION: bump to 20.1.0-rc4 - .pick_status.json: Update to a91306677c613ba7511b764b3decc9db42b24de1 - tree-wide: fix deprecated GitLab URLs + +Erik Faye-Lund (154): + - zink: enable texture-buffer objects - zink: implement load_instance_id - zink: implement support for derivative-control @@ -1779,6 +1916,9 @@ Changes - util/os_memory: never use os_memory_debug.h - zink: implement i2b1 - zink: use general-layout when blitting to/from same resource + +Francisco Jerez (57): + - intel/fs/cse: Make HALT instruction act as CSE barrier. - intel/fs/gen7: Fix fs_inst::flags_written() for SHADER_OPCODE_FIND_LIVE_CHANNEL. @@ -1876,13 +2016,22 @@ Changes - intel/ir: Remove scheduling-based cycle count estimates. - intel/ir: Update performance analysis parameters for memory fence codegen changes. + +Fritz Koenig (3): + - Revert "gitlab-ci: disable a630 tests as mesa-cheza is down" - Revert "gitlab-ci: disable a630 tests as mesa-cheza is down (again)" - freedreno: allow FMT6_8_UNORM as a UBWC format + +Georg Lehmann (3): + - Correctly wait in the fragment stage until all semaphores are signaled - Vulkan Overlay: Don't try to change the image layout to present twice - Vulkan overlay: use the corresponding image index for each swapchain + +Gert Wollny (63): + - r600: force new CF with TEX only if any texture value is written - r600: Increase space for IO values to agree with PIPE_MAX_SHADER_IN/OUTPUTS @@ -1950,12 +2099,24 @@ Changes - r600: Fix nir compiler options, i.e. don't lower IO to temps for TESS - r600/sfn: Fix printing vertex fetch instruction flags - r600: Fix duplicated subexpression in r600_asm.c + +Greg V (3): + - amd/addrlib: fix build on non-x86 platforms - r600: add missing include - svga: fix build on FreeBSD + +H.J. Lu (2): + - x86_init_func_common: Add ENDBR at function entry - x86: Add ENDBR at function entries + +Hanno Böck (1): + - Properly check mmap return value + +Hyunjun Ko (27): + - freedreno/ir3: fix printing half constant registers. - freedreno/ir3: Add cat4 mediump opcodes - freedreno/ir3: put the conversion back for half const to the right @@ -1986,7 +2147,13 @@ Changes - turnip : Fix wrong offset calculation for xfb buffer. - turnip: Skip unused regs when setting up streamout buffers - turnip: Fix crashes when geometry shader constants aren't used + +Iago Toral Quiroga (1): + - nir: add a bool bitsize lowering pass + +Ian Romanick (62): + - intel/fs: Don't count integer instructions as being possibly coissue - nir: Mark fmin and fmax as commutative and associative - mesa/draw: Make sure all the unused fields are initialized to zero @@ -2062,6 +2229,9 @@ Changes - nir/algebraic: Require operands to iand be 32-bit - nir/algebraic: Optimize ushr of pack_half, not ishr - anv/tests: Don't rely on assert or changing NDEBUG in tests + +Icecream95 (16): + - panfrost: Fix non-debug builds - panfrost: Inline panfrost_get_default_swizzle - panfrost: LogicOp support @@ -2078,9 +2248,15 @@ Changes - panfrost: Fix GL_EXT_vertex_array_bgra - panfrost: Enable PIPE_CAP_VERTEX_COLOR_UNCLAMPED - panfrost: Fix background showing when using discard + +Icenowy Zheng (3): + - lima: remove its hash table entry when invalidating a resource - lima: expose fragment shader derivatives capability - lima: implement zsbuf reload + +Ilia Mirkin (24): + - nv50: report max lod bias of 15.0 - gitlab-ci: disable panfrost runners - mesa: fix \_mesa_draw_nonzero_divisor_bits to return nonzero divisors @@ -2106,18 +2282,39 @@ Changes - docs: update for recently-added nvc0 features - mesa: add interaction between compute derivatives and variable local sizes + +Indrajit Kumar Das (4): + - glapi/copyimage: Implement CopyImageSubDataNV - gallium: prepare framework for supporting AlphaToCoverageDitherControlNV - mesa: add support for AlphaToCoverageDitherControlNV - radeonsi: enable support for AlphaToCoverageDitherControlNV + +Ivan Molodetskikh (1): + - egl: allow INVALID format for linux_dmabuf + +James Xiong (2): + - iris: handle the failure of converting unsupported yuv formats to isl - gallium: let the pipe drivers decide the supported modifiers + +James Zhu (1): + - radeonsi: fix Segmentation fault during vaapi enc test + +Jan Palus (1): + - targets/opencl: fix build against LLVM>=10 with Polly support + +Jan Vesely (2): + - clover: Use explicit conversion from llvm::StringRef to std::string - clover: Check if the detected clang libraries are usable + +Jan Zielinski (8): + - gallium/swr: Fix various asserts and security issues - gallium/swr: fix corruptions in Unigine Heaven - gallium/swr: use ElementCount type arguments for getSplat() @@ -2126,6 +2323,9 @@ Changes - gallium/gallivm: remove unused header include for newer LLVM - gallium/swr: Fix LLVM 11 compilation issues - gallium/swr: Fix crashes and failures in vertex fetch + +Jason Ekstrand (202): + - genxml: Add a new 3DSTATE_SF field on gen12 - anv,iris: Set 3DSTATE_SF::DerefBlockSize to per-poly on Gen12+ - intel/genxml: Drop SLMEnable from L3CNTLREG on Gen11 @@ -2332,9 +2532,15 @@ Changes - nir/lower_double_ops: Rework the if (progress) tree - nir/opt_deref: Report progress if we remove a deref - nir/copy_prop_vars: Record progress in more places + +Jesse Natalie (3): + - wgl: add official gldrv.h header-file - wgl: use gldrv.h instead of stw_icd.h - util/ralloc: fix ralloc alignment on Win64 + +John Stultz (7): + - freedreno: Add ir3_cf.c and ir3_delay.c to Makefile.sources - panfrost: Move pan_afbc.c file to the the right Makefile.source file - gallium: hud_context: Fix scalar initializer warning. @@ -2342,7 +2548,13 @@ Changes - etnaviv: Avoid shift overflow - vc4_bufmgr: Remove duplicative VC definition - r600: Fix build error in sfn_nir_lower_fs_out_to_vector.cpp + +Jon Turney (1): + - Fix util/process test on Cygwin + +Jonathan Marek (79): + - freedreno/a6xx: use single format enum - freedreno/a6xx: fix Z24_UNORM_S8_UINT_AS_R8G8B8A8 - freedreno: name sysmem color/depth flush events @@ -2424,32 +2636,50 @@ Changes - nir: add pack_32_2x16_split/unpack_32_2x16_split lowering - freedreno/ir3: run nir_lower_pack - turnip: fix wrong substream size in parse_multisample_and_color_blend + +Jordan Justen (6): + - intel/compiler: Restrict cs_threads to 64 - intel: Update TGL PCI strings - intel: Add TGL PCI ID - intel/dev: Split .num_subslices out of GEN12_FEATURES macro - intel/dev: Add device info for RKL - docs/relnotes/new_features.txt: Add RKL to 20.1 release notes + +Jose Maria Casanova Crespo (5): + - broadcom: Fix implicit declaration of ffs for Android build - v3d: Sync on last CS when non-compute stage uses resource written by CS - v3d: Primitive Counts Feedback needs an extra 32-bit padding. - v3d: Fix swizzle in DXT3 and DXT5 formats - v3d: Include supported DXT formats to enable s3tc/dxt extensions + +Joshua Ashton (3): + - radv: Use TRUNC_COORD on samplers - radv: Pass logical device to si_emit_graphics - radeonsi: Use TRUNC_COORD on samplers + +José Fonseca (4): + - meson: Avoid duplicate symbols. - scons: Prune out unnecessary targets. - gitlab-ci: Prune all SCons jobs except scons-win64, and allows failures. - appveyor: Remove Meson job. + +Juan A. Suarez Romero (6): + - nir/lower_double_ops: add note for lowering mod - nir/lower_double_ops: relax lower mod() - nir/algebraic: coalesce fmod lowering - anv: use urb_setup_attribs in SBE - intel/compiler: store the FS inputs in WM prog data - anv/pipeline: allow more than 16 FS inputs + +Karol Herbst (18): + - clover: add trivial clCreateCommandQueueWithProperties implementation - nir/lower_ssbo: handle atomics - gallium: make handles of set_global_binding 64 bit @@ -2470,6 +2700,9 @@ Changes - gallium: initialize viewport swizzle in cso_set_viewport_dims - Revert "nvc0: fix line width on GM20x+" - st/mesa: properly guard fallback_copy_texsubimage aginst failed maps + +Kenneth Graunke (14): + - intel/genxml: Drop "reserved" enum - isl: Fix the android build. - iris: Dump frame markers with INTEL_DEBUG=submit @@ -2486,7 +2719,13 @@ Changes opt_peephole_sel - nir: Actually do load/store vectorization beyond vec2 - iris: Fix downcast of bound_vertex_buffers from uint64_t to int + +Konrad Dybcio (1): + - freedreno/a4xx: enable A405 + +Kristian Høgsberg (39): + - nir: Delete unused is_var_constant() helper - nir: Make unroll pragma work on clang - freedreno/fdperf: Cast away some ignored return values @@ -2527,6 +2766,9 @@ Changes - freedreno/ir3: Reset lex line number when we start parsing - freedreno/ir3: Print @tex write mask using 0x%x - freedreno: Use the right amount of &'s + +Krzysztof Raszkowski (10): + - gallium/swr: fix gcc warnings - gallium/swr: Fix gcc 4.8.5 compile error - gallium/swr: Fix llvm11 compilation issues @@ -2537,9 +2779,21 @@ Changes - gallium/swr: Fix vcvtph2ps llvm intrinsic compile error - gallium/swr: Fix array stride problem. - gallium/swr: Re-enable scratch space for client-memory buffers + +Leandro Ribeiro (1): + - i965: remove duplicated comment + +Leo Liu (1): + - radeon/jpeg: fix the jpeg dt_pitch with YUYV format + +Lepton Wu (1): + - virgl: Use ETC2 formats directly when possible. + +Lionel Landwerlin (49): + - iris: implement gen12 post sync pipe control workaround - anv: implement gen9 post sync pipe control workaround - anv: implement gen12 post sync pipe control workaround @@ -2591,8 +2845,14 @@ Changes - drm-shim: stub syncobj wait ioctl - iris: don't assert on unfinished aux import in copy paths - anv: don't expose VK_INTEL_performance_query without kernel support + +Liviu Prodea (2): + - scons/windows: Support build with LLVM 10. - util: Make process_test path compatible with mingw native toolchains + +Louis-Francis Ratté-Boulianne (7): + - glsl/linker: add DisableTransformFeedbackPacking workaround - glsl/linker: handle array/struct members for DisableXfbPacking - glsl/linker: add xfb workaround for modified built-in variables @@ -2600,7 +2860,13 @@ Changes - gallium: add PIPE_CAP_VIEWPORT_TRANSFORM_LOWERED - gallium: add PIPE_CAP_PSIZ_CLAMPED - panfrost: fix transform feedback + +Lucas Stach (1): + - etnaviv: retarget transfer to render resource when necessary + +Marek Olšák (254): + - vbo: move GLvertexformat initialization into a template header file for reuse - vbo: use the template for noop GLvertexformat initialization @@ -2922,18 +3188,33 @@ Changes - radeonsi: fix compilation of monolithic PS - radeonsi: don't expose 16xAA on chips with 1 RB due to an occlusion query issue + +Marek Vasut (4): + - etnaviv: Destroy rsc->pending_ctx set in etna_resource_destroy() - etnaviv: Emit PE.ALPHA_COLOR_EXT\* on GPUs with half-float support - etnaviv: Fix depth stencil ops on GC880/GC2000 - etnaviv: Disable seamless cube map on GC880 + +Mark Janes (2): + - nir: check shader type before writing to shaderinfo.tess union - nir: place aligned members after bitfields in shader_info.tess + +Mark Menzynski (2): + - util/blob: Add overwrite function for uint8 - tgsi/util: Change boolean for bool + +Martin Fuzzey (3): + - freedreno: android: fix build failure on android due to python version - freedreno: android: add a6xx-pack.xml.h generation to android build - freedreno: android: fix build of perfcounters. + +Mathias Fröhlich (19): + - egl: Implement getImage/putImage on pbuffer swrast. - mesa: Fix FLUSH_VERTICES in SubpixelPrecisionBiasNV. - egl: Fix A2RGB10 platform_{device,surfaceless} PBuffer configs. @@ -2954,6 +3235,9 @@ Changes - mesa: Provide gl_vertex_format accessors. - i965: Make use of the vertex format functions in i965. - i965: Use gl_vertex_format in brw_vertex_element. + +Matt Turner (11): + - intel/tools: Do not print type/qualifiers/name for c_literal - intel/vec4: Make implied_mrf_writes() a vec4_instruction method - intel/compiler: Remove unnecessary local variables @@ -2965,11 +3249,17 @@ Changes - intel/compiler: Discount NOPs from instruction counts - isl: Avoid EXPECT_DEATH in unit tests - meson: Specify the maximum required libdrm in dri.pc + +Mauro Rossi (5): + - android: gallium/auxiliary: fix "Unused source files" in tesselator - android: aco: fix PIPE_FORMAT related building errors - android: r600/sfn: fix includes and libmesa_nir dependency - android: r600/sfn: Add GDS instructions - android: aco: add various compiler statistics + +Michel Dänzer (33): + - gitlab-ci: Update to latest ci-templates HEAD - gitlab-ci: Pass -j4 to make - gitlab-ci: Merge ccache and libxml2-utils into main apt-get install @@ -3006,6 +3296,9 @@ Changes - mesa: Skip 3-byte array formats in \_mesa_array_format_flip_channels - Revert "ac,radeonsi: fix compilations issues with LLVM 11" - Revert "gallium/gallivm: fix compilation issues with llvm 11" + +Mike Blumenkrantz (6): + - zink: set UBO alignments in nir_intrinsic_load_uniform lowering - zink: remove framebuffer cache - zink: explicitly unref old fb object when setting new one @@ -3013,20 +3306,35 @@ Changes - gallium: add pipe cap for scissored clears and pass scissor state to clear() hook - iris: handle PIPE_CAP_CLEAR_SCISSORED + +Nanley Chery (6): + - isl: Add a module which manages aux resolves - iris: Use isl_aux_usage_has_fast_clear() - iris: Use ISL's access preparation functions - iris: Use isl_aux_state_transition_write() - i965: Use ISL's access preparation functions - i965: Use isl_aux_state_transition_write() + +Nataraj Deshpande (1): + - dri_util: Update internal_format to GL_RGB8 for MESA_FORMAT_R8G8B8X8_UNORM + +Neha Bhende (2): + - svga: fix size of format_conversion_table[] - svga: Use pipe_shader_state_from_tgsi to set shader state + +Neil Armstrong (4): + - gitlab-ci/lava: fix handling of lava tags - Revert "ci: Remove T820 from CI temporarily" - gitlab-ci: add FILES_HOST_URL and move FILES_HOST_NAME into jobs - gitlab-ci: re-enable mali400/450 and t820 jobs + +Neil Roberts (17): + - nir/opcodes: Add nir_op_f2fmp - glsl: Add support for float16 types in the IR tree - glsl: Add IR conversion ops for 16-bit float types @@ -3044,6 +3352,9 @@ Changes - glsl: Inline builtins in a separate pass - glsl/lower_precision: Lower builtins depending on arguments - glsl/lower_precision: Use vector.back() instead of vector.end()[-1] + +Paulo Zanoni (8): + - intel: fix the gen 11 compute shader scratch IDs - intel: fix the gen 12 compute shader scratch IDs - intel/device: bdw_gt1 actually has 6 eus per subslice @@ -3052,8 +3363,17 @@ Changes - iris: remove unnecessary forward declaration - iris: remove useless bo->gtt_offset assignment - iris: make BATCH_SZ smaller by BATCH_RESERVED bytes + +Peng Huang (1): + - radeonsi: make si_fence_server_signal flush pipe without work + +Pierre Moreau (1): + - clover/nir: Check the result of spirv_to_nir + +Pierre-Eric Pelloux-Prayer (44): + - radeonsi/ngg: add VGT_FLUSH when enabling fast launch - radeonsi: test subsampled format in testdma - format: add format_to_chroma_format @@ -3098,8 +3418,14 @@ Changes - driconf: add force_integer_tex_nearest option - radeonsi: don't print gs_copy_shader stats for shaderdb - amd/addrlib: fix forgotten char -> enum conversions + +Plamena Manolova (2): + - intel/compiler: Add support for variable workgroup size - i965: Implement ARB_compute_variable_group_size + +Qiang Yu (35): + - lima: remove definition of lima_is_scanout - lima: use util_copy_framebuffer_state - lima: always add texture bo to submit @@ -3135,7 +3461,13 @@ Changes - lima: also check tiled and depth case when import - lima: set offset when export resource - panfrost: don't always build bifrost_compiler + +Quentin Glidic (1): + - meson: Use dependency.partial_dependency() + +Rafael Antognolli (18): + - intel: Load the driver even if I915_PARAM_REVISION is not found. - intel/tools: Update aubinator_error_decode. - intel/blorp: Implement GEN:BUG:1605967699. @@ -3154,6 +3486,9 @@ Changes - anv: Add anv_device parameter to anv_gem_munmap. - anv: Add support for new MMAP_OFFSET ioctl. - anv: Enable HiZ on multi-layer depth buffers. + +Rhys Perry (118): + - aco: fix gfx10_wave64_bpermute - aco: gfx10_wave64_bpermute reduce op to print_ir - aco: disable some instruction combining if it could change an exec @@ -3280,6 +3615,9 @@ Changes - nir/opt_if: run opt_peel_loop_initial_if after all other optimizations - nir: fix lowering to scratch with boolean access + +Rob Clark (147): + - freedreno/drm: readonly cmdstream - freedreno/ir3: shuffle a few ir3_register fields - freedreno/ir3: cleanup after lower_locals_to_regs @@ -3427,11 +3765,17 @@ Changes - freedreno: fix buffer import - freedreno/ir3: fix indirect cb0 load_ubo lowering - freedreno: clear last_fence after resource tracking + +Rohan Garg (5): + - ci: Split out radv build-testing on arm64 - ci: Drop the git dependency in tracie - tracie: Switch to using shutil.move for cross filesystem moves - tracie: Print results in a machine readable format - tracie: Reformat code to fix indentation + +Roland Scheidegger (7): + - gallivm: fix crash with bptc border color sampling - gallivm: fix crash in emit_get_buffer_size - gallivm: disable rgtc/latc SNORM accellerated fetches @@ -3439,9 +3783,15 @@ Changes - gallivm: fix rgtc2 format - gallivm: switch the mask6/mask7 cases for signed rgtc formats - gallivm: fix stream id fetch + +Roman Stratiienko (3): + - panfrost: Align Android makefiles with recent changes - lima: Add missing source file to Android.mk - panfrost: Align Android makefiles with recent changes + +Sagar Ghuge (13): + - intel/isl: Move get_format_encoding function to isl - intel/isl: Switch to R8_UNORM format for compatiblity - intel/tools: Handle illegal instruction @@ -3455,8 +3805,14 @@ Changes - intel/compiler: Track patch count threshold - anv: Set patch count threshold in 3DSTATE_HS - iris: Set patch count threshold in 3DSTATE_HS + +Samuel Iglesias Gonsálvez (2): + - radv: check buffer size in vkCreateBuffer() - radv: set sparseAddressSpaceSize to RADV_MAX_MEMORY_ALLOCATION_SIZE + +Samuel Pitoiset (197): + - aco: fix MUBUF VS input loads when expanding vec3 to vec4 on GFX6 - aco: do not use ds_{read,write}2 on GFX6 - gitlab-ci: disable a630 tests as mesa-cheza is down (again) @@ -3669,11 +4025,29 @@ Changes - radv: limit the Vulkan version to 1.1 for Android - radv: handle different Vulkan API versions correctly - radv: update the list of allowed Android extensions + +Satyajit Sahu (1): + - st/va: GetConfigAttributes: check profile and entrypoint combination + +Simon Ser (1): + - mesa: add support for NV_pixel_buffer_object + +Simon Zeni (1): + - mesa: enable GL_EXT_draw_instanced for gles2 + +Sonny Jiang (1): + - radeonsi: enable EXT_texture_shadow_lod + +Szymon Andrzejuk (1): + - virgl: Use align_free for align_malloc allocated buffer + +Tapani Pälli (27): + - intel/vec4: fix valgrind errors with vf_values array - glsl: fix a memory leak with resource_set - iris: fix aux buf map failure in 32bits app on Android @@ -3701,11 +4075,17 @@ Changes - mesa/st: initialize all winsys_handle fields for memory objects - anv: remove assert from GetImageMemoryRequirements[2] - st/mesa: destroy only own program variants when program is released + +Thomas Hellstrom (5): + - svga: Fix banded DMA upload - svga, winsys/svga: Fix persistent memory discard maps - svga: Treat forced coherent maps as maps of persistent memory - gallium/pipebuffer: Use persistent maps for slabs - winsys/svga: Optionally avoid caching buffer maps + +Thong Thai (7): + - Revert "st/va: Convert interlaced NV12 to progressive" - gallium/auxiliary/vl: fix bob compute shaders for deint yuv - st/va: remove unneeded code @@ -3713,6 +4093,9 @@ Changes - radeonsi: add 10-bit HEVC encode support for VCN2.0 devices - radeon: add support for 10-bit HEVC encoding to VCN 2.0 - st/va: add check for P010 and P016 encode/decode support + +Timothy Arceri (51): + - glsl: fix gl_nir_set_uniform_initializers() for image arrays - glsl: fix possible memory leak in nir uniform linker - glsl: set the correct number of samplers in a shader @@ -3765,6 +4148,9 @@ Changes - glsl: remove some duplicate code from the nir uniform linker - glsl: stop cascading errors if process_parameters() fails - glsl: fix slow linking of uniforms in the nir linker + +Timur Kristóf (90): + - aco/optimizer: Don't combine uniform bool s_and to s_andn2. - radv: Move some helper functions to the radv_shader.h header file. - aco: Extract setup_gs_variables into a separate function. @@ -3861,8 +4247,17 @@ Changes - radv: Use new linking helper to set default driver locations. - aco: Use new default driver locations. - radv: Use smaller esgs_itemsize for ACO. + +Tobias Jakobi (1): + - meson: Link Gallium Nine with ld_args_build_id + +Tomasz Pyra (1): + - gallium/swr: spin-lock performance improvement + +Tomeu Vizoso (34): + - panfrost: Print intended field when decoding - panfrost: Add more info to some assertions - pan/midgard: Handle nir_intrinsic_load_barycentric_centroid @@ -3898,7 +4293,13 @@ Changes - gitlab-ci: Test OpenGL ES 3.1 on virgl - gitlab-ci: Test Virgl with traces - panfrost: Add Bifrost texture trampoline BO to batch + +Uros Bizjak (1): + - doc: Update features.txt for r600 with misc supported features + +Vasily Khoruzhick (19): + - lima: handle early-z and pixel kill better - lima: implement PLB PP stream cache - lima: add RGBA5551 and RGBA4444 formats @@ -3918,10 +4319,19 @@ Changes - lima: add support for R and RG formats - lima: Implement lima_texture_subdata - lima: avoid situations when scissor minx > maxx or miny > maxy + +Veerabadhran (1): + - radeon/vce: Move global function pointer si_get_pic_param to local encoder structure Multi gpu use case broken when the function was global + +Vilya Harvey (1): + - zink. Don't set incorrect sType in VkImportMemoryFdInfoKHR struct + +Vinson Lee (16): + - swr: Fix build with GCC 10. - lima: Fix build with GCC 10. - swr: Fix GCC 4.9 checks. @@ -3940,6 +4350,15 @@ Changes variable. - r600/sfn: Use correct setter method. - freedreno: Add missing va_end. + +Yevhenii Kolesnikov (1): + - intel/compiler: fix cmod propagation optimisations + +Zhang, Boyuan (1): + - radeonsi: Add support for midstream bitrate change in encoder + +luc (1): + - zink: confused compilation macro usage for zink in target helpers. diff --git a/docs/relnotes/20.1.1.rst b/docs/relnotes/20.1.1.rst index 89151d1015f..a2debc34394 100644 --- a/docs/relnotes/20.1.1.rst +++ b/docs/relnotes/20.1.1.rst @@ -60,57 +60,128 @@ Bug fixes Changes ------- +Alyssa Rosenzweig (1): + - pan/bi: Fix emit_if successor assignment + +Andrii Simiklit (1): + - glsl: fix crash on glsl macro redefinition + +Charmaine Lee (1): + - llvmpipe: do not enable tessellation shader without llvm coroutines support + +Clément Guérin (1): + - radv: Always expose non-visible local memory type on dedicated GPUs + +Danylo Piliaiev (3): + - glsl: Don't replace lrp pattern with lrp if arguments are not floats - glsl: inline functions with unsupported return type before converting to nir - i965: Work around incorrect usage of glDrawRangeElements in UE4 + +Dave Airlie (1): + - llvmpipe: move coroutines out of noopt case + +Dylan Baker (1): + - vulkan-overlay/meson: use install_data instead of configure_file + +Eric Engestrom (5): + - docs/relnotes add sha256 sums to 20.1.0 - docs: drop new_features.txt - .pick_status.json: Update to 3a1a40b4431d505fa6487cd012ddb4b64387aee5 - glapi: remove deprecated .getchildren() that has been replace with an iterator - intel: fix gen_sort_tags.py + +Erik Faye-Lund (2): + - zink: Use store_dest_raw instead of storing an uint - nir: reuse existing psiz-variable + +Gert Wollny (1): + - nir: lower_tex: Don't normalize coordinates for TXF with RECT + +Ilia Mirkin (1): + - nouveau: allow invalidating coherent/persistent buffer backings + +Jason Ekstrand (2): + - intel/vec4: Stomp the return type of RESINFO to UINT32 - intel/fs: Fix unused texture coordinate zeroing on Gen4-5 + +Jonathan Marek (1): + - freedreno/a6xx: use nonbinning VS when GS is used + +Lionel Landwerlin (4): + - iris: fix BO destruction in error path - i965: don't forget to set screen on duped image - i965: fix export of GEM handles - iris: fix export of GEM handles + +Marek Olšák (1): + - radeonsi: add a hack to disable TRUNC_COORD for shadow samplers + +Neha Bhende (1): + - util: Initialize pipe_shader_state for passthrough and transform shaders + +Peter Seiderer (3): + - vc4_bufmgr: fix time_t printf - pan_bo.h: add time.h include for time_t - v3d_bufmgr: fix time_t printf + +Pierre-Eric Pelloux-Prayer (3): + - winsys/radeon: do not cast bo->va as void\* - ac/surface: set SCANOUT if surf->is_displayable - ac/surface: fix epitch when modifying surf_pitch + +Rhys Perry (4): + - aco: fix interaction with 3f branch workaround and p_constaddr - aco: consider SDWA during value numbering - aco: check instruction format before waiting for a previous SMEM store - aco: preserve more fields when combining additions into SMEM + +Rob Clark (1): + - freedreno/computerator: fix missing dependency on generated header + +Samuel Pitoiset (5): + - spirv,radv,anv: implement no-op VK_GOOGLE_user_type - aco: fix register allocation for subdword instructions on GFX10 - radv: enable zero VRAM for Doom Eternal - radv: enable zero VRAM for all VKD3D (DX12->VK) games - nir/lower_explicit_io: fix NON_UNIFORM access for UBO loads + +Satyeshwar Singh (1): + - intel/dev: Don't consider all TGL SKUs as GT1 only + +Timothy Arceri (2): + - radv: fix regression with builtin cache - glsl: fix potential slow compile times for GLSLOptimizeConservatively + +Vinson Lee (8): + - pan/bi: Initialize struct fma_op_info member extended. - zink: Check fopen result. - etnaviv: Fix memory leak on error path. -- 2.30.2