git.libre-soc.org Git - gem5.git/commit

author	Tony Gutierrez <anthony.gutierrez@amd.com>
	Fri, 15 Jun 2018 20:00:58 +0000 (16:00 -0400)
committer	Anthony Gutierrez <anthony.gutierrez@amd.com>
	Thu, 16 Jul 2020 20:37:22 +0000 (20:37 +0000)
commit	af621cd6e66921b0b5890d72c2ccf3d7ef6f3ac3
tree	2b6448863920c1ead530d1102bd470a851abfcb1	tree
parent	c2641eec894da38ff6bc35be8c9241b322f2bb2f	commit \| diff

gpu-compute, arch-gcn3: refactor barriers

Barriers were not modeled properly. Firstly, barriers were
allocated to each WG that was launched, which is not
correct, and the CU would provide an infinite number
of barrier slots. There are a limited number of barrier slots
per CU in reality. In addition, the CU will not allocate
barrier slots to WGs with a single WF (nothing to sync if
only one WF).

Beyond modeling problems, there also the issue of deadlock.
The barrier could deadlock because not all WFs are freed
from the barrier once it has been satisfied. Instead, we
relied on the scoreboard stage to release them lazily,
one-by-one.

Under this implementation the scoreboard may not fully release
all WFs participating in a barrier; this happens because the
first WF to be freed from the barrier could reach an s_barrier
instruction again, forever causing the barrier counts across
WFs to be out-of-sync.

This change refactors the barrier logic to:

1) Create a proper barrier slot implementation

2) Enforce (via a parameter) the number of barrier
   slots on the CU.

3) Simplify the logic and cleanup the code (i.e., we
   no longer iterate through the entire WF list each
   time we check if a barrier is satisfied).

4) Fix deadlock issues.

Change-Id: If53955b54931886baaae322640a7b9da7a1595e0
Reviewed-on: https://gem5-review.googlesource.com/c/public/gem5/+/29943
Reviewed-by: Anthony Gutierrez <anthony.gutierrez@amd.com>
Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com>
Tested-by: kokoro <noreply+kokoro@google.com>

src/arch/gcn3/insts/instructions.cc		diff \| blob \| history
src/gpu-compute/GPU.py		diff \| blob \| history
src/gpu-compute/compute_unit.cc		diff \| blob \| history
src/gpu-compute/compute_unit.hh		diff \| blob \| history
src/gpu-compute/scoreboard_check_stage.cc		diff \| blob \| history
src/gpu-compute/shader.cc		diff \| blob \| history
src/gpu-compute/wavefront.cc		diff \| blob \| history
src/gpu-compute/wavefront.hh		diff \| blob \| history