git.libre-soc.org Git - gem5.git/log

Andreas Hansson [Tue, 23 Feb 2016 08:27:20 +0000 (03:27 -0500)]

scons: Add missing override to appease clang

Make clang happy...again.
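
A minimal sketch of the kind of issue a commit like this addresses, assuming the
warning involved is clang's -Winconsistent-missing-override (the log does not
say which one). Base, Derived, and latencyOf are hypothetical names for
illustration, not gem5 types.

```cpp
#include <cassert>
#include <string>

// Hypothetical classes for illustration; these are not gem5 types.
struct Base {
    virtual ~Base() = default;
    virtual std::string name() const { return "base"; }
    virtual int latency() const { return 1; }
};

struct Derived : Base {
    // This member already carries the keyword.
    std::string name() const override { return "derived"; }

    // If 'override' were dropped here, clang would report an inconsistent
    // missing override, because another member of the class uses it.
    int latency() const override { return 2; }
};

// Calls through a base reference so the virtual dispatch is exercised.
inline int latencyOf(const Base &b) { return b.latency(); }
```

Marking every overriding member keeps the class consistent and also lets the
compiler reject a signature that no longer matches the base class.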

Tony Gutierrez [Thu, 18 Feb 2016 15:50:16 +0000 (10:50 -0500)]

ruby: move range change send from RubyPort to derived classes.

John Kalamatianos [Thu, 18 Feb 2016 15:42:03 +0000 (10:42 -0500)]

gpu: fix bugs with MemFence, Flat Instrs and Resource utilization

Memory Fence is now flagged as Global Memory only, to avoid oversubscribing
resources.
Flat instructions now check whether the Shared Memory resource is busy, to
avoid oversubscribing resources.
All WaitClass resources now use cycles (not ticks) to register the number
of pipe stages between Scoreboard and Execute, consistent with the
instruction scheduling logic, which has always used clock cycles.

Tony Gutierrez [Wed, 17 Feb 2016 16:46:02 +0000 (11:46 -0500)]

gpu-compute: remove brig_object.hh from hsa_object.cc

brig_object.hh is specific to the HSAIL ISA, and hence should not be
included in ISA-agnostic code.

Tony Gutierrez [Wed, 17 Feb 2016 16:31:54 +0000 (11:31 -0500)]

ruby: send address ranges from RubyPort

Andreas Hansson [Wed, 17 Feb 2016 08:56:20 +0000 (03:56 -0500)]

scons: Enable building with the gcc/clang Address Sanitizer

Allow the user to easily build gem5 with the Address Sanitizer, part
of both gcc and clang these days.

Andreas Hansson [Mon, 15 Feb 2016 08:40:32 +0000 (03:40 -0500)]

misc: Add missing overrides to appease clang

Since the last round of fixes a few new issues have snuck in. We
should consider switching the regression runs to clang.

Andreas Hansson [Mon, 15 Feb 2016 08:40:04 +0000 (03:40 -0500)]

mem: Avoid using invalid iterator in cache lock list traversal

Fix up issue highlighted by Valgrind and the clang Address Sanitizer.
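
The log does not show the actual fix. As a hedged sketch of the general
pattern, erasing from a std::list while traversing it is safe when the loop
continues from the iterator that erase() returns. LockRecord and
clearOtherLocks are hypothetical names, not gem5's code.

```cpp
#include <cassert>
#include <list>

// Hypothetical stand-in for a lock record; not gem5's actual type.
struct LockRecord {
    int contextId;
};

// Remove every record except those owned by keepContext.
// std::list::erase invalidates only the erased iterator and returns the
// next valid one, so the loop never dereferences an invalid iterator.
inline void clearOtherLocks(std::list<LockRecord> &locks, int keepContext)
{
    for (auto it = locks.begin(); it != locks.end(); ) {
        if (it->contextId != keepContext) {
            it = locks.erase(it);   // continue from the returned iterator
        } else {
            ++it;                   // nothing erased, advance normally
        }
    }
}
```

Incrementing an iterator after it has been erased is the classic bug that
Valgrind and the Address Sanitizer flag in loops of this shape.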

Michael LeBeane [Mon, 15 Feb 2016 01:28:48 +0000 (20:28 -0500)]

ruby: make DMASequencer inherit from RubyPort

This patch essentially rolls back 10518:30e3715c9405 to make RubyPort the
parent class of DMASequencer. It removes redundant code and restores some
features which were lost when directly inheriting from MemObject. For
example, DMASequencer can now communicate with other devices using PIO, which
is useful for memory-mapped communication between multiple DMADevices.

Michael LeBeane [Sat, 13 Feb 2016 17:36:43 +0000 (12:36 -0500)]

configs: add command-line option to stop debug output

This patch adds a --debug-end flag to main.py so that debug output can be
stopped at a specified tick, while allowing the simulation to continue. It is
useful in situations where you would like to produce a trace for a region of
interest while still collecting stats for the entire run. This is in contrast
to the currently existing --debug-break flag, which terminates the simulation
at that tick.

Michael LeBeane [Sat, 13 Feb 2016 17:33:07 +0000 (12:33 -0500)]

syscall_emul: Implement clock_getres() system call

This patch implements the clock_getres() system call for ARM and x86 in Linux
SE mode.
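
For context, a guest program would exercise this system call roughly as
follows. This is a sketch using the standard POSIX interface. The helper name
realtimeResolutionNs is hypothetical, and the value the emulated call reports
is not stated in the log.

```cpp
#include <cassert>
#include <time.h>

// Query the resolution of CLOCK_REALTIME in nanoseconds.
// Returns -1 if the system call fails.
inline long long realtimeResolutionNs()
{
    timespec res{};
    if (clock_getres(CLOCK_REALTIME, &res) != 0)
        return -1;
    return static_cast<long long>(res.tv_sec) * 1000000000LL + res.tv_nsec;
}
```

A binary that calls clock_getres() depends on the simulator implementing the
system call when run in SE mode, which is what this patch adds.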

Andreas Hansson [Wed, 10 Feb 2016 09:08:27 +0000 (04:08 -0500)]

stats: Update stats to reflect changes to cache and crossbar

Andreas Hansson [Wed, 10 Feb 2016 09:08:25 +0000 (04:08 -0500)]

mem: Be less conservative in clearing load locks in the cache

Avoid being overly conservative in clearing load locks in the cache,
and allow writes to the line if they are from the same context. This
is in line with ALPHA and ARM.

Andreas Hansson [Wed, 10 Feb 2016 09:08:25 +0000 (04:08 -0500)]

mem: Move the point of coherency to the coherent crossbar

This patch introduces the ability to make the coherent crossbar the
point of coherency. If so, the crossbar does not forward packets where
a cache with ownership has already committed to responding, and also
does not forward any coherency-related packets that are not intended
for a downstream memory controller. Thus, invalidations and upgrades
are turned around in the crossbar, and the memory controller only sees
normal reads and writes.

In addition, this patch moves the express snoop promotion of a packet
to the crossbar, thus allowing the downstream cache to check the
express snoop flag (as it should) for bypassing any blocking, rather
than relying on whether a cache is responding or not.

Andreas Hansson [Wed, 10 Feb 2016 09:08:24 +0000 (04:08 -0500)]

mem: Align cache behaviour in atomic when upstream is responding

Adopt the same flow as in timing mode, where the caches on the path to
memory get to keep the line (if present), and we use the
responderHadWritable flag to determine if we need to forward the
(invalidating) packet or not.

Andreas Hansson [Wed, 10 Feb 2016 09:08:24 +0000 (04:08 -0500)]

mem: Align how snoops are handled when hitting writebacks

This patch unifies the snoop handling in case of hitting writebacks
with how we handle snoops hitting in the tags. As a result, we end up
using the same optimisation as the normal snoops, where we inform the
downstream cache if we encounter a line in Modified (writable and
dirty) state, which enables us to avoid sending out express snoops to
invalidate any Shared copies of the line. A few regressions
consequently change, as some transactions are sunk higher up in the
cache hierarchy.

Andreas Hansson [Wed, 10 Feb 2016 09:08:24 +0000 (04:08 -0500)]

mem: Deduce if cache should forward snoops

This patch changes how the cache determines if snoops should be
forwarded from the memory side to the CPU side. Instead of having a
parameter, the cache now looks at the port connected on the CPU side,
and if it is a snooping port, then snoops are forwarded. This is less
error-prone, and there are fewer parameters to worry about.

The patch also tidies up the CPU classes to ensure that their I-side
port is not snooping by removing overrides to the snoop request
handler, such that snoop requests will panic via the default
MasterPort implementation.
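
The idea of deducing the behaviour from the connected port, rather than from a
parameter, can be sketched as below. CpuSidePeer and SketchCache are
hypothetical, heavily simplified types, not gem5's port or cache classes.

```cpp
#include <cassert>

// Hypothetical, simplified view of whatever is connected on the CPU side.
struct CpuSidePeer {
    bool snooping;
    bool isSnooping() const { return snooping; }
};

// Hypothetical cache that takes no separate configuration flag for snoop
// forwarding; the decision follows directly from the connected peer.
struct SketchCache {
    const CpuSidePeer &peer;

    explicit SketchCache(const CpuSidePeer &p) : peer(p) {}

    // Forward snoops upward only if the CPU-side peer snoops.
    bool forwardSnoops() const { return peer.isSnooping(); }
};
```

With a single source of truth, the connection and the behaviour cannot
disagree, which is the error-proneness the commit message refers to.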

Curtis Dunham [Mon, 8 Feb 2016 19:39:45 +0000 (13:39 -0600)]

scons: always generate sim/tags.cc

Due to insufficient build deps, the checkpoint tags might not get
updated; this commit solves this. Due to the uncommon nature of the
build target, regenerating tags.cc is a fairly clean solution. Since
SCons hashes file contents, it won't recompile anything unless a new
checkpoint upgrader is actually added.

--HG--
extra : amend_source : ed3879da7668554693f697076deaf5029cc9b954

Alexandru Dutu [Sun, 7 Feb 2016 01:21:20 +0000 (17:21 -0800)]

x86: revamp cmpxchg8b/cmpxchg16b implementation

The previous implementation did a pair of nested RMW operations,
which isn't compatible with the way that locked RMW operations are
implemented in the cache models.  It was convenient though in that
it didn't require any new micro-ops, and supported cmpxchg16b using
64-bit memory ops.  It also worked in AtomicSimpleCPU where
atomicity was guaranteed by the core and not by the memory system.
It did not work with timing CPU models though.

This new implementation defines new 'split' load and store micro-ops
which allow a single memory operation to use a pair of registers as
the source or destination, then uses a single ldsplit/stsplit RMW
pair to implement cmpxchg.  This patch requires support for 128-bit
memory accesses in the ISA (added via a separate patch) to support
cmpxchg16b.
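
As a rough functional model of the single load, compare, and single store
described above (ignoring atomicity and the micro-op encoding entirely), the
architectural effect of cmpxchg16b on a register pair can be sketched as
follows. Pair64 and cmpxchgPair are hypothetical names, not gem5 micro-ops.

```cpp
#include <cassert>
#include <cstdint>

// A 128-bit quantity held as a register pair, as with RDX:RAX or RCX:RBX.
struct Pair64 {
    std::uint64_t lo;
    std::uint64_t hi;
};

inline bool operator==(const Pair64 &a, const Pair64 &b)
{
    return a.lo == b.lo && a.hi == b.hi;
}

// Functional model only: one load into a pair, a compare, one store from a
// pair. Returns the value the instruction would leave in ZF.
inline bool cmpxchgPair(Pair64 &mem, Pair64 &expected, const Pair64 &desired)
{
    const Pair64 loaded = mem;      // single load filling two registers
    if (loaded == expected) {
        mem = desired;              // single store sourced from two registers
        return true;                // ZF = 1
    }
    expected = loaded;              // comparison failed: return memory value
    return false;                   // ZF = 0
}
```

The point of the split micro-ops is that the load and the store each remain
one memory operation, so the pair forms an ordinary locked RMW sequence.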
