Korey Sewell [Wed, 23 Feb 2011 19:26:55 +0000 (14:26 -0500)]
configs: cache: add cache line size option
Korey Sewell [Wed, 23 Feb 2011 06:01:46 +0000 (01:01 -0500)]
configs: set default cache params
It's confusing (especially to new users), when you are setting some standard
parameters (as defined in Options.py) and they aren't reflected in the simulations
so we might as well link the settings in CacheConfig.py to those in Options.py
Korey Sewell [Wed, 23 Feb 2011 05:58:42 +0000 (00:58 -0500)]
ruby: extend dprintfs for RubyGenerated TraceFlag
"executing" isnt a very descriptive debug message and in going through the
output you get multiple messages that say "executing" but nothing to help
you parse through the code/execution.
So instead, at least print out the name of the action that is taking
place in these functions.
Korey Sewell [Wed, 23 Feb 2011 05:58:40 +0000 (00:58 -0500)]
ruby: cleaning up RubyQueue and RubyNetwork dprintfs
Overall, continue to progress Ruby debug messages to more of the normal M5
debug message style
- add a name() to the Ruby Throttle & PerfectSwitch objects so that the debug output
isn't littered w/"global:" everywhere.
- clean up messages that print over multiple lines when possible
- clean up duplicate prints in the message buffer
Brad Beckmann [Tue, 22 Feb 2011 19:16:40 +0000 (11:16 -0800)]
m5: merged in hammer fix
Nilay Vaish [Sat, 19 Feb 2011 23:32:43 +0000 (17:32 -0600)]
Ruby: Machine Type missing in MOESI CMP directory protocol
In certain actions of the L1 cache controller, while creating an outgoing
message, the machine type was not being set. This results in a
segmentation fault when trace is collected. Joseph Pusudesris provided
his patch for fixing this issue.
Nilay Vaish [Sat, 19 Feb 2011 23:32:00 +0000 (17:32 -0600)]
Ruby: clean MOESI CMP directory protocol
The L1 cache controller file contains references to foo and goo queues, which
are not in use at all. These have been removed.
Korey Sewell [Fri, 18 Feb 2011 19:35:15 +0000 (14:35 -0500)]
m5: merge inorder/release-notes/make_release changes
Korey Sewell [Fri, 18 Feb 2011 19:31:37 +0000 (14:31 -0500)]
inorder: regr-update: reduce dynamic mem. use to speedup sims
previous changesets took a closer look at memory mgmt in the inorder model and sought to avoid
dynamic memory mgmt (for access to pipeline resources) as much as possible. For the regressions
that were run, the sims are about 2x speedup from changeset 7726 which is the last change
since the recent commits in Feb. (note: these regressions now are 4-issue CPUs instead of just 1-issue)
Korey Sewell [Fri, 18 Feb 2011 19:31:31 +0000 (14:31 -0500)]
inorder: add names and slot #s to res. dprints
Korey Sewell [Fri, 18 Feb 2011 19:30:38 +0000 (14:30 -0500)]
inorder: ignore nops in execution unit
Korey Sewell [Fri, 18 Feb 2011 19:30:05 +0000 (14:30 -0500)]
inorder: update graduation unit
make sure instructions are able to commit before writing back to the RF
do not commit more than 1 non-speculative instruction per cycle
Korey Sewell [Fri, 18 Feb 2011 19:29:48 +0000 (14:29 -0500)]
inorder: recognize isSerializeAfter flag
keep track of when an instruction needs the execution
behind it to be serialized. Without this, in SE Mode
instructions can execute behind a system call exit().
Korey Sewell [Fri, 18 Feb 2011 19:29:44 +0000 (14:29 -0500)]
inorder: update default thread size(=1)
a lot of structures get allocated based off that MaxThreads parameter so this is an
effort to not abuse it
Korey Sewell [Fri, 18 Feb 2011 19:29:40 +0000 (14:29 -0500)]
inorder: don't overuse getLatency()
resources don't need to call getLatency because the latency is already a member
in the class. If there is some type of special case where different instructions
impose a different latency inside a resource then we can revisit this and
add getLatency() back in
Korey Sewell [Fri, 18 Feb 2011 19:29:31 +0000 (14:29 -0500)]
inorder: update max. resource bandwidths
each resource has a certain # of requests it can take per cycle. update the #s here
to be more realistic based off of the pipeline width and if the resource needs to
be accessed on multiple cycles
Korey Sewell [Fri, 18 Feb 2011 19:29:26 +0000 (14:29 -0500)]
inorder: cleanup in destructors
cleanup hanging pointers and other cruft in the destructors
Korey Sewell [Fri, 18 Feb 2011 19:29:17 +0000 (14:29 -0500)]
inorder: fix cache/fetch unit memory leaks
---
need to delete the cache request's data on clearRequest() now that we are recycling
requests
---
fetch unit needs to deallocate the fetch buffer blocks when they are replaced or
squashed.
Korey Sewell [Fri, 18 Feb 2011 19:29:02 +0000 (14:29 -0500)]
inorder: remove events for zero-cycle resources
if a resource has a zero cycle latency (e.g. RegFile write), then dont allocate an event
for it to use
Korey Sewell [Fri, 18 Feb 2011 19:28:37 +0000 (14:28 -0500)]
inorder: update pipeline interface for handling finished resource reqs
formerly, to free up bandwidth in a resource, we could just change the pointer in that resource
but at the same time the pipeline stages had visibility to see what happened to a resource request.
Now that we are recycling these requests (to avoid too much dynamic allocation), we can't throw
away the request too early or the pipeline stage gets bad information. Instead, mark when a request
is done with the resource all together and then let the pipeline stage call back to the resource
that it's time to free up the bandwidth for more instructions
*** inteface notes ***
- When an instruction completes and is done in a resource for that cycle, call done()
- When an instruction fails and is done with a resource for that cycle, call done(false)
- When an instruction completes, but isnt finished with a resource, call completed()
- When an instruction fails, but isnt finished with a resource, call completed(false)
* * *
inorder: tlbmiss wakeup bug fix
Korey Sewell [Fri, 18 Feb 2011 19:28:30 +0000 (14:28 -0500)]
inorder: remove request map, use request vector
take away all instances of reqMap in the code and make all references use the built-in
request vectors inside of each resource. The request map was dynamically allocating
a request per instruction. The request vector just allocates N number of requests
during instantiation and then the surrounding code is fixed up to reuse those N requests
***
setRequest() and clearRequest() are the new accessors needed to define a new
request in a resource
Korey Sewell [Fri, 18 Feb 2011 19:28:22 +0000 (14:28 -0500)]
inorder: add valid bit for resource requests
this will allow us to reuse resource requests within a resource instead
of always dynamically allocating
Korey Sewell [Fri, 18 Feb 2011 19:28:10 +0000 (14:28 -0500)]
inorder: remove reqRemoveList
we are going to be getting away from creating new resource requests for every
instruction so no more need to keep track of a reqRemoveList and clean it up
every tick
Korey Sewell [Fri, 18 Feb 2011 19:27:52 +0000 (14:27 -0500)]
inorder: initialize res. req. vectors based on resource bandwidth
first change in an optimization that will stop InOrder from allocating new memory for every instruction's
request to a resource. This gets expensive since every instruction needs to access ~10 requests before
graduation. Instead, the plan is to allocate just enough resource request objects to satisfy each resource's
bandwidth (e.g. the execution unit would need to allocate 3 resource request objects for a 1-issue pipeline
since on any given cycle it could have 2 read requests and 1 write request) and then let the instructions
contend and reuse those allocated requests. The end result is a smaller memory footprint for the InOrder model
and increased simulation performance
Nathan Binkert [Wed, 16 Feb 2011 15:57:04 +0000 (10:57 -0500)]
merge alpha system files into tree
Gabe Black [Wed, 16 Feb 2011 07:22:32 +0000 (23:22 -0800)]
Util: Get rid of the make_release.py script.
Since we're not doing releases any more we don't really need this script. If
we need it in the future, we can resurrect it from the history.
Nathan Binkert [Wed, 16 Feb 2011 06:34:02 +0000 (00:34 -0600)]
Cleanup system directory to fit into modern M5 tree
Nathan Binkert [Wed, 16 Feb 2011 06:34:01 +0000 (00:34 -0600)]
copyright: update copyright on alpha system files
Gabe Black [Tue, 15 Feb 2011 23:58:16 +0000 (15:58 -0800)]
X86: Get rid of "inline" on the MicroPanic constructor in decoder.cc.
This was making certain versions of gcc omit the function from the object file
which would break the build.
Gabe Black [Tue, 15 Feb 2011 05:36:37 +0000 (21:36 -0800)]
Info: Clean up some info files.
Get rid of RELEASE_NOTES since we no longer do releases, update some of the
information in README, and update the date in LICENSE.
Nilay Vaish [Mon, 14 Feb 2011 22:14:54 +0000 (16:14 -0600)]
Ruby: Improve Change PerfectSwitch's wakeup function
Currently the wakeup function for the PerfectSwitch contains three loops -
loop on number of virtual networks
loop on number of incoming links
loop till all messages for this (link, network) have been routed
With an 8 processor mesh network and Hammer protocol, about 11-12% of the
was observed to have been spent in this function, which is the highest
amongst all the functions. It was found that the innermost loop is executed
about 45 times per invocation of the wakeup function, when each invocation
of the wakeup function processes just about one message.
The patch tries to do away with the redundant executions of the innermost
loop. Counters have been added for each virtual network that record the
number of messages that need to be routed for that virtual network. The
inner loops are only executed when the number of messages for that particular
virtual network > 0. This does away with almost 80% of the executions of the
innermost loop. The function now consumes about 5-6% of the total execution
time.
Gabe Black [Mon, 14 Feb 2011 01:46:04 +0000 (17:46 -0800)]
X86: Update stats for the improved branch detection/prediction.
Gabe Black [Mon, 14 Feb 2011 01:45:47 +0000 (17:45 -0800)]
X86: Detect branches taking into account instruction size.
The size of the current instruction determines what the npc should be if
there's no branching.
Gabe Black [Mon, 14 Feb 2011 01:45:30 +0000 (17:45 -0800)]
X86: Update stats now that the dest reg isn't read unnecessarily to set flags.
Gabe Black [Mon, 14 Feb 2011 01:45:12 +0000 (17:45 -0800)]
X86: Put the result used for flags in an intermediate variable.
Using the destination register directly causes the ISA parser to treat it as a
source even if none of the original bits are used.
Gabe Black [Mon, 14 Feb 2011 01:44:32 +0000 (17:44 -0800)]
X86: Update stats for the reduced register reads.
Gabe Black [Mon, 14 Feb 2011 01:44:24 +0000 (17:44 -0800)]
X86: Don't read in dest regs if all bits are replaced.
In x86, 32 and 64 bit writes to registers in which registers appear to be 32 or
64 bits wide overwrite all bits of the destination register. This change
removes false dependencies in these cases where the previous value of a
register doesn't need to be read to write a new value. New versions of most
microops are created that have a "Big" suffix which simply overwrite their
destination, and the right version to use is selected during microop
allocation based on the selected data size.
This does not change the performance of the O3 CPU model significantly, I
assume because there are other false dependencies from the condition code bits
in the flags register.
Gabe Black [Mon, 14 Feb 2011 01:42:56 +0000 (17:42 -0800)]
X86: On a bad microopc, return a microop that returns a fault that panics.
This way a bad micropc will have to get all the way to commit before killing
the simulation. This accounts for misspeculated branches.
Gabe Black [Mon, 14 Feb 2011 01:42:05 +0000 (17:42 -0800)]
X86: Define fault objects to carry debug messages.
These faults can panic/warn/warn_once, etc., instead of instructions doing
that themselves directly. That way, instructions can be speculatively
executed, and only if they're actually going to commit will their fault be
invoked and the panic, etc., happen.
Gabe Black [Mon, 14 Feb 2011 01:41:10 +0000 (17:41 -0800)]
X86: Only reset npc to reflect instruction length once.
When redirecting fetch to handle branches, the npc of the current pc state
needs to be left alone. This change makes the pc state record whether or not
the npc already reflects a real value by making it keep track of the current
instruction size, or if no size has been set.
Gabe Black [Mon, 14 Feb 2011 01:40:07 +0000 (17:40 -0800)]
O3: Fetch from the microcode ROM when needed.
Ali Saidi [Sun, 13 Feb 2011 21:51:15 +0000 (16:51 -0500)]
O3: Fix GCC 4.2.4 complaint
Nilay Vaish [Sat, 12 Feb 2011 17:41:20 +0000 (11:41 -0600)]
Ruby: Reorder Cache Lookup in Protocol Files
The patch changes the order in which L1 dcache and icache are looked up when
a request comes in. Earlier, if a request came in for instruction fetch, the
dcache was looked up before the icache, to correctly handle self-modifying
code. But, in the common case, dcache is going to report a miss and the
subsequent icache lookup is going to report a hit. Given the invariant -
caches under the same controller keep track of disjoint sets of cache blocks,
we can move the icache lookup before the dcache lookup. In case of a hit in
the icache, using our invariant, we know that the dcache would have reported
a miss. In case of a miss in the icache, we know that icache would have
missed even if the dcache was looked up before looking up the icache.
Effectively, we are doing the same thing as before, though in the common case,
we expect reduction in the number of lookups. This was empirically confirmed
for MOESI hammer. The ratio lookups to access requests is now about 1.1 to 1.
Korey Sewell [Sat, 12 Feb 2011 15:14:52 +0000 (10:14 -0500)]
inorder:regress: host-inst-rate improved ~58%
there are still only a few inorder benchmark but for the lengthier benchmarks (twolf and vortext)
the latest changes to how instruction scheduling (how instructions figure out what they want to
do on each pipeline stage in the inorder model) were able to improve performance by a nice
amount... The latest results for the inorder model process about 100k insts/second
(note: 58% is over the last time run on 64-bit pool machines at UM)
Korey Sewell [Sat, 12 Feb 2011 15:14:48 +0000 (10:14 -0500)]
inorder: clean up the old way of inst. scheduling
remove remnants of old way of instruction scheduling which dynamically allocated
a new resource schedule for every instruction
Korey Sewell [Sat, 12 Feb 2011 15:14:45 +0000 (10:14 -0500)]
inorder: utilize cached skeds in pipeline
allow the pipeline and resources to use the cached instruction schedule and resource
sked iterator
Korey Sewell [Sat, 12 Feb 2011 15:14:43 +0000 (10:14 -0500)]
inorder: define iterator for resource schedules
resource skeds are divided into two parts: front end (all insts) and back end (inst. specific)
each of those are implemented as separate lists, so this iterator wraps around
the traditional list iterator so that an instruction can walk it's schedule but seamlessly
transfer from front end to back end when necessary
Korey Sewell [Sat, 12 Feb 2011 15:14:40 +0000 (10:14 -0500)]
inorder: stage scheduler for front/back end schedule creation
add a stage scheduler class to replace InstStage in pipeline_traits.cc
use that class to define a default front-end, resource schedule that all
instructions will follow. This will also replace the back end schedule in
pipeline_traits.cc. The reason for adding this is so that we can cache
instruction schedules in the future instead of calling the same function
over/over again as well as constantly dynamically alllocating memory on
every instruction to try to figure out it's schedule
Korey Sewell [Sat, 12 Feb 2011 15:14:36 +0000 (10:14 -0500)]
inorder: cache instruction schedules
first step in a optimization to not dynamically allocate an instruction schedule
for every instruction but rather used cached schedules
Korey Sewell [Sat, 12 Feb 2011 15:14:34 +0000 (10:14 -0500)]
inorder: comments for resource sked class
Korey Sewell [Sat, 12 Feb 2011 15:14:32 +0000 (10:14 -0500)]
inorder: remove unused file
inst_buffer file isn't used , so remove it
Korey Sewell [Sat, 12 Feb 2011 15:14:26 +0000 (10:14 -0500)]
inorder: remove unused isa ops
pass/fail ops were used for testing but arent part of isa
Ali Saidi [Sat, 12 Feb 2011 00:29:36 +0000 (18:29 -0600)]
Stats: Update the statistics for vnc patch.
Ali Saidi [Sat, 12 Feb 2011 00:29:36 +0000 (18:29 -0600)]
VNC/ARM: Use VNC server and add support to boot into X11
Ali Saidi [Sat, 12 Feb 2011 00:29:35 +0000 (18:29 -0600)]
VNC: Add VNC server to M5
Ali Saidi [Sat, 12 Feb 2011 00:29:35 +0000 (18:29 -0600)]
Serialization: Allow serialization of stl lists
Giacomo Gabrielli [Sat, 12 Feb 2011 00:29:35 +0000 (18:29 -0600)]
O3: Fix pipeline restart when a table walk completes in the fetch stage.
When a table walk is initiated by the fetch stage, the CPU can
potentially move to the idle state and never wake up.
The fetch stage must call cpu->wakeCPU() when a translation completes
(in finishTranslation()).
Giacomo Gabrielli [Sat, 12 Feb 2011 00:29:35 +0000 (18:29 -0600)]
O3: Fix a few bugs in the TableWalker object.
Uncacheable requests were set as such only in atomic mode.
currState->delayed is checked in place of currState->timing for resetting
currState in atomic mode.
Ali Saidi [Sat, 12 Feb 2011 00:29:35 +0000 (18:29 -0600)]
SimpleCPU: Fix a case where a DTLB fault redirects fetch and an I-side walk occurs.
This change fixes an issue where a DTLB fault occurs and redirects fetch to
handle the fault and the ITLB requires a walk which delays translation. In this
case the status of the cpu isn't updated appropriately, and an additional
instruction fetch occurs. Eventually this hits an assert as multiple instruction
fetches are occuring in the system and when the second one returns the
processor is in the wrong state.
Some asserts below are removed because it was always true (typo) and the state
after the initiateAcc() the processor could be in any valid state when a
d-side fault occurs.
Giacomo Gabrielli [Sat, 12 Feb 2011 00:29:35 +0000 (18:29 -0600)]
O3: Enhance data address translation by supporting hardware page table walkers.
Some ISAs (like ARM) relies on hardware page table walkers. For those ISAs,
when a TLB miss occurs, initiateTranslation() can return with NoFault but with
the translation unfinished.
Instructions experiencing a delayed translation due to a hardware page table
walk are deferred until the translation completes and kept into the IQ. In
order to keep track of them, the IQ has been augmented with a queue of the
outstanding delayed memory instructions. When their translation completes,
instructions are re-executed (only their initiateAccess() was already
executed; their DTB translation is now skipped). The IEW stage has been
modified to support such a 2-pass execution.
Ali Saidi [Sat, 12 Feb 2011 00:29:35 +0000 (18:29 -0600)]
ARM: Fix timer calculations.
The timer calculations were a bit off so time would run faster than
it otherwise should
Ali Saidi [Sat, 12 Feb 2011 00:29:35 +0000 (18:29 -0600)]
Timesync: Make sure timesync event is setup after curTick is unserialized
Setup initial timesync event in initState or loadState so that curTick has
been updated to the new value, otherwise the event is scheduled in the past.
Brad Beckmann [Thu, 10 Feb 2011 21:31:52 +0000 (13:31 -0800)]
merged with Ali X11 patch
Brad Beckmann [Thu, 10 Feb 2011 21:28:23 +0000 (13:28 -0800)]
MOESI_hammer: fixed wakeup for SS->S transistion
Ali Saidi [Thu, 10 Feb 2011 04:27:37 +0000 (22:27 -0600)]
Ext: Add X11 keysym header files to ext directory.
Brad Beckmann [Thu, 10 Feb 2011 00:02:09 +0000 (16:02 -0800)]
ruby: removed duplicate make response call
Brad Beckmann [Wed, 9 Feb 2011 02:07:54 +0000 (18:07 -0800)]
regess: protocol regression tester updates
Brad Beckmann [Tue, 8 Feb 2011 23:53:33 +0000 (15:53 -0800)]
memtest: due to contention increase, increased deadlock threshold
Brad Beckmann [Tue, 8 Feb 2011 23:52:44 +0000 (15:52 -0800)]
config: fixed minor bug connecting dma devices to ruby
Nilay Vaish [Tue, 8 Feb 2011 13:47:02 +0000 (07:47 -0600)]
MESI CMP: Unset TBE pointer in L2 cache controller
The TBE pointer in the MESI CMP implementation was not being set to NULL
when the TBE is deallocated. This resulted in segmentation fault on testing
the protocol when the ProtocolTrace was switched on.
Gabe Black [Tue, 8 Feb 2011 03:23:13 +0000 (19:23 -0800)]
Stats: Re update stats.
Gabe Black [Tue, 8 Feb 2011 03:23:11 +0000 (19:23 -0800)]
Stats: Back out broken update.
Tim Harris [Mon, 7 Feb 2011 23:18:52 +0000 (15:18 -0800)]
X86: Obey the wp bit of CR0.
If cr0.wp ("write protect" bit) is clear then do not generate page faults when
writing to write-protected pages in kernel mode.
Tim Harris [Mon, 7 Feb 2011 23:16:27 +0000 (15:16 -0800)]
X86: Use all 64 bits of the lstar register in the SYSCALL_64 macroop.
During SYSCALL_64, use dataSize=8 when handling new rip (ref
http://www.intel.com/Assets/PDF/manual/253668.pdf 5.8.8 IA32_LSTAR is a 64-bit
address)
Tim Harris [Mon, 7 Feb 2011 23:12:59 +0000 (15:12 -0800)]
X86: Fix JMP_FAR_I to unpack a far pointer correctly.
JMP_FAR_I was unpacking its far pointer operand using sll instead of srl like
it should, and also putting the components in the wrong registers for use by
other microcode.
Tim Harris [Mon, 7 Feb 2011 23:05:28 +0000 (15:05 -0800)]
X86: Read the LDT/GDT at CPL0 when executing an iret.
During iret access LDT/GDT at CPL0 rather than after transition to user mode
(if I'm reading the Intel IA-64 architecture spec correctly, the contents of
the descriptor table are read before the CPL is updated).
Nilay Vaish [Mon, 7 Feb 2011 18:42:23 +0000 (12:42 -0600)]
Orion: Replace printf() with fatal()
The code for Orion 2.0 makes use of printf() at several places where there as
an error in configuration of the model. These have been replaced with fatal().
Korey Sewell [Mon, 7 Feb 2011 17:19:46 +0000 (12:19 -0500)]
ruby: add stdio header in SRAM.hh
missing header file caused RUBY_FS to not compile
Gabe Black [Mon, 7 Feb 2011 09:23:16 +0000 (01:23 -0800)]
X86: Add stats for the new x86 fs regressions.
Gabe Black [Mon, 7 Feb 2011 09:23:02 +0000 (01:23 -0800)]
X86: Add scripts to support X86 FS configurations in the regressions.
Gabe Black [Mon, 7 Feb 2011 09:22:15 +0000 (01:22 -0800)]
X86, Config: Move the setting of work count options to a separate function.
This way things that don't care about work count options and/or aren't called
by something that has those command line options set up doesn't have to build
a fake object to carry in inert values.
Gabe Black [Mon, 7 Feb 2011 09:21:21 +0000 (01:21 -0800)]
X86: Fix compiling vtophys.cc
Brad Beckmann [Mon, 7 Feb 2011 06:14:23 +0000 (22:14 -0800)]
regress: Regression Tester output updates
Brad Beckmann [Mon, 7 Feb 2011 06:14:19 +0000 (22:14 -0800)]
ruby: support to stallAndWait the mandatory queue
By stalling and waiting the mandatory queue instead of recycling it, one can
ensure that no incoming messages are starved when the mandatory queue puts
signficant of pressure on the L1 cache controller (i.e. the ruby memtester).
--HG--
rename : src/mem/slicc/ast/WakeUpDependentsStatementAST.py => src/mem/slicc/ast/WakeUpAllDependentsStatementAST.py
Brad Beckmann [Mon, 7 Feb 2011 06:14:19 +0000 (22:14 -0800)]
ruby: minor fix to deadlock panic message
Brad Beckmann [Mon, 7 Feb 2011 06:14:19 +0000 (22:14 -0800)]
boot: script that creates a checkpoint after Linux boot up
Joel Hestness [Mon, 7 Feb 2011 06:14:19 +0000 (22:14 -0800)]
garnet: Split network power in ruby.stats
Split out dynamic and static power numbers for printing to ruby.stats
Brad Beckmann [Mon, 7 Feb 2011 06:14:19 +0000 (22:14 -0800)]
MOESI_hammer: fixed dir bug counting received acks
Brad Beckmann [Mon, 7 Feb 2011 06:14:19 +0000 (22:14 -0800)]
ruby: numa bit fix for sparse memory
Tushar Krishna [Mon, 7 Feb 2011 06:14:19 +0000 (22:14 -0800)]
MOESI_CMP_token: removed unused message fields
Brad Beckmann [Mon, 7 Feb 2011 06:14:19 +0000 (22:14 -0800)]
mem: Added support for Null data packet
The packet now identifies whether static or dynamic data has been allocated and
is used by Ruby to determine whehter to copy the data pointer into the ruby
request. Subsequently, Ruby can be told not to update phys memory when
receiving packets.
Brad Beckmann [Mon, 7 Feb 2011 06:14:19 +0000 (22:14 -0800)]
m5: added work completed monitoring support
Brad Beckmann [Mon, 7 Feb 2011 06:14:18 +0000 (22:14 -0800)]
dev: fixed bugs to extend interrupt capability beyond 15 cores
Joel Hestness [Mon, 7 Feb 2011 06:14:18 +0000 (22:14 -0800)]
x86: Timing support for pagetable walker
Move page table walker state to its own object type, and make the
walker instantiate state for each outstanding walk. By storing the
states in a queue, the walker is able to handle multiple outstanding
timing requests. Note that functional walks use separate state
elements.
Joel Hestness [Mon, 7 Feb 2011 06:14:18 +0000 (22:14 -0800)]
TimingSimpleCPU: split data sender state fix
In sendSplitData, keep a pointer to the senderState that may be updated after
the call to handle*Packet. This way, if the receiver updates the packet
senderState, it can still be accessed in sendSplitData.
Brad Beckmann [Mon, 7 Feb 2011 06:14:18 +0000 (22:14 -0800)]
ruby: Fix RubyPort to properly handle retrys
Joel Hestness [Mon, 7 Feb 2011 06:14:18 +0000 (22:14 -0800)]
Ruby: Fix to return cache block size to CPU for split data transfers
Joel Hestness [Mon, 7 Feb 2011 06:14:18 +0000 (22:14 -0800)]
Ruby: Add support for locked memory accesses in X86_FS
Joel Hestness [Mon, 7 Feb 2011 06:14:18 +0000 (22:14 -0800)]
Ruby: Update the Ruby request type names for LL/SC
Brad Beckmann [Mon, 7 Feb 2011 06:14:18 +0000 (22:14 -0800)]
ruby: Assert for x86 misaligned access
This patch ensures only aligned access are passed to ruby and includes a fix
to the DPRINTF address print.