2f985bd2e7f9a3297dddcb03d7663c5f7c0fe28c
[binutils-gdb.git] / sim / README-HACKING
1 This is a loose collection of notes for people hacking on simulators.
2 If this document gets big enough it can be prettied up then.
3
4 Contents
5
6 - The "common" directory
7 - Common Makefile Support
8 - TAGS support
9 - Generating "configure" files
10 - C Language Assumptions
11 - "dump" commands under gdb
12 \f
13 The "common" directory
14 ======================
15
16 The common directory contains:
17
18 - common documentation files (e.g. run.1, and maybe in time .texi files)
19 - common source files (e.g. run.c)
20 - common Makefile fragment and configury (e.g. Make-common.in, aclocal.m4).
21
22 In addition "common" contains portions of the system call support
23 (e.g. callback.c, nltvals.def).
24
25 Even though no files are built in this directory, it is still configured
26 so support for regenerating nltvals.def is present.
27 \f
28 Common Makefile Support
29 =======================
30
31 A common configuration framework is available for simulators that want
32 to use it. The common framework exists to remove a lot of duplication
33 in configure.ac and Makefile.in, and it also provides a foundation for
34 enhancing the simulators uniformly (e.g. the more they share in common
35 the easier a feature added to one is added to all).
36
37 The configure.ac of a simulator using the common framework should look like:
38
39 --- snip ---
40 dnl Process this file with autoconf to produce a configure script.
41 sinclude(../common/aclocal.m4)
42 AC_INIT(Makefile.in)
43
44 SIM_AC_COMMON
45
46 ... target specific additions ...
47
48 SIM_AC_OUTPUT
49 --- snip ---
50
51 SIM_AC_COMMON:
52
53 - invokes the autoconf macros most often used by the simulators
54 - defines --enable/--with options usable by all simulators
55 - initializes sim_link_files/sim_link_links as the set of symbolic links
56 to set up
57
58 SIM_AC_OUTPUT:
59
60 - creates the symbolic links defined in sim_link_{files,links}
61 - creates config.h
62 - creates the Makefile
63
64 The Makefile.in of a simulator using the common framework should look like:
65
66 --- snip ---
67 # Makefile for blah ...
68 # Copyright blah ...
69
70 ## COMMON_PRE_CONFIG_FRAG
71
72 # These variables are given default values in COMMON_PRE_CONFIG_FRAG.
73 # We override the ones we need to here.
74 # Not all of these need to be mentioned, only the necessary ones.
75 # In fact it is better to *not* mention ones if the value is the default.
76
77 # List of object files, less common parts.
78 SIM_OBJS =
79 # List of extra dependencies.
80 # Generally this consists of simulator specific files included by sim-main.h.
81 SIM_EXTRA_DEPS =
82 # List of flags to always pass to $(CC).
83 SIM_EXTRA_CFLAGS =
84 # List of extra libraries to link with.
85 SIM_EXTRA_LIBS =
86 # List of extra program dependencies.
87 SIM_EXTRA_LIBDEPS =
88 # Dependency of `all' to build any extra files.
89 SIM_EXTRA_ALL =
90 # Dependency of `install' to install any extra files.
91 SIM_EXTRA_INSTALL =
92 # Dependency of `clean' to clean any extra files.
93 SIM_EXTRA_CLEAN =
94
95 ## COMMON_POST_CONFIG_FRAG
96
97 # Rules need to build $(SIM_OBJS), plus whatever else the target wants.
98
99 ... target specific rules ...
100 --- snip ---
101
102 COMMON_{PRE,POST}_CONFIG_FRAG are markers for SIM_AC_OUTPUT to tell it
103 where to insert the two pieces of common/Make-common.in.
104 The resulting Makefile is created by doing autoconf substitions on
105 both the target's Makefile.in and Make-common.in, and inserting
106 the two pieces of Make-common.in into the target's Makefile.in at
107 COMMON_{PRE,POST}_CONFIG_FRAG.
108
109 Note that SIM_EXTRA_{INSTALL,CLEAN} could be removed and "::" targets
110 could be used instead. However, it's not clear yet whether "::" targets
111 are portable enough.
112 \f
113 TAGS support
114 ============
115
116 Many files generate program symbols at compile time.
117 Such symbols can't be found with grep nor do they normally appear in
118 the TAGS file. To get around this, source files can add the comment
119
120 /* TAGS: foo1 foo2 */
121
122 where foo1, foo2 are program symbols. Symbols found in such comments
123 are greppable and appear in the TAGS file.
124 \f
125 Generating "configure" files
126 ============================
127
128 For targets using the common framework, "configure" can be generated
129 by running `autoconf'.
130
131 To regenerate the configure files for all targets using the common framework:
132
133 $ cd devo/sim
134 $ make -f Makefile.in SHELL=/bin/sh autoconf-common
135
136 To add a change-log entry to the ChangeLog file for each updated
137 directory (WARNING - check the modified new-ChangeLog files before
138 renaming):
139
140 $ make -f Makefile.in SHELL=/bin/sh autoconf-changelog
141 $ more */new-ChangeLog
142 $ make -f Makefile.in SHELL=/bin/sh autoconf-install
143
144 In a similar vein, both the configure and config.in files can be
145 updated using the sequence:
146
147 $ cd devo/sim
148 $ make -f Makefile.in SHELL=/bin/sh autoheader-common
149 $ make -f Makefile.in SHELL=/bin/sh autoheader-changelog
150 $ more */new-ChangeLog
151 $ make -f Makefile.in SHELL=/bin/sh autoheader-install
152
153 To add the entries to an alternative ChangeLog file, use:
154
155 $ make ChangeLog=MyChangeLog ....
156
157 \f
158 C Language Assumptions
159 ======================
160
161 An ISO C11 compiler is required, as is an ISO C standard library.
162 \f
163 "dump" commands under gdb
164 =========================
165
166 gdbinit.in contains the following
167
168 define dump
169 set sim_debug_dump ()
170 end
171
172 Simulators that define the sim_debug_dump function can then have their
173 internal state pretty printed from gdb.
174
175 FIXME: This can obviously be made more elaborate. As needed it will be.
176 \f
177 Rebuilding nltvals.def
178 ======================
179
180 Checkout a copy of the SIM and LIBGLOSS modules (Unless you've already
181 got one to hand):
182
183 $ mkdir /tmp/$$
184 $ cd /tmp/$$
185 $ cvs checkout sim-no-testsuite libgloss-no-testsuite newlib-no-testsuite
186
187 Configure things for an arbitrary simulator target (I've d10v for
188 convenience):
189
190 $ mkdir /tmp/$$/build
191 $ cd /tmp/$$/build
192 $ /tmp/$$/devo/configure --target=d10v-elf
193
194 In the sim/ directory rebuild the headers:
195
196 $ cd sim/
197 $ make nltvals
198
199 To add a new target:
200
201 devo/sim/common/gennltvals.sh
202
203 Add your new processor target (you'll need to grub
204 around to find where your syscall.h lives).
205
206 devo/sim/<processor>/Makefile.in
207
208 Add the definition:
209
210 ``NL_TARGET = -DNL_TARGET_d10v''
211
212 just before the line COMMON_POST_CONFIG_FRAG.
213
214 devo/sim/<processor>/*.[ch]
215
216 Include targ-vals.h instead of syscall.h.
217 \f
218 Tracing
219 =======
220
221 For ports based on CGEN, tracing instrumentation should largely be for free,
222 so we will cover the basic non-CGEN setup here. The assumption is that your
223 target is using the common autoconf macros and so the build system already
224 includes the sim-trace configure flag.
225
226 The full tracing API is covered in sim-trace.h, so this section is an overview.
227
228 Before calling any trace function, you should make a call to the trace_prefix()
229 function. This is usually done in the main sim_engine_run() loop before
230 simulating the next instruction. You should make this call before every
231 simulated insn. You can probably copy & paste this:
232 if (TRACE_ANY_P (cpu))
233 trace_prefix (sd, cpu, NULL_CIA, oldpc, TRACE_LINENUM_P (cpu), NULL, 0, "");
234
235 You will then need to instrument your simulator code with calls to the
236 trace_generic() function with the appropriate trace index. Typically, this
237 will take a form similar to the above snippet. So to trace instructions, you
238 would use something like:
239 if (TRACE_INSN_P (cpu))
240 trace_generic (sd, cpu, TRACE_INSN_IDX, "NOP;");
241
242 The exact output format is up to you. See the trace index enum in sim-trace.h
243 to see the different tracing info available.
244
245 To utilize the tracing features at runtime, simply use the --trace-xxx flags.
246 run --trace-insn ./some-program
247 \f
248 Profiling
249 =========
250
251 Similar to the tracing section, this is merely an overview for non-CGEN based
252 ports. The full API may be found in sim-profile.h. Its API is also similar
253 to the tracing API.
254
255 Note that unlike the tracing command line options, in addition to the profile
256 flags, you have to use the --verbose option to view the summary report after
257 execution. Tracing output is displayed on the fly, but the profile output is
258 only summarized.
259
260 To profile core accesses (such as data reads/writes and insn fetches), add
261 calls to PROFILE_COUNT_CORE() to your read/write functions. So in your data
262 fetch function, you'd use something like:
263 PROFILE_COUNT_CORE (cpu, target_addr, size_in_bytes, map_read);
264 Then in your data write function:
265 PROFILE_COUNT_CORE (cpu, target_addr, size_in_bytes, map_write);
266 And in your insn fetcher:
267 PROFILE_COUNT_CORE (cpu, target_addr, size_in_bytes, map_exec);
268
269 To use the PC profiling code, you simply have to tell the system where to find
270 your simulator's PC. So in your model initialization function:
271 CPU_PC_FETCH (cpu) = function_that_fetches_the_pc;
272
273 To profile branches, in every location where a branch insn is executed, call
274 one of the related helpers:
275 PROFILE_BRANCH_TAKEN (cpu);
276 PROFILE_BRANCH_UNTAKEN (cpu);
277 If you have stall information, you can utilize the other helpers too.
278 \f
279 Environment Simulation
280 ======================
281
282 The simplest simulator doesn't include environment support -- it merely
283 simulates the Instruction Set Architecture (ISA). Once you're ready to move
284 on to the next level, call the common macro in your configure.ac:
285 SIM_AC_OPTION_ENVIRONMENT
286
287 This will support for the user, virtual, and operating environments. See the
288 sim-config.h header for a more detailed description of them. The former are
289 pretty straight forward as things like exceptions (making system calls) are
290 handled in the simulator. Which is to say, an exception does not trigger an
291 exception handler in the simulator target -- that is what the operating env
292 is about. See the following userspace section for more information.
293 \f
294 Userspace System Calls
295 ======================
296
297 By default, the libgloss userspace is simulated. That means the system call
298 numbers and calling convention matches that of libgloss. Simulating other
299 userspaces (such as Linux) is pretty straightforward, but let's first focus
300 on the basics. The basic API is covered in include/gdb/callback.h.
301
302 When an instruction is simulated that invokes the system call method (such as
303 forcing a hardware trap or exception), your simulator code should set up the
304 CB_SYSCALL data structure before calling the common cb_syscall() function.
305 For example:
306 static int
307 syscall_read_mem (host_callback *cb, struct cb_syscall *sc,
308 unsigned long taddr, char *buf, int bytes)
309 {
310 SIM_DESC sd = (SIM_DESC) sc->p1;
311 SIM_CPU *cpu = (SIM_CPU *) sc->p2;
312 return sim_core_read_buffer (sd, cpu, read_map, buf, taddr, bytes);
313 }
314 static int
315 syscall_write_mem (host_callback *cb, struct cb_syscall *sc,
316 unsigned long taddr, const char *buf, int bytes)
317 {
318 SIM_DESC sd = (SIM_DESC) sc->p1;
319 SIM_CPU *cpu = (SIM_CPU *) sc->p2;
320 return sim_core_write_buffer (sd, cpu, write_map, buf, taddr, bytes);
321 }
322 void target_sim_syscall (SIM_CPU *cpu)
323 {
324 SIM_DESC sd = CPU_STATE (cpu);
325 host_callback *cb = STATE_CALLBACK (sd);
326 CB_SYSCALL sc;
327
328 CB_SYSCALL_INIT (&sc);
329
330 sc.func = <fetch system call number>;
331 sc.arg1 = <fetch first system call argument>;
332 sc.arg2 = <fetch second system call argument>;
333 sc.arg3 = <fetch third system call argument>;
334 sc.arg4 = <fetch fourth system call argument>;
335 sc.p1 = (PTR) sd;
336 sc.p2 = (PTR) cpu;
337 sc.read_mem = syscall_read_mem;
338 sc.write_mem = syscall_write_mem;
339
340 cb_syscall (cb, &sc);
341
342 <store system call result from sc.result>;
343 <store system call error from sc.errcode>;
344 }
345 Some targets store the result and error code in different places, while others
346 only store the error code when the result is an error.
347
348 Keep in mind that the CB_SYS_xxx defines are normalized values with no real
349 meaning with respect to the target. They provide a unique map on the host so
350 that it can parse things sanely. For libgloss, the common/nltvals.def file
351 creates the target's system call numbers to the CB_SYS_xxx values.
352
353 To simulate other userspace targets, you really only need to update the maps
354 pointers that are part of the callback interface. So create CB_TARGET_DEFS_MAP
355 arrays for each set (system calls, errnos, open bits, etc...) and in a place
356 you find useful, do something like:
357
358 ...
359 static CB_TARGET_DEFS_MAP cb_linux_syscall_map[] = {
360 # define TARGET_LINUX_SYS_open 5
361 { CB_SYS_open, TARGET_LINUX_SYS_open },
362 ...
363 { -1, -1 },
364 };
365 ...
366 host_callback *cb = STATE_CALLBACK (sd);
367 cb->syscall_map = cb_linux_syscall_map;
368 cb->errno_map = cb_linux_errno_map;
369 cb->open_map = cb_linux_open_map;
370 cb->signal_map = cb_linux_signal_map;
371 cb->stat_map = cb_linux_stat_map;
372 ...
373
374 Each of these cb_linux_*_map's are manually declared by the arch target.
375
376 The target_sim_syscall() example above will then work unchanged (ignoring the
377 system call convention) because all of the callback functions go through these
378 mapping arrays.
379 \f
380 Events
381 ======
382
383 Events are scheduled and executed on behalf of either a cpu or hardware devices.
384 The API is pretty much the same and can be found in common/sim-events.h and
385 common/hw-events.h.
386
387 For simulator targets, you really just have to worry about the schedule and
388 deschedule functions.
389 \f
390 Device Trees
391 ============
392
393 The device tree model is based on the OpenBoot specification. Since this is
394 largely inherited from the psim code, consult the existing psim documentation
395 for some in-depth details.
396 http://sourceware.org/psim/manual/
397 \f
398 Hardware Devices
399 ================
400
401 The simplest simulator doesn't include hardware device support. Once you're
402 ready to move on to the next level, call the common macro in your configure.ac:
403 SIM_AC_OPTION_HARDWARE(yes,,devone devtwo devthree)
404
405 The basic hardware API is documented in common/hw-device.h.
406
407 Each device has to have a matching file name with a "dv-" prefix. So there has
408 to be a dv-devone.c, dv-devtwo.c, and dv-devthree.c files. Further, each file
409 has to have a matching hw_descriptor structure. So the dv-devone.c file has to
410 have something like:
411 const struct hw_descriptor dv_devone_descriptor[] = {
412 {"devone", devone_finish,},
413 {NULL, NULL},
414 };
415
416 The "devone" string as well as the "devone_finish" function are not hard
417 requirements, just common conventions. The structure name is a hard
418 requirement.
419
420 The devone_finish() callback function is used to instantiate this device by
421 parsing the corresponding properties in the device tree.
422
423 Hardware devices typically attach address ranges to themselves. Then when
424 accesses to those addresses are made, the hardware will have its callback
425 invoked. The exact callback could be a normal I/O read/write access, as
426 well as a DMA access. This makes it easy to simulate memory mapped registers.
427
428 Keep in mind that like a proper device driver, it may be instantiated many
429 times over. So any device state it needs to be maintained should be allocated
430 during the finish callback and attached to the hardware device via set_hw_data.
431 Any hardware functions can access this private data via the hw_data function.
432 \f
433 Ports (Interrupts / IRQs)
434 =========================
435
436 First, a note on terminology. A "port" is an aspect of a hardware device that
437 accepts or generates interrupts. So devices with input ports may be the target
438 of an interrupt (accept it), and/or they have output ports so that they may be
439 the source of an interrupt (generate it).
440
441 Each port has a symbolic name and a unique number. These are used to identify
442 the port in different contexts. The output port name has no hard relationship
443 to the input port name (same for the unique number). The callback that accepts
444 the interrupt uses the name/id of its input port, while the generator function
445 uses the name/id of its output port.
446
447 The device tree is used to connect the output port of a device to the input
448 port of another device. There are no limits on the number of inputs connected
449 to an output, or outputs to an input, or the devices attached to the ports.
450 In other words, the input port and output port could be the same device.
451
452 The basics are:
453 - each hardware device declares an array of ports (hw_port_descriptor).
454 any mix of input and output ports is allowed.
455 - when setting up the device, attach the array (set_hw_ports).
456 - if the device accepts interrupts, it will have to attach a port callback
457 function (set_hw_port_event)
458 - connect ports with the device tree
459 - handle incoming interrupts with the callback
460 - generate outgoing interrupts with hw_port_event