Tool-MX-Vest

Implementation: Simulation Technology

The ``simulation technology'' is how the original machine instructions (or other source representation) gets translated into an executable representation that is suitable for simulation and/or tracing. Choices include:

ddi: Decode-and-dispatch interpretation: the input representation for an operation is fetched and decoded each time it is executed.
pdi: Predecode interpretation: the input form is translated into a form that is faster to decode; that form is then saved so that successive invocations (e.g. subsequent iterations of a loop) need only fetch and decode the ``fast'' form. Note that
- The translation may happen before program invocation, during startup, or incrementally during execution; and that the translated form may be discarded and regenerated.
- If the original instructions change, the translated form becomes incoherent with the original representation; a system that fails to update (invalidate) the translated form before it is then reexecuted will simulate the old instructions instead of the new ones. For some systems (e.g., those with hardware coherent instruction caches) such behavior is erronious.
tci: Threaded code interpretation: a particularly common and efficient form of predecode interpretation.
scc: Static cross-compilation: The input form is statically (before program execution) translated from the target instruction set to the host instruction set. Note that:
- All translation costs are paid statically, so runtime efficiency may be very good. In contrast, dynamic analysis and transformation costs are paid during simulation, and so it may be necessary to ``cut corners'' with dynamic translation in order to manage the runtime cost. Cutting corners may affect both the quality of analysis of the original program and the quality of code generation.
- Instructions that cannot be located statically or which do not exist until runtime cannot be translated statically.
- Historically, it is difficult to distinguish between memory words that are used for instructions and those that are used for data; translating data as instructions may cause errors.
- Translating to machine code allows the use of the host hardware's instruction fetch/decode/dispatch hardware to help simulate the target's.
- Translating to machine code makes it easier to translate clumps of host instructions; most dispatching between target instructions is thus eliminated.
dcc: Dynamic Cross Compilation: Host machine code is generated dynamically, as the program runs. Note that:
- Translating ``on demand'' eases the problem of determining what is code and what is data; a given word may even be used as both code and data.
- Translating to machine code is often more expensive than translating to other representations; both the cost of generating the machine code and the cost of executing it contribute to the overall execution time.
- Theoretical performance advantages from dynamic cross-compilation may be overwhelmed by the host's increased cache miss ratio due to dynamic cross-compilation's larger code sizes [Pittman 95].
aug: Augmentation: cross-compilation where the host and target are the same machine. Note that
- Augmentation is typically done statically.
- There is a fine line between having identical host and target machines (augmetnation) and having nearly-identical machines in which just a few features (e.g. memory references) are simulated, but in which the bulk of instruction sets and encodings are identical.
emu: Emulation: Where software simulation is sped up using hardware assistance. ``Hardware assistance'' might include special compatability modes but might also include careful use of page mappings. (See ``emulation''.)

From instruction-set simulation and tracing