Getting Started with PTLsim/X

NOTE: This part of the manual is relevant only if you are using the full-system PTLsim/X. If you are looking for the userspace-only version, please skip this entire part and read Part II instead.

WARNING: PTLsim/X assumes a fairly high level of familiarity with Xen and the Linux kernel. If you have never compiled your own Linux kernel, are not yet running Xen, or are unsure how to create and use domains, STOP NOW and become familiar with Xen itself before attempting to use PTLsim/X. The following sections all assume you are familiar with Xen, at least from a system administration perspective. We cannot provide support for Xen-related issues unless they are caused by PTLsim.

Building PTLsim/X

Prerequisites:

PTLsim/X requires a modern 64-bit x86-64 machine. This means an AMD Athlon 64 / Opteron / Turion or an Intel Pentium 4 (specifically with EM64T) or Intel Core 2. We do not plan to offer a 32-bit version of PTLsim/X due to the technical deficiencies in 32-bit x86 that make it difficult to properly implement a full system simulator with all of PTLsim's features. Besides, 64-bit hardware is now the standard (in some cases the only option) from all the major x86 processor vendors and is very affordable.
The 64-bit requirement only applies to the host system running PTLsim/X. Inside the virtual machine, you are still free to use standard 32-bit Linux distributions, applications and so forth under PTLsim/X.
PTLsim/X assumes you have root access to your machine. The PTLsim/X hypervisor runs below Linux itself, so you must use a Xen compatible kernel in domain 0 (more on this later).
We highly recommend you use a Linux distribution already designed to work with Xen 3.x. We use SuSE 10.2 and highly recommend it; most other distributions now support Xen. This requirement only applies to domain 0 - the virtual machines you'll be running can use any distribution and do not even need to know about Xen at all (other than the kernel, which must support Xen hypercalls and block/network drivers).
We have successfully built PTLsim/X with gcc 4.1.x+ (gcc 4.0.x has documented bugs affecting some of our code).

Quick Start Steps:

All files listed below can be downloaded from https://ptlsim.org/download.php.

IMPORTANT: The instructions below refer to specific versions of various files (e.g., the Xen hypervisor and the Linux kernel). We regularly update the versions of these files, and newer PTLsim/X versions may not work correctly with older kernel and/or hypervisor versions (that is, the versions must match). The following instructions are therefore for informational purposes only; always check the PTLsim web site's download page for the latest versions of these files. The following versions are correct as of September 20th, 2007.

Set up Xen with PTLsim/X extensions:
- Download our modified Xen source tree (xen-3.1-ptlsim.tar.bz2) from https://ptlsim.org/download.php. This is the easiest way to make sure you have the correct PTLsim-compatible version of Xen with all patches pre-applied.
  - We also provide ptlsim-xen-hypervisor.diff in case you want to manually apply the patches to a development version of Xen; the patches are fairly simple and can be adapted as needed.
- Build and install both the Xen hypervisor and the userspace Xen tools:
  - In xen-3.1-ptlsim/xen, run make. You can optionally copy the compiled Xen hypervisor (in xen/xen) somewhere else (such as wherever your kernel and initrd files are stored).
  - In xen-3.1-ptlsim/tools, run make, then run make install.
- Download our sample kernel and modules (linux-2.6.22.6-mtyrel-64bit-xen.tar.bz2) and extract the archive in the root directory (via tar jxvf linux-2.6.22.6-mtyrel-64bit-xen.tar.bz2) to create /lib/modules/2.6.22.6-mtyrel-64bit-xen/....
  - This is an SMP kernel based on 2.6.22.6 with the Xen patches maintained by SuSE Linux. The complete source is in linux-2.6.22-mtyrel-source.tar.gz, if you want to recompile it.
  - This is just a sample kernel we use - PTLsim/X should work even if you use the Xen-compatible kernel shipped with your Linux distribution of choice. However, we recommend you run this same kernel in domain 0 as well as in the target domain under simulation, simply because we know it works correctly and has all the latest Xen patches.
  - In addition, our kernels feature the ability to create Xen checkpoints and initiate PTLsim actions from within the domain by writing to /proc/xen/checkpoint and /proc/xen/ptlsim, respectively. The major changes are in linux-2.6.22-mtyrel/patches.mty/linux-2.6.22-xen-self-checkpointing.diff if you want to apply them to a different kernel or learn how they work.
- Activate the new Xen hypervisor and kernel:
  - Install the new kernel and Xen hypervisor in a manner specific to your distribution. While we cannot provide instructions for every distribution, on SuSE, you need to run mkinitrd to collect the required boot drivers like this:
    
    mkinitrd -k /lib/modules/2.6.22.6-mtyrel-64bit-xen/linux \
        -i /lib/modules/2.6.22.6-mtyrel-64bit-xen/initrd \
        -M /lib/modules/2.6.22.6-mtyrel-64bit-xen/System.map
    
    IMPORTANT: This is a single command. The trailing backslashes continue it across lines; you can also type it all on one line.
  - Edit the GRUB bootloader configuration (usually in /boot/grub/menu.lst on most distributions) to specify the new Xen-enabled kernel and hypervisor. The first entry should be similar to:
    
    title Linux 2.6.22.6-mtyrel-64bit-xen
    kernel (hd0,0)/project/xen-3.1-ptlsim/xen/xen console=vga
    module (hd0,0)/lib/modules/2.6.22.6-mtyrel-64bit-xen/linux root=/dev/...
    module (hd0,0)/lib/modules/2.6.22.6-mtyrel-64bit-xen/initrd
    
    Obviously you may need to adjust the file locations, if you're booting from a different hard drive or compiled Xen in a location other than /project/xen-3.1-ptlsim.
- Reboot, and make sure the PTLsim/X extensions to Xen are actually running: ``cat /sys/hypervisor/properties/capabilities'' should list ``ptlsim''. If this file doesn't exist, you're not running under Xen at all.
Set up sample virtual machine and disk images:
- Download our pre-configured example disk image (ptlsim-disk-image-example.tar.bz2) and uncompress with tar jxvf ptlsim-disk-image-example.tar.bz2. The sample scripts inside this archive assume that the files were extracted into /project/ptlsim-disk-image-example.
  - We recommend placing this disk image on a local hard disk rather than NFS. However, if you're running Cluster NFS and/or are using the no_root_squash NFS option, it's perfectly fine if you put the disk image on an NFS volume.
- You already downloaded our Xen-compatible kernel above.
- The disk image archive contains a sample Xen configuration file (sample-xen-domain) and some helpful scripts (e.g. run-domain, restore-domain, etc.)
- Make sure you can create this domain with ``xm create sample-xen-domain -c''. You should get a console with the text ``Welcome to the PTLsim Demo Machine''.
Set up PTLsim itself:
- Download the stable version of PTLsim from our web site (in ptlsim-2007xxxx-rXXX.tar.gz) and unpack this file to create the ptlsim directory.
- Edit the PTLsim Makefile and uncomment the ``PTLSIM_HYPERVISOR=1'' line to enable full system PTLsim/X support.
- Run make.
  - If the build process complains about missing header files, make sure /usr/include/xen is a symlink to /project/xen-3.1-ptlsim/tools/libxc/xen (or wherever you put the PTLsim-modified xen-3.1-ptlsim tree you downloaded). Delete /usr/include/xen beforehand if needed.
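The post-reboot capability check and the header symlink fix from the steps above can be scripted. The following is a minimal sketch and not part of PTLsim: the function names has_ptlsim and link_xen_headers and their optional arguments are hypothetical, and the default paths are simply the ones named in the text.

```shell
# Hypothetical helpers; names and arguments are illustrative only.

# Succeeds if the capabilities file lists "ptlsim".
# Returns 2 if the file is missing (not running under Xen at all).
has_ptlsim() {
    caps_file="${1:-/sys/hypervisor/properties/capabilities}"
    if [ ! -r "$caps_file" ]; then
        echo "$caps_file not found: not running under Xen" >&2
        return 2
    fi
    grep -q ptlsim "$caps_file"
}

# Points an include path at the Xen headers from the PTLsim tree.
# An existing symlink is replaced; a real directory is left untouched.
link_xen_headers() {
    target="${1:-/project/xen-3.1-ptlsim/tools/libxc/xen}"
    link="${2:-/usr/include/xen}"
    if [ ! -d "$target" ]; then
        echo "$target is not a directory" >&2
        return 1
    fi
    if [ -L "$link" ]; then
        rm -f "$link"
    elif [ -e "$link" ]; then
        echo "$link exists and is not a symlink; move it aside first" >&2
        return 1
    fi
    ln -s "$target" "$link"
}
```

After rebooting, running has_ptlsim with no arguments checks the default capabilities file. With the default paths, link_xen_headers needs root privileges.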

Running PTLsim

PTLsim is run in domain 0 as root, for instance by using the ``sudo ptlsim ...'' command. The -domain N option is used to specify the domain to access. The following scenarios show by example how this is done.

Booting Linux under PTLsim

In the following examples, we will assume the target domain is called ptlvm.

Start your domain as follows:

sudo xm create domainname -paused

sudo xm list

sudo xm console domainname

The -paused option tells Xen to pause the domain as soon as it's created, so we can run the entire boot process under PTLsim.

The xm list command will print the domain ID assigned to ptlvm. On our test machine, the output looks like:

yourst [typhoon /project/ptlsim] sudo xm create ptlvm -paused; sudo xm list; sudo xm console ptlvm;
Using config file "ptlvm".
Started domain ptlvm
Name          ID  Mem(MiB)  VCPUs  State  Time(s)
Domain-0       0      7877      4  r---     137.9
ptlvm         21       128      1  -p--       0.0
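Since later commands refer to the domain by its ID, the ID can be pulled out of the listing automatically. This is a small sketch that assumes the column layout shown above (name first, ID second); domain_id is a hypothetical helper name, not an xm or PTLsim command.

```shell
# Illustrative only: prints the ID column for the named domain from
# "xm list"-style text on standard input.
# Exits non-zero if the domain is not listed.
domain_id() {
    awk -v name="$1" '
        $1 == name { print $2; found = 1 }
        END        { exit !found }
    '
}
```

For the listing above, ``sudo xm list | domain_id ptlvm'' would print 21.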

You may also want to give the PTLsim domain a low priority; otherwise it may cause the system to respond slowly. This can be done by running:

sudo xm sched-credit -d ptlvm -w 16

Open another console and start PTLsim on this domain (using the domain ID ``21'' given in the example above):

sudo ./ptlsim -domain 21 -logfile ptlsim.log -native

The resulting output:

// PTLsim: Cycle Accurate x86-64 Full System Simulator
// Revision 225 (2007-09-21)
// Built Sep 21 2007 16:21:36 on tidalwave.lab.ptlsim.org using gcc-4.2
// Running on typhoon.lab.ptlsim.org

Processing -domain 21 -logfile ptlsim.log -native

System Information:
  Running on hypervisor version xen-3.0-x86_64-ptlsim
  Xen is mapped at virtual address 0xffff800000000000
  PTLsim is running across 1 VCPUs:
    VCPU 0: 2202 MHz
Memory Layout:
  System: 524208 pages, 2096832 KB
  Domain: 32768 pages, 131072 KB
  PTLsim reserved: 8192 pages, 32768 KB
  Page Tables: 275 pages, 1100 KB
  PTLsim image: 407 pages, 1628 KB
  Heap: 7510 pages, 30040 KB
  Stack: 256 pages, 1024 KB
Interfaces:
  PTLsim page table: 282898
  Shared info mfn: 4056
  Shadow shinfo mfn: 295164
  PTLsim hostcall: event channel 3
  PTLsim upcall: event channel 4

Switched to native mode
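The page and KB figures in the memory layout are related by the 4 KB x86-64 page size, which gives a quick consistency check on the output. A sketch in shell arithmetic follows; pages_to_kb is a name made up for this illustration.

```shell
# Illustrative arithmetic only; pages_to_kb is not a PTLsim tool.
PAGE_KB=4   # standard x86-64 page size in KB

pages_to_kb() {
    echo $(( $1 * PAGE_KB ))
}

pages_to_kb 8192                # PTLsim reserved region, in KB
echo $(( 275 + 407 + 7510 ))    # page tables + image + heap, in pages
```

The first command prints 32768, matching the reported size of the reserved region. The second prints 8192: in this sample the page tables, PTLsim image and heap together account for exactly the reserved pages.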

Back in the Xen console for the domain, you'll see the familiar Linux boot messages:

Bootdata ok (command line is nousb noide root=/dev/hda1 xencons=ttyS console=ttyS0)

Linux version 2.6.18-mtyrel-k8-64bit-xen (yourst@tidalwave) (gcc version 4.1.0 (SUSE Linux)) #2 Sun Oct 8 02:29:10 EDT 2006

BIOS-provided physical RAM map:

Xen: 0000000000000000 - 0000000008800000 (usable)

No mptable found.

Built 1 zonelists. Total pages: 34816

Kernel command line: nousb noide root=/dev/hda1 xencons=ttyS console=ttyS0

Initializing CPU#0

PID hash table entries: 1024 (order: 10, 8192 bytes)

Xen reported: 2202.808 MHz processor.

Console: colour dummy device 80x25

Dentry cache hash table entries: 32768 (order: 6, 262144 bytes)

Inode-cache hash table entries: 16384 (order: 5, 131072 bytes)

Software IO TLB disabled

Memory: 123180k/139264k available (2783k kernel code, 7728k reserved, 959k data, 184k init)

Calibrating delay using timer specific routine.. 4407.14 BogoMIPS (lpj=2203570)

...

NET: Registered protocol family 1

NET: Registered protocol family 17

VFS: Mounted root (ext2 filesystem) readonly.

Welcome to the PTLsim demo machine!

root [ptlsim /] cat /proc/cpuinfo

You'll notice how we specified the ``-native'' option to speed up the boot process by running all code on the real CPU rather than PTLsim's synthetic CPU model. Booting Linux within PTLsim is slow since the kernel often executes several billion instructions before finally presenting a command line.

Running Simulations: PTLctl

At this point, we would like to start an actual simulation run. For purposes of illustration, this run is composed of three actions:

Simulate 100 million x86 instructions using PTLsim's out of order superscalar model
Simulate another 100 million using PTLsim's sequential model. The sequential model is much faster than the out of order superscalar model, so it's useful for testing and debugging functional issues, as well as simply interacting with the domain. However, it does not collect any cycle accurate timing data. Section 9.4 gives more information on the sequential model.
Return to native mode

In the first example, we will start this run from within the running domain using ptlctl (PTLsim controller), a program supplied with PTLsim. PTLctl is actually an example program showing the use of PTLsim hypercalls (``PTL calls''), special x86 instructions that can be used to control a domain's own simulation. More information on the PTLcall API is in Section 14.4.

To conduct this simulation, the ptlctl command is used within the running virtual machine (by typing it at the domain's console); it is not run on the host system at all:

root [ptlsim /] tar zc usr lib | tar ztv > /tmp/allfiles.txt &

[1] 775

root [ptlsim /] ptlctl -core ooo -stopinsns 100m -run : -core seq -stopinsns 200m -run : -native

Sending flush and command list to PTLsim hypervisor:

-core ooo -stopinsns 100m -run

-core seq -stopinsns 200m -run

-native

PTLsim returned rc 0

root [ptlsim /]

The first command simply runs several CPU-intensive multi-threaded processes in the background for simulation purposes (in this case, compressing and uncompressing files in the virtual machine's filesystem).

The second ptlctl command submits the three simulation actions to PTLsim, separated by colons (``:'').

At the PTLsim console, the following output is produced (the cycle counters will update regularly):

...

Breakout request received from native mode

Switched to simulation mode

Returned from switch to native: now back in sim

Processing -core ooo -stopinsns 100m -run

Completed 75258330 cycles, 100000000 commits: 461819 cycles/sec, 795201, insns/sec

Processing -core seq -stopinsns 200m -run

Completed 200000000 cycles, 200000000 commits: 6941302 cycles/sec, 6941302, insns/sec

Processing -native

Switched to native mode
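The ``Completed'' lines carry enough information to derive the average number of committed instructions per cycle for each run. The sketch below assumes the field layout shown above (cycles in the second field, commits in the fourth) and that the console text has been captured with one status line per line; commits_per_cycle is a hypothetical helper name.

```shell
# Illustrative only: reads PTLsim console text on standard input and
# prints commits per cycle for every "Completed" line.
commits_per_cycle() {
    awk '$1 == "Completed" && $2 > 0 { printf "%.3f\n", $4 / $2 }'
}

printf '%s\n' \
    'Completed 75258330 cycles, 100000000 commits: 461819 cycles/sec, 795201, insns/sec' \
    'Completed 200000000 cycles, 200000000 commits: 6941302 cycles/sec, 6941302, insns/sec' |
    commits_per_cycle
```

For the two sample lines this prints 1.329 for the out of order run and 1.000 for the sequential run. The latter reflects that the sequential model collects no cycle accurate timing data.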

Notice how the command list is always terminated by a final simulation action (in this case, -native). If the command list only had one simulation run with a fixed duration, once that simulation ended, the domain would freeze, since PTLsim would pause until another command arrived. However, since the domain is frozen, the next command would never arrive: there is no way to execute the ptlctl program a second time if the domain is stopped. To avoid this sort of deadlock, ptlctl lets the user atomically submit batches of multiple commands as shown above.

This powerful capability allows ``self-directed'' simulation scripts (i.e. standard shell scripts), in which ptlctl is run immediately before starting a benchmark program, then ptlctl is run again after the program exits to end the simulation and switch back to native mode.
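A self-directed script of this kind might look like the following sketch, intended to run inside the target domain. The wrapper name simulate_benchmark and the PTLCTL override variable are assumptions made for illustration; only the ptlctl options themselves come from the text. The first call starts an open-ended run (no -stopinsns), so the domain keeps executing under simulation until the second call.

```shell
# Hypothetical wrapper: run one command under simulation, then return
# to native mode. Set PTLCTL=echo for a dry run that only prints the
# ptlctl arguments.
PTLCTL="${PTLCTL:-ptlctl}"

simulate_benchmark() {
    # Switch to the out of order core before the benchmark starts.
    $PTLCTL -core ooo -run || return 1

    # Run the benchmark command line passed as arguments.
    "$@"
    status=$?

    # Always switch back to native mode, even if the benchmark failed.
    $PTLCTL -native
    return $status
}
```

A call such as ``simulate_benchmark ./mybenchmark'' (with a benchmark of your choice) returns the benchmark's own exit status.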

PTLsim/X Options

In Section 10.3, the configuration options common to both userspace PTLsim and full system PTLsim/X were listed. PTLsim/X also introduces a number of special options only applicable to full system simulation:

Actions:

-run

Start a simulation run, using the core model specified by the -core option (the default core is ``ooo'').

-stop

Stop the simulation run currently in progress, and wait for further commands. This is generally issued from another console window.

-native

Switch the domain to native mode.

-kill

Kill the domain. This is equivalent to ``xm destroy'', but it also allows PTLsim to perform cleanup actions and flush all files before exiting.

Live Updates of Configuration Options

PTLsim/X provides the ability to send commands and modify configuration options in the running simulation from another console on the host system. This is different from how the ptlctl program is used inside the target domain to script simulations: in this case, the commands are submitted asynchronously from the host system.

For instance,

sudo ptlsim -native -domain ptlvm

will immediately switch the target domain back to native mode.

To reset the log level in the middle of a simulation run, use the following:

sudo ptlsim -domain ptlvm -loglevel 99 : -run

ptlsim: Sending request '-domain ptlvm -loglevel 99 : -run' to domain 12...OK

(This is an example only! Using -loglevel 99 will create huge log files).

Most options (such as -loglevel, -stoprip, etc.) can be updated at any time in this manner.

To end a simulation currently in progress, use this:

sudo ptlsim -domain ptlvm -kill

This will force PTLsim to cleanly exit.

Command Scripts

PTLsim supports command scripts, in which a file containing a list of commands is passed on the PTLsim command line as follows:

sudo ./ptlsim -domain name @ptlvm.cmd

where ptlvm.cmd (specified following the ``@'' operator) contains the example lines:

# Configuration options:

-logfile ptlsim.log -loglevel 4 -stats ptlsim.stats -snapshot-cycles 10m

# Run the simulation

-core seq -run -stopinsns 20m

-core ooo -run -stopinsns 100m

-native # All done (switch to native mode)

These commands are executed by PTLsim one at a time, waiting until the previous command completes before starting the next. Notice the use of comments (starting with ``#''), and how configuration options can be spread across lines if desired. This mode is very useful for specifying breakpoints using -stoprip and similar options; when the target RIP is reached, the simulation stops and the next command in the command list is executed.

Command scripts can be nested (i.e. a script can itself include other scripts using @scriptname). When multiple commands are given on the command line separated by colons (``:''), any @scriptname clauses are processed after all other commands on the command line.
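Nesting makes it possible to keep shared configuration options in one script and include it from several run scripts. The sketch below writes two such files. The file names common.cmd and run.cmd and the helper write_cmd_scripts are hypothetical, and since the text does not say how a nested @scriptname path is resolved, the example assumes PTLsim is started from the directory containing both files.

```shell
# Illustrative only: create a shared options script and a run script
# that includes it via the "@" operator.
write_cmd_scripts() {
    dir="${1:-.}"

    cat > "$dir/common.cmd" <<'EOF'
# Configuration options shared by every run
-logfile ptlsim.log -loglevel 4
-stats ptlsim.stats
EOF

    cat > "$dir/run.cmd" <<'EOF'
# Pull in the shared options, then run and return to native mode
@common.cmd
-core ooo -run -stopinsns 100m
-native
EOF
}
```

The run script would then be passed on the command line in the usual way, for example as @run.cmd after the -domain option.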

Working with Checkpoints

We maintain a tutorial on how to set up checkpoints and perform advanced checkpointing techniques at https://ptlsim.org/capswiki/index.php/SPEC_2006. Note that this address is subject to change.

Xen provides the ability to capture the state of a domain into a checkpoint file stored on disk. PTLsim can leverage this capability to start simulation from a checkpoint, avoiding the need to go through the entire boot process, and allowing precisely reproducible results across multiple simulation runs.

To create a checkpoint, boot the domain in native mode without PTLsim running, and bring the domain to the point where you would like to begin simulation. Then, in another console, run:

sudo xm save ptlvm /tmp/ptlvm.img

If you're using our sample disk images, this command will pause until you do the following from within the domain:

echo checkpoint > /proc/xen/checkpoint

This facility allows very precise checkpoint placement, even by writing to this special file from within a benchmark.

To restore the domain to that checkpoint, run:

sudo xm restore /tmp/ptlvm.img -paused

sudo xm list

sudo xm console ptlvm

PTLsim can then be started in the normal manner, by specifying -domain domainname. If the checkpoint was made while the domain waited for input (e.g. at a shell command line), you may have to press a few keys to get any response from its console.

To exit PTLsim, use ``sudo ptlsim -kill -domain X'' from another console. To abort PTLsim immediately, use Ctrl+C on the ptlsim process, then type ``xm destroy ptlvm'' to destroy the domain.

The Nature of Time

Full system simulation poses some difficult philosophical questions about the nature of time itself and the relativistic phenomenon of ``time dilation''. Specifically, if a simulator runs X times slower than the native CPU, both external interrupts and timer interrupts should theoretically be generated X times slower than in the real world. This is critical for obtaining accurate simulation results: for events like network traffic, if a real network device fed interrupts into the domain in realtime, and the simulator injected these interrupts into the simulation at the same rate, they would appear to arrive thousands of times faster than any physical network interface could deliver them. This can easily result in a livelock situation not possible in a real machine; at the very least it will deliver misleading performance results.

On the other hand, interacting with a domain running at the ``correct'' rate according to its own simulated clock can be unpleasant for users. For instance, if the ``sleep 1'' command is run in a Linux domain under PTLsim, instead of sleeping for 1 second of wall clock time (as perceived by the user), the domain will wait until 1 billion cycles have been fully simulated (assuming the simulated processor frequency is 1 GHz). This is because PTLsim keys interrupt delivery and all timers to the simulated cycle number in which the interrupt should arrive (based on the core clock frequency). In addition to being annoying, this behavior will massively confuse network applications that rely on precise timing information: a TCP/IP endpoint outside the domain will not expect packets to arrive thousands of times slower than its own realtime clock expects, resulting in retransmissions and timeouts that would never occur if both endpoints were inside the same ``time dilated'' domain.
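To put a number on this, the wall clock duration of a simulated interval is the simulated cycle count divided by the simulator's throughput. The sketch below uses the 461819 cycles/sec figure from the sample out of order run shown earlier; actual throughput varies with the host and workload, and wall_seconds is a name invented for this illustration.

```shell
# Illustrative arithmetic only.
# wall_seconds SIM_SECONDS CORE_HZ SIM_CYCLES_PER_SEC
# Prints the wall clock seconds (rounded down) needed to simulate
# SIM_SECONDS of domain time at the given core frequency.
wall_seconds() {
    echo $(( $1 * $2 / $3 ))
}

# "sleep 1" on a simulated 1 GHz core at 461819 simulated cycles/sec
wall_seconds 1 1000000000 461819
```

This prints 2165: at that simulation speed, one second inside the domain takes roughly 36 minutes of wall clock time.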

Rather than attempt to solve this philosophical dilemma, PTLsim allows users to choose the options that best suit their simulation accuracy needs. The following options control the notion of time inside the simulation:

-corefreq Hz

Specify the CPU core frequency (in Hz) reported to the domain. To specify a 2.4 GHz core, use ``-corefreq 2400m''. This option is used to calculate the number of cycles between timer interrupts, as described below.

NOTE: If you plan on switching the domain between simulation and native mode, we strongly recommend avoiding this option, to allow the host machine frequency to match the simulated frequency.

-timerfreq Hz

Specify the timer interrupt frequency in interrupts per second. By default, 100 interrupts per second are used, since this is the standard for Linux kernels.

Hint: if keyboard interaction with the domain seems slow or sluggish, this is because Linux only flushes console buffers to the screen at every clock tick. Specifying -timerfreq 1000 will greatly improve interactive response at the expense of more interrupt overhead.

-pseudo-rtc

By default, the realtime clock reported to the domain is the current time of day. This option forces the clock to reset to whatever time the domain's checkpoint (if any) was created. This may allow better cycle accurate reproducibility of random number generators, for instance.

-realtime

PTLsim normally delivers all interrupts at the time dilated rate, as described above. While this provides the most realistic simulation accuracy, it may be undesirable for some applications, particularly in networking. The -realtime option delivers external interrupts to the domain as soon as they arrive at PTLsim's interrupt handler; they are not deferred. The realtime clock reported to the domain is also not dilated; it is locked to the current wall clock time. This option does not affect the timer interrupt frequency; use the -timerfreq option to directly manipulate this.

-maskints

Do not allow any external interrupts or events to reach the domain; only the timer interrupt is delivered at the specified rate by PTLsim. This mode is necessary to provide guaranteed reproducible cycle accurate behavior across runs; it eliminates almost all non-deterministic events (like outside device interrupts) from the simulation. However, it is not very practical, since disk and network access is impossible in this mode (since the Xen disk and network drivers could never wake up the domain when data arrives). This mode is most useful for debugging starting at a checkpoint, or when using a ramdisk with pre-scripted boot actions.
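The relationship between -corefreq and -timerfreq can be illustrated with a little arithmetic. The sketch below assumes the interval between timer interrupts is simply the core frequency divided by the timer frequency; the text does not give PTLsim's exact formula, so treat this as an approximation. Both helper names are hypothetical, and only the ``m'' (million) suffix used in the examples above is handled.

```shell
# Illustrative arithmetic only; not PTLsim code.

# Expand a value such as "2400m" to 2400000000. Plain integers pass through.
expand_m() {
    case "$1" in
        *m) echo $(( ${1%m} * 1000000 )) ;;
        *)  echo "$1" ;;
    esac
}

# cycles_per_tick COREFREQ TIMERFREQ
# Assumed relationship: simulated cycles between timer interrupts.
cycles_per_tick() {
    echo $(( $(expand_m "$1") / $(expand_m "$2") ))
}

cycles_per_tick 2400m 100     # default timer frequency
cycles_per_tick 2400m 1000    # the -timerfreq 1000 hint above
```

Under that assumption, a 2.4 GHz core sees a timer interrupt every 24000000 cycles at the default rate and every 2400000 cycles with -timerfreq 1000.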

Other Options

PTLsim/X has a few additional options related to full system simulation:

-reservemem M

Reserves M megabytes of physical memory for PTLsim and its translation cache. The default is 32 MB; the valid range is from 16 MB to 512 MB. See Chapter 14 for details.

All other options in Section 10.3 (unless otherwise noted) are common to both userspace PTLsim and full system PTLsim/X.

Matt T Yourst 2007-09-26