[Indigo]<PostCedar5.2>Documentation>SpyDoc.tioga!1

SpyDoc.tioga

Written by: John Maxwell, October 20, 1983

Last Edited by: John Maxwell on December 12, 1983 11:37 am

Last Edited by: Subhana, May 30, 1984 1:58:19 pm PDT

THE CEDAR SPY

CEDAR 5.2 — FOR INTERNAL XEROX USE ONLY

Documentation for the Cedar Spy

Release as: [Indigo]<Cedar5.2>Documentation>SpyDoc.tioga

Abstract: The Cedar Spy is a tool for monitoring the performance of programs. It can measure several different aspects of performance: CPU usage, page faults, allocations, or process time. Hopefully a programmer will find that the Spy is the only tool he needs for the bulk of his performance analysis.

Attributes: user interface, Viewers, sub-classing, menus

XEROX Xerox Corporation
Palo Alto Research Center
3333 Coyote Hill Road
Palo Alto, California 94304

For Internal Xerox Use Only

Introduction.

The Cedar Spy is designed to be the main tool for analyzing the performance of programs in Cedar. It consists of of an array of means for viewing the execution of a program all presented in a single, consistent form. With the Spy, the programmer can see which procedures are consuming CPU cycles, which are causing page faults, which are using the allocator, or which are calling a particular procedure. When the programmer narrows his focus to just one process, the Spy will tell him where that process is spending its time, where it is waiting on page faults, where it is waiting on monitor locks, where it is waiting on condition variables, and when it is preempted by other processes. In addition, the programmer can measure precisely what he is interested in since the Spy provides a facility for setting breakpoints to determine where the Spy should start and stop its measurements. Hopefully a programmer will find that the Spy is the only tool he needs for the bulk of his performance analysis.

The main paradigm of the Spy is to record the call stack of an interesting process. The definition of "interesting" varies according to what the Spy is measuring. If the Spy is measuring page faults, it records the stack of the process that just page faulted. If the Spy is measuring CPU usage, it records the stack of the top-most active process at regular intervals. The only things recorded are the procedures in the call stack; parameters and local variables are not recorded.

All of the functionality provided by the Spy viewer can also be accessed programmatically through the interface "SpyClient". For documentation, see that interface.

Obtaining the Spy.

The Spy can be obtained by bringing over the appropriate files and loading "Spy.bcd". To bring over the files, type

Bringover /a /p [Indigo]<Cedar>Top>Spy.df

to the user executive. Typing "Run Spy" will cause a Spy viewer to appear at the bottom of the screen. (The Spy viewer looks like a magnifying glass laying on top of the word "Spy"). If you destroy the Spy viewer, you can obtain another one by typing "Spy" to the UserExec.

Preparing the Spy for its Mission.

The Spy provides the programmer with a number of parameters to specify what it will measure. These parameters allow the programmer to tailor the Spy somewhat to his particular needs. The parameters are changed by clicking them with the mouse; repeated clickings will cycle a parameter through all of the possible values. Most people will find that the default parameters provide a good starting point.

Watching: {CPU, process number, breakProcess, pagefaults, wordsAllocated, allocations, user breaks}

The "Watching" parameter determines what the Spy should measure.

* If "CPU" is chosen, the Spy will wake up 80 times a second (10 on a Dolphin) and record the call stack of the process that is currently running.

* If "process number: " is chosen, the Spy will monitor the process given by the user. It will keep track of what the process was doing: whether it was currently running, currently pre-empted by another process, waiting on a condition variable, waiting on a monitor lock, or waiting on a page fault.

* If "breakProcess" is chosen, the Spy will monitor the first process that causes a start break. This is useful when the user doesn't know which process is going to run. Otherwise, this is just like the previous mode.

* If "pagefaults" is chosen, the Spy will record the call stack of the process that pagefaulted.

* The difference between "allocations" and "wordsAllocated" is that in the former 1 count is recorded for each allocation and in the second 1 count is recorded for each word allocated, which would be the same a recording n counts for the allocation, where n is the number of words allocated.

* If "user breaks" is chosen, the Spy will record data each time it encounters a breakpoint set using "SetUserBreak!".

SetUserBreak! ClearUserBreaks!

These two commands allow the user to specify breakpoints that will cause the Spy to log data. The breaks are only enabled while the Spy is running and the "Watching" mode is "user-defined". There are a number of ways of specifying a breakpoint:

* Selection in an Impl module. The selection does not have to be a procedure name, it can be anywhere in the code (just like the Interpreter).

* Selection of a rope of the form "FooImpl.Proc". The selection can be anywhere, even the Spy's output.

You should know that the Spy will not be able to set a breakpoint if a) the module has not been loaded yet, or b) the symbols are not on the disk.

If "ClearUserBreaks!" is bugged, ALL of the breakpoints will go away.

SetTrace!

Whenever a trace breakpoint is encountered, the Spy will put an entry in its log saying when and where it was. The traces are only enabled while the Spy is running and the "Watching" mode is "user-defined". Trace breaks are set just like user breaks. They are cleared with `ClearUserBreaks!'.

Starting and Stopping the Spy.

There are two main ways of starting and stopping the Spy: by bugging "Spy: {on}" and "Spy: {off}", or by setting breakpoints that determine programmatically where the Spy is to start and stop. Start and Stop breakpoints are set in exactly the same manner as given above for user breakpoints. The only difference is that if a Stop breakpoint is set on a the beginning procedure, the break will occur at the exit of the procedure rather than its entry.

Setting a Start or Stop breakpoint does not actually enable the Spy to run. Instead, the "Spy: {off}" switch will be changed to say "Break: {off}". The Start and Stop breaks are only enabled when the programmer has set the switch to "Break: {on}". This allows the programmer to specify all of the breaks he wants before the Spy starts measuring. Changing the switch to "Break: {off}" doesn't remove the breaks, it just disables them. Thus the user can reuse the breaks he has already set.

The breakpoint facility is designed so that the Spy is recording data as long as the number of Start breaks exceeds the number of Stop breaks. This allows breaks to be set in recursive procedures.

When "ClearBreaks!" is bugged, all of the Start and Stop breaks are removed and the on/off switch is changed to say "Spy: {off}".

Displaying the Data.

Clicking "DisplayData!" will cause the data to be displayed in a new typescript. It will also automatically turn off the Spy if it is running. "DisplayData!" does not destroy the data stored. If some of the symbols were missing, the user can fetch them and click "DisplayData!" again. The second invocation will take advantage of the new symbols.

To the left of the "DisplayData!" button is a button called "cutoff". The "cutoff" button lets you specify how much information you want to be displayed. The number after the cutoff is the percentage of times a procedure must be in the call stack in order to be displayed. If a procedure is in the call stack fewer times than this, then it will not be displayed.

Interpreting the Results.

The Spy's output is divided into three main parts: a header, a breakdown of the processes, and a breakdown of the procedures. The header gives overall statistics about the execution of the Spy while the two breakdown sections are focussed on detailed data. The three sections are separated by double rows of tildas (~~~).

==================================
Cedar Spy of: 29-Apr-82 9:19:41.
Executed at: 29-Apr-82 9:20:15.
.
.
.

The header is pretty much self-evident. At the top it records the parameters that the programmer specified for this particular run: what was being measured, where breaks were set, etc. Below this it gives overall statistics such as how long the Spy ran, how many wakeups were recorded, what priorities did things run at, how many page faults occurred overall, and so on. At the bottom is a section called "Statistics on the execution of the Spy" which gives some data about the running of the Spy itself. This is only for debugging purposes.

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Breakdown of interesting processes.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The second section gives a list of the interesting processes that ran and some data about them. A typical entry might look like this:

Process 220B running at priority [normal, pagefaultLow] (39 page faults) = 609 (32.6%).
BBStart.EvalBase = 608 (32.5%).
CachedRegionImplB = 1 (0.1%).

The first line says that process 220B ran at two different priorities (normal and pagefaultLow) and that 39 page faults occurred as an immediate consequence of its work. It also says that 609 counts were recorded for this process, and that this accounted for 32.6% of the time.

The second and third lines says that of these 609 counts, 608 counts were in BBStart.EvalBase and 1 count was in CachedRegionImplB. These counts corresponded to 32.5% and 0.1% of all of the counts respectively.

Process 220B has two entries because it was shared by more than one logical process. Most processes will only have one entry unless a lot of processes are being forked.

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Breakdown of interesting procedures.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The last section gives a list of modules and procedures indented so as to indicate precedence relationships. (Module information is only printed when the Spy is unable to obtain the symbols necessary to print out the procedures.) Each module or procedure that occurs in more than 3.0% of the call stacks will be printed exactly once somewhere in this section. A typical entry might look like this:

Procedure StackSpy.AddSon (2 refs) = 93, 2794 (2.9%) (92.6%).
(20154) StackSpy.AddProcedure = 1946 (62.4%).
(20277) RTPrefAllocImpl.NewPrefixedObject = 78 (2.5%).
(20348) StackSpy.AddProcedure = 632 (20.2%).
(20398) RTPrefAllocImpl.NewPrefixedObject = 60 (1.9%).
(20444) waiting.pagefault = 59 (1.9%).

The first line indicates that the procedure StackSpy.AddSon was called from two places (2 refs) and was in the call stack a total of 92.6% of the time. This 92.6% of the time was divided into 93 counts when the CPU was executing code in the procedure itself, and 2794 counts when the CPU was executing code in procedures called from AddSon. The 2.9% is only there because the number of counts that were in the procedure itself was great enough (>1.0%) to force it to print a percentage for just those counts. Note that percentages are always for the amount of time that a procedure is in the call stack. This means that percentages may add up to more than 100% since most call stacks have more than one procedure. The meanings of these numbers are printed in the header of the output for easy reference.

The number of refs to a procedure is not the same as the number of calls; it indicates the number of different places that a call to this procedure occurs in the static text of the code. In this case, AddSon was called from two different places in the procedure StackSpy.EnterStack.

The next five lines give the list of procedures that occur in the call stack under AddSon more than 1.0%. These are the significant procedures that AddSon calls. They are sorted by PC, and so they are listed in the order that they appear in the code listings. The numbers in parenthesis at the beginning are the source indexes for the calls. Selecting one of the numbers and clicking "Position" in the file StackSpy.mesa will show you where the call was made. If a procedure is called from more than one place (as StackSpy.AddProcedure is) then it will appear in the list more than once. Since the Spy does not print procedures that are called less than 1.0%, the sum of the counts for the procedures listed (1946+78+632+60+59 => 2775) may be less that the total number of calls given in the procedure header (2794).

The count that appears after each procedure listed is the number of times that that procedure was in the call stack under AddSon. The percentage is the ratio of that count to the total number of counts. The last entry, waiting.pagefault, is an entry that only appears when exactly one process is being observed. It says that AddSon was waiting on a pagefault for 1.9% of the time. (Other "waiting" procedures are ML (monitor lock), CV (condition variable) and preempted. Preempted means that the process was ready to run, but another process got to run instead.)

Interpreting Indentation.

The procedures as a whole are indented according to the following rule: A procedure is indented immediately under the deepest procedure that completely contains all of the calls to it. Usually this just means that a procedure is indented under the procedure that calls it, but it means something different if a procedure is called from more than one place. Indentation can be used to eliminate whole sub-trees of the print out. If you aren't interested in a procedure, you can ignore all of the procedures indented below it.

Occassionally, a procedure will be indented under a procedure that doesn't call it directly. (The names of such procedures are printed in italics for easy identification.) For instance, if A calls B which calls C, and A calls D which calls C, then C is indented under A rather than B or D. This is because only A completely contains all of the calls to C. One consequence of this scheme is that utility procedures that are called from many places will appear high in the hierarchy even though they are always low in the call stack. I consider this a feature, since if you are spending 50% of your time in the allocator, you want that fact to jump out at you, especially if it is buried in hundreds of little calls.

Procedures that are all at the same level of indentation are grouped into sets of disjoint procedures using exclamation points (!). All of the procedures that are connected by a line of exclamation points are disjoint. That is, none of the procedures in the set ever calls any of the other procedures in the set, either directly or indirectly. Sets of disjoint procedures are connected with periods (.) to help the reader keep track of indentation. Disjoint sets are just a hint. Procedures from different sets may be disjoint even though they are not indicated as such. However, procedures in the same disjoint set are guaranteed to be disjoint.

This indentation scheme may not be the best, but it does not hurt since it can always be ignored. The indentations do not add any information, they just summarize information already given by the calls from other procedures. However I will be glad to entertain suggestions for a better scheme.

Using the Spy Effectively.

1) First use `Watching: {CPU}'. This will tell you which processes and procedures are consuming CPU cycles. Look at `Total page faults' and `Total words allocated' in the header to see if there are a lot of page faults or allocations. If there are, you might want to run the Spy with `Watching: {pagefaults}' or `Watching: {wordsAllocated}'.

2) If the Spy can't find the symbols it needs when it prints its log, it will print the data with numbers instead of procedure names. You do not need to run your experiment again to get the procedure names -- just bringover the symbols and click 'DisplayData!' again.

3) If the processor hog is a single process, use 'Watching: {breakProcess}' to gather more information about that process. Select the name of a top level procedure in the Spy's typescript and then click 'SetStartBreak!' and 'SetStopBreak!'. Then run your experiment again. When the first break is hit, the Spy will start monitoring the process that was calling the procedure.

4) If you suspect that things are running slowly because a procedure is being called too often, set a user break (click 'SetUserBreak!') on the procedure and run the experiment again. This will show how many times the procedure gets called and who called it.

How the Spy Works.

We know from Heisenburg that no tool could measure an event without influencing it. However, if you have an idea of how the tool measures, you may be able to factor out its influence. Most programmers should be able to use the Spy without understanding how it gathers data. But those who are really particular about accurate data (or where the data is likely to be inaccurate) should read this section.

The Spy works by a) choosing a process, b) making up a "stack" of GFI's and PC's for that process, and c) writing the stack into a special log which interfaces to the disk. A process is chosen according to the parameters set by the programmer at the beginning of the session. If the programmer tells the Spy to watch pagefaults then processes that cause pagefaults will be logged. If the programmer tells the Spy to watch user-defined breaks, then processes that cause breaks will be logged.

If the programmer tells the Spy to watch the CPU, then the Spy will wake up at regular intervals and log the top process on the ready list. It finds the "top process" by checking priority levels from top to bottom looking for a process that is ready and able to run (A process would not be able to run if there were no state vector at its level). There are two ways to wake up at regular intervals: use a special IntervalTimer or wake up on the vertical retrace. The IntervalTimer the best way to wakeup but it isn't always available. So sometimes the Spy wakes up on the vertical retrace.

There are some bad side effects of waking up on the vertical retrace. Since timeouts are all synchronized to the vertical retrace, actions that are triggered by timeouts may appear to be running far more than they actually are. To counter this, the Spy tries to guess which processes run off of the vertical retrace. When it is first run, it scans the process table looking for processes that have very short timeouts. If it finds any, it saves them in a table. When it scans the ready list it skips over those processes if they are still at the pc locations that they were when they were waiting. This heuristic is only partically effective, but it is better than nothing.

Once the Spy has decided which process to record, it builds a stack by chasing up the frame links and writing down the GFI's and PC's. If the stack has more that 75 frames, it aborts its work and writes an error entry in the log (if this ever occurs, it is noted under "Statistics on the execution of the Spy"). Otherwise it writes the stack it constructed into the log.

The log consists of two pinned buffers backed up by a file named "Spy.log". The Spy will write in one buffer until it is full and then write in the other. As it writes into the second, a background process writes the first one to the disk and remaps it to the next section of the file. A background process is used since the Spy cannot page fault. The background process may appear in the Spy's output. If it does, it will have the name "SpyLogImpl.NewBuffer".

Occasionally, the disk will be so busy that the first buffer will not be ready for the Spy when it has finished filling the second. When this happens the Spy just hangs until the first buffer is ready. If the Spy is watching the CPU, this means that it may miss a few ticks of the vertical retrace. You can get some idea of how many ticks were missed by looking at the average number of counts per second that is given at the beginning of the output. If the average is less that 80, some information was lost. If you suspect that this is a problem, set SpyKernelImpl.monitor to TRUE and SpyLogReaderImpl.recordSpyTime to TRUE. From then on the Spy will record the distribution of delays between counts under the header "Statistics on the execution of the Spy".

Appendix: Spy Commands.

name Spy: {off, on}

Description:

Clicking the 'Spy' button toggles the current state of the Spy. If the Spy is off, then clicking the button will turn it on. If the Spy is on, then clicking the button will turn it off. The Spy is initialized using the parameters shown in the viewer when the button is clicked.

Examples:

Suppose you want to know why opening a viewer takes so long. Open the Spy and toggle the 'Watching' button until it says 'CPU' and then click the 'Spy' button so that it says '{on}'. Then open a viewer. Finally click 'DisplayData!' to see the results. ('DisplayData!' turns the Spy off automatically.)

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name Break: {off, on}

Description:

Clicking the 'Break' button enables/disables the start and stop breaks set by the user.

Examples:

Suppose you only want to measure the performance of the garbage collector, and aren't interested in taking measurements when it isn't running. Then you could set start and stop breaks on the garbage collector so that the Spy would only take measurements while the garbage collector is running. Use 'SetStartBreak!' and 'SetStopBreak!' to set breaks on the top-level procedure. Clicking either of these will change the 'Spy' button into a 'Break' button. When you have all of the breaks set, then click 'Break' to enable them. (Having a 'Break' button allows the breaks to be enabled atomically).

Stop/Undo:

Click 'ClearBreaks' to change the 'Break' button back into a 'Spy' button.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name SetStartBreak!

Description:

Select a code location and then click 'SetStartBreak!' to set a break that will start the Spy. Clicking 'SetStartBreak!' will turn the 'Spy' button into a 'Break' button. There are two ways of selecting a code location: by selecting a position in the source, or by selecting a name of the form 'ModuleImpl.Procedure'. The latter will cause a break to be set at the beginning of the procedure given. The Spy will run as long as the number of start breaks exceeds the number of stop breaks.

A start break can be set from a command file if you know the source index of the break. Just call "← SpyClient.SetStartBreak[NIL, "FooImpl.mesa", <source index>] CR". (The Spy prints out the source index for you whenever you set a break.)

Examples:

You have run the Spy watching the CPU, and are interested in finding out more about a procedure that appears in the output. So you select the name of the procedure and click 'SetStartBreak!' and 'SetStopBreak!'. Then you set 'Watching' to 'break process' and run the experiment again. When the experiment is done you can click 'DisplayData!' to see the results.

Warnings:

The Spy won't be able to set a breakpoint if the module hasn't been started yet or their are no symbols for it.

Stop/Undo:

Click 'ClearBreaks!'.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement, breakpoints

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name SetStopBreak!

Description:

Select a code location and then click 'SetStopBreak!' to set a break that will stop the Spy. Clicking 'SetStopBreak!' will turn the 'Spy' button into a 'Break' button. There are two ways of selecting a code location: by selecting a position in the source, or by selecting a name of the form 'ModuleImpl.Procedure'. The latter will cause a break to be set at the end of the procedure given. The Spy will run as long as the number of start breaks exceeds the number of stop breaks.

A stop break can be set from a command file if you know the source index of the break. Just call "← SpyClient.SetStopBreak[NIL, "FooImpl.mesa", <source index>] CR". (The Spy prints out the source index for you whenever you set a break.)

Warnings:

The Spy won't be able to set a breakpoint if the module hasn't been started yet or their are no symbols for it.

Stop/Undo:

Click 'ClearBreaks!'.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement, breakpoints

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name ClearBreaks!

Description:

Clicking 'ClearBreaks!' clears the start and stop breaks that have already been set.

Examples:

Suppose you have set a number of start and stop breaks, and then find that you set a break in the wrong place. Clicking 'ClearBreaks!' will clear all of the breaks and allow you to start again. (There is no way to clear just one break.)

Stop/Undo:

Impossible.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement, breakpoints

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name DisplayData!

Description:

Clicking 'DisplayData!' turns the Spy off and produces a typescript of the data gathered since the Spy was turned on. The data is displayed in a complicated tree structure which is explained in the section above called 'Interpreting the Results'.

If you hold down the control key while clicking 'DisplayData!', then the Spy will monitor itself while it is displaying its data. When the first typescript is finished, a second one will appear with the data for the Spy.

Warnings:

If the Spy cannot find the symbols for a module, then it will print PC ranges instead of procedure names. You can get a new typescript by bringing over the correct symbols and clicking 'DisplayData!' again (you do not have to run the experiment twice). The new typescript will use the latest symbols.

Stop/Undo:

You can stop the output by destroying the typescript viewer or by clicking control DEL. (It may take several tries before it responds to the control DEL.)

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name cutoff: {n}

Description:

The cutoff parameter determines how much of the data will be displayed. The parameter measures the percentage of time that a procedure is in the call stack. If the procedure is in the call stack less time than the cutoff specifies, then it gets pruned from the output. You can display the data using several different cutoff parameters just by changing the cutoff and clicking 'DisplayData!' again.

Left-clicking 'cutoff' increments the parameter, right-clicking decrements it. Middle-clicking 'cutoff' sets it to the default value, 3.

Examples:

Suppose you don't want to wade through a lot of little consumers to find the big procecures. Then set the cutoff to a high value, say, 10. Or suppose you want to see every little detail, no matter how insignificant. Then set the cutoff parameter to 0. (A cutoff of 0 means display everything.)

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name Watching: {CPU}

Description:

The 'Watching' button determines what the Spy is going to watch. Left-clicking it enumerates the possibilities in one direction, right-clicking enumerates it in the other. Middle-clicking resets it to the most common choice: 'CPU'.

When watching the CPU, the Spy wakes up at fixed intervals and samples the CPU to see which process is currently running. This gives a statistical picture of who is consuming CPU cycles.

Examples:

Suppose you want to know who is using the processor when the machine is supposed to be idle. Set 'Watching' to 'CPU' and then turn the Spy on. Wait a while, and then click 'DisplayData!'. The Spy will then break down its output by process, showing which processes were running and where they were spending their time.

Warnings:

The CPU doesn't have to be running to do disk or Ethernet IO. So the Spy may report that nothing is happening when in fact there is a lot going on with one of these. You can check for disk activity in the output by looking at 'Total page faults', or by comparing 'processor idle' with 'processor waiting on disk' under 'Scheduled Process-Priority Summary'. If you want to measure disk activity, use 'Watching: {pagefaults}' or 'Watching: {break process}'.

Stop/Undo:

Click 'Spy'/'Break' or 'DisplayData!' to stop.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name Watching: {process number: }

Description:

There is a text box after 'process number' that allows you to fill in the number of the process that you want to watch. When watching a particular process, the Spy wakes up at fixed intervals and samples the process to see what it is currently doing. There are five things that it could be doing: executing, waiting for permision to use the CPU (because a higher priority process preempted it), waiting on a page fault, waiting on a monitor lock, or waiting on a condition variable.

Examples:

Suppose you want to know what the VM LaundryProcess spends its time doing. Set 'Watching' to process 200B (the fixed process that the VM LaundryProcess uses) and then turn the Spy on. After a little while, click 'DisplayData!'. The Spy will tell you that the LaundryProcess spends ~65% of its time waiting for something to do, ~18% of its time waiting for the disk, and ~10% of its time waiting to use the CPU.

Warnings:

You can't always know which process number a process will use, since the process number doesn't get picked until the process gets forked. If the process is permanent, you can find out the process number by running the Spy with watching = CPU. If the process is transient, you can use watching = break process (see below) to have the process number determined dynamically.

Stop/Undo:

Click 'Spy'/'Break' or 'DisplayData!' to stop.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement, processes, monitor locks, condition variables

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name Watching: {break process}

Description:

When watching a particular process, the Spy wakes up at fixed intervals and samples the process to see what it is currently doing. There are five things that it could be doing: executing, waiting for permision to use the CPU (because a higher priority process preempted it), waiting on a page fault, waiting on a monitor lock, or waiting on a condition variable. The first start break encountered will determine which processed will be watched.

Examples:

Suppose you have a procedure FooImpl.Foo that you think is spending too much time waiting on monitor locks. Select "FooImpl.Foo" and then click 'SetStartBreak!' and 'SetStopBreak!'. Then set 'Watching' to 'break process' and click 'Break' to enable the start and stop breaks. Then say '← FooImpl.Foo[ . . .]' to the interpreter. When it returns, click 'DisplayData!' to see the results.

Stop/Undo:

Click 'Spy'/'Break' or 'DisplayData!' to stop.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement, processes

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name Watching: {pagefaults}

Description:

Everytime a pagefault is encountered, an entry is logged in the Spy. The output will distinguish between pagefaults that occur because code had to be swapped in (pagefault.code) and pagefaults that occur because data had to be swapped in (pagefault.data) Procedures that had pagefaults in the main body of the code will be printed in bold. (Non-bold procedures only had pagefaults in procedures further down in the stack.)

Stop/Undo:

Click 'Spy'/'Break' or 'DisplayData!' to stop.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement, pagefaults

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name Watching: {allocations}

Description:

Whenever an allocation occurs, the Spy logs one entry. If you want to monitor the total words allocated rather than the allocations, use 'Watching: {wordsAllocated}'.

Warnings:

The Spy only measures words allocated from safe storage, it does not measure words allocated from an uncounted heap.

Stop/Undo:

Click 'Spy'/'Break' or 'DisplayData!' to stop.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement, allocations

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name Watching: {wordsAllocated}

Description:

Whenever an allocation occurs, the Spy logs an entry for each word allocated. If you want to monitor the allocations rather than the total words allocated, use 'Watching: {allocations}'.

Warnings:

The Spy only measures words allocated from safe storage, it does not measure words allocated from an uncounted heap.

Stop/Undo:

Click 'Spy'/'Break' or 'DisplayData!' to stop.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name Watching: {user breaks}

Description:

A entry is logged each time a user-specified break is encountered. Users can set and clear breaks with `SetUserBreak!' and `ClearUserBreaks!'.

Examples:

Suppose you want to know who is sending packets out on the Ethernet. Then you should set a user break on the procedure that sends packets, and click 'Spy' to turn it on. (Setting a user break automatically sets the 'Watching' parameter to 'user breaks'). Let things run for a while and then click 'DisplayData!' to see what happened.

Stop/Undo:

Click 'Spy'/'Break' or 'DisplayData!' to stop.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name SetUserBreak!

Description:

Select a code location and then click 'SetUserBreak!' to set a user break in the Spy. (See 'Watching: {user breaks}' for more information.) There are two ways of selecting a code location: by selecting a position in the source, or by selecting a name of the form 'ModuleImpl.Procedure'. The latter will cause a break to be set at the beginning of the procedure given.

A user break can be set from a command file if you know the source index of the break. Just call "← SpyClient.SetUserBreak[NIL, "FooImpl.mesa", <source index>] CR". (The Spy prints out the source index for you whenever you set a break.)

Stop/Undo:

Click `ClearUserBreaks!'.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement, breakpoints

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name SetTrace!

Description:

Select a code location and then click 'SetTrace!' to set a trace break in the Spy. Whenever the trace break is encountered, a trace will be logged in the Spy's log. The trace tells when and where the Spy was. There are two ways of selecting a code location: by selecting a position in the source, or by selecting a name of the form 'ModuleImpl.Procedure'. The latter will cause a break to be set at the beginning of the procedure given.

A trace break can be set from a command file if you know the source index of the break. Just call "← SpyClient.SetTrace[NIL, "FooImpl.mesa", <source index>] CR". (The Spy prints out the source index for you whenever you set a break.)

Examples:

Suppose you are tinkering with a multi-process system that does spurts of work at sporadic intervals. The normal Spy doesn't help, because there is too little going on to measure. So instead you set trace breaks at all of the interesting points. What you get as output is a log of the traces encountered, giving the break location, the absolute time of the trace, and the delta since the last trace. Examining this log carefully may give you clues as to what is going on.

Stop/Undo:

Click 'ClearUserBreaks!'.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement, tracing, breakpoints

Keywords:

to be supplied by the Index Czar at the appropriate time in the future

name ClearUserBreaks!

Description:

Clicking 'ClearUserBreaks!' clears any user breaks or trace breaks that have been set.

Examples:

Suppose you have set a number of user breaks or trace breaks, and then find that you set a break in the wrong place. Clicking 'ClearUserBreaks!' will clear all of the breaks and allow you to start again. (There is no way to clear just one break.)

Stop/Undo:

Impossible.

Contact:

John Maxwell

Keyword Hints:

spying, performance, measurement, breakpoints

Keywords:

to be supplied by the Index Czar at the appropriate time in the future.