In the early days of CDC operating systems, much of the operating system was coded to run on the PPs. Some of the motivating factors were probably:
Most CDC machines had only a single CPU, but some models, such as the 6500, had two. The I/O frame contained either 7 PPs, 10 PPs (initially on the 6400 at TNO), 14 PPs (an upgrade we paid for later), or 20 PPs. Note that some PPs were pre-allocated or continuously occupied most of the time:
For the OS "kernel" itself (though that word was never used), both CPU and PP components were done entirely in assembly language. PPs did not have an RA register to bias addresses, and CPU OS code always ran with RA=0. As I recall, there was a limited amount of overlaying done with CPU OS code. This with exception of Fortran based overlay programs and additional system 'utilities' as TNO's Single User Editor (SUEDI/SUEDA). But if you like overlays, PPs are the place for you.
All PP programs reserved locations 0 - 77B as direct cells. For PP programs that used STL, STL was located in locations 100B - 777B, with the program itself starting at 1000B. Other programs simply started at 100B.
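Purely as an illustration (the identifiers below are invented, not CDC source), the layout described above amounts to:

    /* Sketch of the PP memory layout described above; names are mine. */
    enum pp_layout {
        PP_DIRECT_CELLS_FIRST = 0,      /* 0B - 77B: direct cells                  */
        PP_DIRECT_CELLS_LAST  = 077,
        PP_STL_FIRST          = 0100,   /* 100B - 777B: STL, for programs using it */
        PP_STL_LAST           = 0777,
        PP_LOAD_WITH_STL      = 01000,  /* program start when STL is present       */
        PP_LOAD_WITHOUT_STL   = 0100    /* otherwise the program starts at 100B    */
    };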
Because memory was so limited in PPs, many PP programs were written using overlays. By convention, overlays were written to load at a multiple of 1000 octal. PP programs were given names with 3 characters, and by convention the first character was a digit representing where the program should load (the address divided by 1000B). Main overlays typically loaded at 1000B, so many programs had names that started with 1. For instance, 1AJ (Advance Job) was called when a command in a job was completed and the next control card needed to be read, parsed, and executed. Child overlays loaded at higher locations, so their names started with bigger digits, such as 4.
Another reason for using a digit at the start of a PP name was security. Only PP programs starting with an alphabetic character could be called by users, and then only when the access level of the PP program fell within the user's authorization bounds.
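As a purely illustrative sketch (in C; the helper names are mine, and the access-level check is not modeled here), the naming convention boils down to:

    #include <ctype.h>
    #include <stdio.h>

    /* A 3-character PP program name whose first character is a digit loads at
       that digit times 1000B; a name starting with a letter marks a program
       that users may call, subject to the access-level checks noted above. */
    static int pp_load_address(const char *name)
    {
        if (isdigit((unsigned char)name[0]))
            return (name[0] - '0') * 01000;  /* e.g. "1AJ" -> 1000B, "4xx" -> 4000B */
        return -1;                           /* no load address encoded in the name */
    }

    static int pp_user_callable(const char *name)
    {
        return isalpha((unsigned char)name[0]);  /* digit-prefixed names are system-only */
    }

    int main(void)
    {
        printf("%o\n", pp_load_address("1AJ"));   /* prints 1000 (octal)   */
        printf("%d\n", pp_user_callable("1AJ"));  /* prints 0: system-only */
        return 0;
    }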
There was one important PP routine that was a hybrid: 1SP (later 1SQ), the Stack Processor. 1SP was responsible for the actual disk I/O. It processed disk I/O requests that were organized into priority lists, the so-called stacks. The stack processor tried to optimize head movements and sector selection to obtain the highest overall throughput and to minimize waiting times. Responsive disk I/O was very important to system performance, of course, so the system made sure that a copy of 1SP was always loaded into at least one PP, even if there were no outstanding disk I/O requests. In fact, since there were multiple disk controllers and disk units, the system could do true simultaneous disk I/O, and therefore tried to keep multiple copies of 1SP loaded to allow this to happen. The system dynamically adjusted the number of copies of 1SP/1SQ in PPs. If there was a lot of disk I/O on multiple units for a while, more copies of 1SP would be loaded. However, you wouldn't want to tie up too many PPs with idle copies of 1SP, so the number was allowed to dwindle when the I/O load decreased.
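As a minimal sketch of the scheduling idea only, and emphatically not a reconstruction of 1SP/1SQ, one could picture something like the following: requests live in priority-ordered stacks, and within the highest-priority non-empty stack the request with the shortest seek from the current head position is served next. The data structures and the fixed number of priority levels are assumptions for illustration.

    #include <stdlib.h>

    struct disk_request {
        int cylinder;                 /* target cylinder for the seek        */
        struct disk_request *next;
    };

    #define NSTACKS 4                 /* number of priority levels (assumed) */

    struct disk_request *stacks[NSTACKS];   /* stacks[0] = highest priority  */

    /* Pick the request closest to the current head position from the
       highest-priority non-empty stack; NULL if all stacks are empty. */
    struct disk_request *next_request(int head_cyl)
    {
        for (int s = 0; s < NSTACKS; s++) {
            struct disk_request **best = NULL;
            for (struct disk_request **pp = &stacks[s]; *pp != NULL; pp = &(*pp)->next) {
                if (best == NULL ||
                    abs((*pp)->cylinder - head_cyl) < abs((*best)->cylinder - head_cyl))
                    best = pp;
            }
            if (best != NULL) {
                struct disk_request *r = *best;
                *best = r->next;      /* unlink the chosen request */
                return r;
            }
        }
        return NULL;
    }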
Most PP routines were stored on disk, but the master copy of 1SP was kept in central memory, as was the code of some other PP routines and DSD overlays. That code had to reside in expensive main memory because, for example, it was needed to handle disk error situations or monitored tasks.
CDC operating systems implemented an unusual system call mechanism. System requests - referred to as PP requests even if no PP program was involved - were made by placing a specially-formatted word at address 1 of a program's field length (i.e., RA+1). This location was scanned periodically by MTR (or CPUMTR). When the system noticed that a job's RA+1 was non-zero, it would zero the location and start servicing the request. By convention, applications would loop, waiting for RA+1 to zero both before and after issuing a request. It certainly was necessary for an application to ensure that RA+1 was zero before issuing a request, lest a previously-issued but as yet unserviced request be overwritten. But this could have been done by consistently checking either before or after each request.
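A minimal sketch of this calling convention, written in C rather than COMPASS, assuming a pointer to the job's field length and glossing over the encoding of the 60-bit request word (approximated here by uint64_t):

    #include <stdint.h>

    /* field_length stands for the job's memory starting at RA:
       word 0 is RA+0, word 1 is RA+1. */
    volatile uint64_t *field_length;

    void issue_pp_request(uint64_t request_word)
    {
        /* Wait until any previously issued request has been picked up,
           so an unserviced request is never overwritten. */
        while (field_length[1] != 0)
            ;                            /* spin: MTR/CPUMTR zeroes RA+1 */

        field_length[1] = request_word;  /* post the specially formatted word */

        /* By convention, also spin until the monitor has accepted this one.
           Where the exchange-jump option was available, an application could
           issue an XJ here instead of spinning, as described below. */
        while (field_length[1] != 0)
            ;
    }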
In the early days, a significant amount of the system's CPU time (probably 5-10%) was spent by applications looping, waiting for the system to notice their RA+1 requests. An optional instruction, the Central Processor Exchange Jump, was available to allow an application to transfer control to the OS and have it notice the request. This XJ instruction was kind of like a software interrupt.
(with special thanks to Mark Riordan who provided the basis for this page)