paper, section 6.

6. System operation

This section describes procedures used to operate a 4.4BSD UNIX system. Procedures described here are used periodically, to reboot the system, analyze error messages from devices, do disk backups, monitor system performance, recompile system software and control local changes.

6.1. Bootstrap and shutdown procedures

In a normal reboot, the system checks the disks and comes up multi-user without intervention at the console. Such a reboot can be stopped (after it prints the date) with a ^C (interrupt). This will leave the system in single-user mode, with only the console terminal active. (If the console has been marked ``insecure'' in /etc/ttys you must enter the root password to bring the machine to single-user mode.) It is also possible to allow the filesystem checks to complete and then to return to single-user mode by signaling fsck(8) with a QUIT signal (^\).

To bring the system up to a multi-user configuration from the single-user status, all you have to do is hit ^D on the console. The system will then execute /etc/rc, a multi-user restart script (and /etc/rc.local), and come up on the terminals listed as active in the file /etc/ttys. See init(8) and ttys(5) Note, however, that this does not cause a filesystem check to be done. Unless the system was taken down cleanly, you should run ``fsck -p'' or force a reboot with reboot(8) to have the disks checked.

To take the system down to a single user state you can use

# kill 1

or use the shutdown(8) command (which is much more polite, if there are other users logged in) when you are running multi-user. Either command will kill all processes and give you a shell on the console, as if you had just booted. Filesystems remain mounted after the system is taken single-user. If you wish to come up multi-user again, you should do this by:

# cd /
# /sbin/umount -a
# ^D

Each system shutdown, crash, processor halt and reboot is recorded in the system log with its cause.

6.2. Device errors and diagnostics

When serious errors occur on peripherals or in the system, the system prints a warning diagnostic on the console. These messages are collected by the system error logging process syslogd(8) and written into a system error log file /var/log/messages. Less serious errors are sent directly to syslogd, which may log them on the console. The error priorities that are logged and the locations to which they are logged are controlled by /etc/syslog.conf. See syslogd(8) for further details.

Error messages printed by the devices in the system are described with the drivers for the devices in section 4 of the programmer's manual. If errors occur suggesting hardware problems, you should contact your hardware support group or field service. It is a good idea to examine the error log file regularly (e.g. with the command tail -r /var/log/messages).

6.3. Filesystem checks, backups, and disaster recovery

Periodically (say every week or so in the absence of any problems) and always (usually automatically) after a crash, all the filesystems should be checked for consistency by fsck(1). The procedures of reboot(8) should be used to get the system to a state where a filesystem check can be done manually or automatically.

Dumping of the filesystems should be done regularly, since once the system is going it is easy to become complacent. Complete and incremental dumps are easily done with dump(8). You should arrange to do a towers-of-hanoi dump sequence; we tune ours so that almost all files are dumped on two tapes and kept for at least a week in most every case. We take full dumps every month (and keep these indefinitely). Operators can execute ``dump w'' at login that will tell them what needs to be dumped (based on the /etc/fstab information). Be sure to create a group operator in the file /etc/group so that dump can notify logged-in operators when it needs help.

More precisely, we have three sets of dump tapes: 10 daily tapes, 5 weekly sets of 2 tapes, and fresh sets of three tapes monthly. We do daily dumps circularly on the daily tapes with sequence `3 2 5 4 7 6 9 8 9 9 9 ...'. Each weekly is a level 1 and the daily dump sequence level restarts after each weekly dump. Full dumps are level 0 and the daily sequence restarts after each full dump also.

Thus a typical dump sequence would be:

    tape name   level number       date         opr      size
    ----------------------------------------------------------
      FULL           0         Nov 24, 1992   operator   137K
        D1           3         Nov 28, 1992   operator   29K
        D2           2         Nov 29, 1992   operator   34K
        D3           5         Nov 30, 1992   operator   19K
        D4           4          Dec 1, 1992   operator   22K
        W1           1          Dec 2, 1992   operator   40K
        D5           3          Dec 4, 1992   operator   15K
        D6           2          Dec 5, 1992   operator   25K
        D7           5          Dec 6, 1992   operator   15K
        D8           4          Dec 7, 1992   operator   19K
        W2           1          Dec 9, 1992   operator   118K
        D9           3         Dec 11, 1992   operator   15K
       D10           2         Dec 12, 1992   operator   26K
        D1           5         Dec 15, 1992   operator   14K
        W3           1         Dec 17, 1992   operator   71K
        D2           3         Dec 18, 1992   operator   13K
      FULL           0         Dec 22, 1992   operator   135K

We do weekly dumps often enough that daily dumps always fit on one tape.

Dumping of files by name is best done by tar(1) but the amount of data that can be moved in this way is limited to a single tape. Finally if there are enough drives entire disks can be copied with dd(1) using the raw special files and an appropriate blocking factor; the number of sectors per track is usually a good value to use, consult /etc/disktab.

It is desirable that full dumps of the root filesystem be made regularly. This is especially true when only one disk is available. Then, if the root filesystem is damaged by a hardware or software failure, you can rebuild a workable disk doing a restore in the same way that the initial root filesystem was created.

Exhaustion of user-file space is certain to occur now and then; disk quotas may be imposed, or if you prefer a less fascist approach, try using the programs du(1), df(1), and quot(8), combined with threatening messages of the day, and personal letters.

6.4. Moving filesystem data

If you have the resources, the best way to move a filesystem is to dump it to a spare disk partition, or magtape, using dump(8), use newfs(8) to create the new filesystem, and restore the filesystem using restore(8). Filesystems may also be moved by piping the output of dump to restore. The restore program uses an ``in-place'' algorithm that allows filesystem dumps to be restored without concern for the original size of the filesystem. Further, portions of a filesystem may be selectively restored using a method similar to the tape archive program.

If you have to merge a filesystem into another, existing one, the best bet is to use tar(1). If you must shrink a filesystem, the best bet is to dump the original and restore it onto the new filesystem. If you are playing with the root filesystem and only have one drive, the procedure is more complicated. If the only drive is a Winchester disk, this procedure may not be used without overwriting the existing root or another partition. What you do is the following:

1.: GET A SECOND PACK, OR USE ANOTHER DISK DRIVE!!!!
2.: Dump the root filesystem to tape using dump(8).
3.: Bring the system down.
4.: Mount the new pack in the correct disk drive, if using removable media.
5.: Load the distribution tape and install the new root filesystem as you did when first installing the system. Boot normally using the newly created disk filesystem.

Note that if you change the disk partition tables or add new disk drivers they should also be added to the standalone system in /sys/<architecture>/stand, and the default disk partition tables in /etc/disktab should be modified.

6.5. Monitoring system performance

The systat program provided with the system is designed to be an aid to monitoring systemwide activity. The default ``pigs'' mode shows a dynamic ``ps''. By running in the ``vmstat'' mode when the system is active you can judge the system activity in several dimensions: job distribution, virtual memory load, paging and swapping activity, device interrupts, and disk and cpu utilization. Ideally, there should be few blocked (b) jobs, there should be little paging or swapping activity, there should be available bandwidth on the disk devices (most single arms peak out at 20-30 tps in practice), and the user cpu utilization (us) should be high (above 50%).

If the system is busy, then the count of active jobs may be large, and several of these jobs may often be blocked (b). If the virtual memory is active, then the paging demon will be running (sr will be non-zero). It is healthy for the paging demon to free pages when the virtual memory gets active; it is triggered by the amount of free memory dropping below a threshold and increases its pace as free memory goes to zero.

If you run in the ``vmstat'' mode when the system is busy, you can find imbalances by noting abnormal job distributions. If many processes are blocked (b), then the disk subsystem is overloaded or imbalanced. If you have several non-dma devices or open teletype lines that are ``ringing'', or user programs that are doing high-speed non-buffered input/output, then the system time may go high (60-70% or higher). It is often possible to pin down the cause of high system time by looking to see if there is excessive context switching (cs), interrupt activity (in) and per-device interrupt counts, or system call activity (sy). Cumulatively on one of our large machines we average about 60-200 context switches and interrupts per second and about 50-500 system calls per second.

If the system is heavily loaded, or if you have little memory for your load (2M is little in most any case), then the system may be forced to swap. This is likely to be accompanied by a noticeable reduction in system performance and pregnant pauses when interactive jobs such as editors swap out. If you expect to be in a memory-poor environment for an extended period you might consider administratively limiting system load.

6.6. Recompiling and reinstalling system software

It is easy to regenerate either the entire system or a single utility, and it is a good idea to try rebuilding pieces of the system to build confidence in the procedures.

In general, there are six well-known targets supported by all the makefiles on the system:

all: This entry is the default target, the same as if no target is specified. This target builds the kernel, binary or library, as well as its associated manual pages. This target does not build the dependency files. Some of the utilities require that a make depend be done before a make all can succeed.
depend: Build the include file dependency file, ``.depend'', which is read by make. See mkdep(1) for further details.
install: Install the kernel, binary or library, as well as its associated manual pages. See install(1) for further details.
clean: Remove the kernel, binary or library, as well as any object files created when building it.
cleandir: The same as clean, except that the dependency files and formatted manual pages are removed as well.
obj: Build a shadow directory structure in the area referenced by /usr/obj and create a symbolic link in the current source directory to referenced it, named ``obj''. Once this shadow structure has been created, all the files created by make will live in the shadow structure, and /usr/src may be mounted read-only by multiple machines. Doing a make obj in /usr/src will build the shadow directory structure for everything on the system except for the contributed, old, and kernel software.

The system consists of three major parts: the kernel itself, found in /usr/src/sys, the libraries , found in /usr/src/lib, and the user programs (the rest of /usr/src).

Deprecated software, found in /usr/src/old, often has old style makefiles; some of it does not compile under 4.4BSD at all.

Contributed software, found in /usr/src/contrib, usually does not support the ``cleandir'', ``depend'', or ``obj'' targets.

The kernel does not support the ``obj'' shadow structure. All kernels are compiled in subdirectories of /usr/src/sys/compile which is usually abbreviated as /sys/compile. If you want to mount your source tree read-only, /usr/src/sys/compile will have to be on a separate filesystem from /usr/src. Separation from /usr/src can be done by making /usr/src/sys/compile a symbolic link that references /usr/obj/sys/compile. If it is a symbolic link, the S variable in the kernel Makefile must be changed from ../.. to the absolute pathname needed to locate the kernel sources, usually /usr/src/sys. The symbolic link created by config(8) for machine must also be manually changed to an absolute pathname. Finally, the /usr/src/sys/libkern/obj directory must be located in /usr/obj/sys/libkern.

Each of the standard utilities and libraries may be built and installed by changing directories into the correct location and doing:

# make
# make install

Note, if system include files have changed between compiles, make will not do the correct dependency checks if the dependency files have not been built using the ``depend'' target.

The entire library and utility suite for the system may be recompiled from scratch by changing directory to /usr/src and doing:

# make build

This target installs the system include files, cleans the source tree, builds and installs the libraries, and builds and installs the system utilities.

To recompile a specific program, first determine where the binary resides with the whereis(1) command, then change to the corresponding source directory and build it with the Makefile in the directory. For instance, to recompile ``passwd'', all one has to do is:

# whereis passwd
/usr/bin/passwd
# cd /usr/src/usr.bin/passwd
# make
# make install

this will compile and install the passwd utility.

If you wish to recompile and install all programs into a particular target area you can override the default path prefix by doing:

# make
# make DESTDIR=pathname install

Similarly, the mode, owner, group, and other characteristics of the installed object can be modified by changing other default make variables. See make(1), /usr/src/share/mk/bsd.README, and the ``.mk'' scripts in the /usr/share/mk directory for more information.

If you modify the C library or system include files, to change a system call for example, and want to rebuild and install everything, you have to be a little careful. You must ensure that the include files are installed before anything is compiled, and that the libraries are installed before the remainder of the source, otherwise the loaded images will not contain the new routine from the library. If include files have been modified, the following commands should be done first:

# cd /usr/src/include
# make install

Then, if, for example, C library files have been modified, the following commands should be executed:

# cd /usr/src/lib/libc
# make depend
# make
# make install
# cd /usr/src
# make depend
# make
# make install

Alternatively, the make build command described above will accomplish the same tasks. This takes several hours on a reasonably configured machine.

6.7. Making local modifications

The source for locally written commands is normally stored in /usr/src/local, and their binaries are kept in /usr/local/bin. This isolation of local binaries allows /usr/bin, and /bin to correspond to the distribution tape (and to the manuals that people can buy). People using local commands should be made aware that they are not in the base manual. Manual pages for local commands should be installed in /usr/local/man/cat[1-8]. The man(1) command automatically finds manual pages placed in /usr/local/man/cat[1-8] to encourage this practice (see man.conf(5)).

6.8. Accounting

UNIX optionally records two kinds of accounting information: connect time accounting and process resource accounting. The connect time accounting information is stored in the file /var/log/wtmp, which is summarized by the program ac(8). The process time accounting information is stored in the file /var/account/acct after it is enabled by accton(8), and is analyzed and summarized by the program sa(8).

If you need to recharge for computing time, you can develop procedures based on the information provided by these commands. A convenient way to do this is to give commands to the clock daemon /usr/sbin/cron to be executed every day at a specified time. This is done by adding lines to /etc/crontab.local; see cron(8) for details.

6.9. Resource control

Resource control in the current version of UNIX is more elaborate than in most UNIX systems. The disk quota facilities developed at the University of Melbourne have been incorporated in the system and allow control over the number of files and amount of disk space each user and/or group may use on each filesystem. In addition, the resources consumed by any single process can be limited by the mechanisms of setrlimit(2). As distributed, the latter mechanism is voluntary, though sites may choose to modify the login mechanism to impose limits not covered with disk quotas.

To use the disk quota facilities, the system must be configured with ``options QUOTA''. Filesystems may then be placed under the quota mechanism by creating a null file quota.user and/or quota.group at the root of the filesystem, running quotacheck(8), and modifying /etc/fstab to show that the filesystem is to run with disk quotas (options userquota and/or groupquota). The quotaon(8) program may then be run to enable quotas.

Individual quotas are applied by using the quota editor edquota(8). Users may view their quotas (but not those of other users) with the quota(1) program. The repquota(8) program may be used to summarize the quotas and current space usage on a particular filesystem or filesystems.

Quotas are enforced with soft and hard limits. When a user and/or group first reaches a soft limit on a resource, a message is generated on their terminal. If the user and/or group fails to lower the resource usage below the soft limit for longer than the time limit established for that filesystem (default seven days) the system then treats the soft limit as a hard limit and disallows any allocations until enough space is reclaimed to bring the user and/or group back below the soft limit. Hard limits are enforced strictly resulting in errors when a user and/or group tries to create or write a file. Each time a hard limit is exceeded the system will generate a message on the user's terminal.

Consult the auxiliary document, ``Disc Quotas in a UNIX Environment'' (SMM:4) and the appropriate manual entries for more information.

6.10. Network troubleshooting

If you have anything more than a trivial network configuration, from time to time you are bound to run into problems. Before blaming the software, first check your network connections. On networks such as the Ethernet a loose cable tap or misplaced power cable can result in severely deteriorated service. The netstat(1) program may be of aid in tracking down hardware malfunctions. In particular, look at the -i and -s options in the manual page.

Should you believe a communication protocol problem exists, consult the protocol specifications and attempt to isolate the problem in a packet trace. The SO_DEBUG option may be supplied before establishing a connection on a socket, in which case the system will trace all traffic and internal actions (such as timers expiring) in a circular trace buffer. This buffer may then be printed out with the trpt(8) program. Most of the servers distributed with the system accept a -d option forcing all sockets to be created with debugging turned on. Consult the appropriate manual pages for more information.

6.11. Files that need periodic attention

We conclude the discussion of system operations by listing the files that require periodic attention or are system specific:

/etc/fstab           how disk partitions are used
/etc/disktab         default disk partition sizes/labels
/etc/printcap        printer database
/etc/gettytab        terminal type definitions
/etc/remote          names and phone numbers of remote machines for tip(1)
/etc/group           group memberships
/etc/motd            message of the day
/etc/master.passwd   password file; each account has a line
/etc/rc.local        local system restart script; runs reboot; starts daemons
/etc/inetd.conf      local internet servers
/etc/hosts           local host name database
/etc/networks        network name database
/etc/services        network services database
/etc/hosts.equiv     hosts under same administrative control
/etc/syslog.conf     error log configuration for syslogd(8)
/etc/ttys            enables/disables ports
/etc/crontab         commands that are run periodically
/etc/crontab.local   local commands that are run periodically
/etc/aliases         mail forwarding and distribution groups
/var/account/acct    raw process account data
/var/log/messages    system error log
/var/log/wtmp        login session accounting

1. Introduction
1.1. Distribution format
1.2. UNIX device naming
1.3. UNIX devices: block and raw
2. Bootstrap procedure
2.1. Bootstrapping from the tape
2.2. Booting the HP300
2.2.1. Supported hardware
2.2.2. Standalone device file naming
2.2.3. The procedure
2.2.3.1. Step 1: selecting and formatting a disk
2.2.3.2. Step 2: copying the root filesystem from tape to disk
2.2.3.3. Step 3: booting the root filesystem
2.2.3.4. Step 4: (optional) restoring the root filesystem
2.2.3.5. Step 5: placing labels on the disks
2.3. Booting the SPARC
2.3.1. Supported hardware
2.3.2. Limitations
2.3.3. The procedure
2.4. Booting the DECstation
2.4.1. Supported hardware
2.4.2. The procedure
2.4.2.1. Procedure A: copy root filesystem to disk
2.4.2.2. Procedure B: bootstrap from tape
2.4.2.3. Procedure C: bootstrap over the network
2.4.3. Label disk and create the root filesystem
2.5. Disk configuration
2.5.1. Disk naming and divisions
2.5.2. Layout considerations
2.5.3. Filesystem parameters
2.5.4. Implementing a layout
2.6. Installing the rest of the system
2.7. Additional conversion information
3. Upgrading a 4.3BSD system
3.1. Installation overview
3.2. Files to save
3.3. Installing 4.4BSD
3.4. Merging your files from 4.3BSD into 4.4BSD
3.4.1. Changes in the /etc directory
3.4.2. Shadow password files
3.4.3. The /var filesystem
3.5. Bug fixes and changes between 4.3BSD and 4.4BSD
3.5.1. Changes to the kernel
3.5.2. Security
3.5.2.1. Virtual memory changes
3.5.2.2. Networking additions and changes
3.5.2.3. Additions and changes to filesystems
3.5.2.4. POSIX terminal driver changes
3.5.2.5. Native operating system compatibility
3.5.3. Changes to the utilities
3.5.3.1. Make and Makefiles
3.5.3.2. Kerberos
3.5.3.3. Timezone support
3.5.3.4. Additions and changes to the libraries
3.5.3.5. Additions and changes to other utilities
3.6. Hints on converting from 4.3BSD to 4.4BSD
4. System setup
4.1. Kernel configuration
4.1.1. Kernel organization
4.1.2. Devices and device drivers
4.1.3. Building new system images
4.2. Configuring terminals
4.3. Adding users
4.4. Site tailoring
4.5. Setting up the line printer system
4.6. Setting up the mail system
4.6.1. Setting up a UUCP connection
5. Network setup
5.1. System configuration
5.2. Local subnets
5.3. Internet broadcast addresses
5.4. Routing
5.5. Use of 4.4BSD machines as gateways
5.6. Network databases
5.6.1. Network servers
5.6.2. Internet daemon
5.6.3. The /etc/hosts.equiv file
5.6.4. The /etc/ftpusers file
6. System operation
6.1. Bootstrap and shutdown procedures
6.2. Device errors and diagnostics
6.3. Filesystem checks, backups, and disaster recovery
6.4. Moving filesystem data
6.5. Monitoring system performance
6.6. Recompiling and reinstalling system software
6.7. Making local modifications
6.8. Accounting
6.9. Resource control
6.10. Network troubleshooting
6.11. Files that need periodic attention