Version 8.1.3
 —  Operations  —

Restart/Recovery Processing

Restart/recovery occurs if a cluster nucleus fails. Restart/recovery uses the Work data sets of all nuclei to recover the database. The Work data sets are dynamically allocated from the data set names recorded in the PPT. Adabas Cluster Services 8.1 supports offline and online recovery.

This document covers the following topics:


Offline Recovery (Session Autorestart)

Offline recovery occurs if all active cluster nuclei in an Adabas sysplex cluster fail. Offline recovery relies only on information from the physical database and the Work data sets of each cluster nucleus. All information in the coupling facility is lost.

The first cluster nucleus to restart repairs any physical inconsistencies in the database and backs out all incomplete commands and transactions. The restarted nucleus obtains recovery information from blocks in the common database and from the Work data sets of all the failed nuclei.

The restarting nucleus retrieves the Work data set names from the PPT block for each terminated nucleus and opens these data sets using dynamic allocation. From that point, normal recovery processing occurs:

While reading through the Work data sets, the restarting nucleus on the fly merges the protection records by their timestamps into chronological sequence.

Top of page

Online Recovery

When one or more cluster nuclei have failed while one or more other nuclei in the same cluster remain active, online recovery processing is performed by collaboration of all surviving nuclei.

All surviving cluster nuclei quiesce their operations and reinitialize their working storage. Command processing is quiesced and the internal status variables, tables, and pools are repaired.

The peer nuclei compete for the recovery lock: when one of the nuclei obtains it, it invokes offline recovery processing. It repairs any physical inconsistencies in the database and backs out all incomplete command and transactions. Open transactions executed by the surviving nuclei are backed out as well. All information in the lock and cache structures is discarded.

Once this recovery processing has completed, normal processing resumes.

Users are affected by online recovery as follows:

Top of page

Automatic Restart Management (ARM)

Automatic restart management (ARM) is a z/OS facility that can be used to automatically restart a nucleus when it ABENDs. Automatic restart is suppressed when the ABEND is intentional; for example, when it results from a parameter error.

ARM can be used for Adabas nuclei in both cluster and noncluster environments.

The ADARUN parameter ARMNAME is used to identify the element in the ARM 'policy' that is to be activated. Each element specifies when, where, and how often an automatic restart is to be attempted.

If an ARM policy has not been defined, the ARMNAME parameter has no effect.

Top of page

Archive Recovery

Archive recovery occurs if the container data sets of the database are damaged or restart/recovery is not effective.

Archive recovery

The protection logs to be regenerated are the output of the ADARES PLCOPY protection log copy and merge process that occurs in sysplex cluster environments. The restore/regenerate process is the same in both cluster and noncluster environments.

Top of page