by Ron Herardian
©1999 Global System Services Corporation (GSS)
Overview
Backing up Domino is often a challenge. Betting good
backups is not just a matter of slapping a DAT on each server machine.
I want to look at the problem of backups from a low-level technical
perspective then go into some system design considerations for backup
and restore of Domino servers.
The basic problem with backing up Domino databases
is backing up open files. This is an issue for every database system
and for every major backup system. There are 4 basic options for backing
up a database and they apply to most database systems as well as to
Domino. DBAs usually understand the problem better than Domino administrators.
Remember that cc:Mail is a database system much more than it is a data
communications system.
The simplest ways to handle open database files are
to either shut down and backup or to do a brute force read of the database
without shutting down. A superior approach is to use a backup system
that can handle open files. ArcServe and other products, for example,
supports backing up of various database systems. Most network backup
systems support backup of open files or provide agents designed for
compatibility with specific database systems.
There are 4 basic ways to backup open files:
1. LINEAR READ: The issue is that most copy
and backup utilities assume that a file that is already open should
not be copied or backed up because there is no way to guarantee the
integrity of data within a single file or across files within a set
of related files. However, there are backup programs and utilities that
ally ‘hot’ copies of live database files. This is the most crude backup
method and it brings with it two basic limitations.
It is technically possible to simply read open files
linearly without regard to ongoing changes (open the files in sharing
mode). The problem with this approach is that database files in the
backup may be inconsistent both internally and with each other in a
set of related files due to ongoing changes during the backup. In other
words the backup process can open and read a file which is being modified
but as the backup proceeds through the file later portions of the file
may be inconsistent with earlier portions due to live modifications.
Also, when a set of files contains references across files the file
set may be inconsistent across files when each file is read in turn
while all of the files are being modified.
Depending on the value and the nature of the data as
well as on the availability of tools to reconcile differences within
and between files the linear read backup method may be acceptable. However,
most database systems contain critical data and low-level database analysis
and repair tools are often unavailable or limited. Oracle, for example,
is not tolerant such inconsistencies resulting from hot backups and
may not be repairable upon restore. (the accepted method is to fall
back to a known good version of the database and reapply logged transactions
to produce a reconstructed version of the data up to the minute of a
failure).
2. OPEN FILE MANAGERS: The second way to backup
open files is through true open file management in the backup system.
This means that the backup system will wait to start the backup of a
given file until all write operations have completed. Once there are
no writes pending the backup will take a snapshot of the file system
data for that file and begin backing up. Systems such as Veritas provide
this type of advanced functionality.
In this case, as changes to the open file continue
during the backup, the open file manager intercepts write calls to the
file being backed up and stores any changed records or sectors of the
file in a temporary location. As the backup proceeds any changed records
are read from the temporary backup so that the backup system ultimately
produces a copy of the open file exactly as it was when no writes were
pending. This method ensures internal file consistency but not consistency
across files in a set of related files. Using this type of backup technology
it is still necessary to run repair and maintenance procedures before
a restored database back online. However, this method is adequate for
most Domino databases.
More sophisticated agents are available from some backup
vendors that are capable or ensuring integrity across files by applying
the same open file management logic, at one time, to a complete set
of files. In this case the backup software, in effect, takes a ‘snapshot’
of the filesystem then logs changes as it proceeds to back up the filesystem
fetching data from a temporary cache for any files that are modified
during the backup. On a busy server, however, this may require a brief
shutdown of server processes to bring the filesystem to a known state
before starting the backup (this functionality is available with some
versions of the HP/UX operating system).
3. CLUSTERING: A more sophisticated approach
is to use clustering, e.g., NetWare SFT 3 or Domino clustering, then
break the cluster and backup the secondary server (and when finished,
reestablish the cluster. While this is a technically excellent solution
that eliminates a variety of problems it requires a cluster of servers
which means at least one additional server machine with identical storage
resources.
A refinement to this solution for Domino is to ‘archive’
data to a cluster server maintaining a lesser amount of data on a primary
server (this can be done through Domino replication and server-based
agents). This makes it possible to bring the primary server back on
line quickly since it will have smaller databases to process.
Although clustering is being recommend here as a backup
and archiving solution the larger benefit is high availability (H/A)
and fail-over should the primary server fail.
4. SHUT DOWN AND BACKUP: The final, and simplest,
method is well known to many administrators. Domino administrators often
automate server shutdown and backup with scripts such as (for Windows
NT Server):
net stop lotus notes server
<backup command here>
net start lotus notes server
NETWORK BACKUP AND RESTORE: Large systems often
demand network backups using tools like NetBackup, Enterprise ArcServe,
and IBM’s ADSM. However, while these systems work well on high-speed
server backbones they do not work well over a WAN. More to the point,
restore of several gigabytes of data over typical WAN link is unacceptably
slow.
SUMMARY: Backing up Domino presents a
set of technology issues and backup methodologies similar to other database
systems. The crude solution is to shut down the server, back up, and
bring the server back online but this is unacceptable where 7x24 up
time is required. The most elegant solutions are sophisticated backup
software capable of open file management and an investment in additional
hardware for clustering.
Global System Services Corporation (GSS) is the leading
provider of consulting and professional services for large-scale and
distributed infrastructure systems such as email and messaging, directory
services, groupware, and wireless solutions. GSS customers include Fortune
500 companies, large services providers and telecom companies, government
agencies, major messaging product vendors, and innovative technology
startups.
GSS provides a complementary suite of services including
strategic technology consultation and competitive vendor and product
analysis, product and system architecture and design, system development
deployment, customization, and testing, technical support, email migration,
and other IT services. GSS has been directly responsible for some of
the largest global systems and solutions and counts as customers many
of the largest companies in the world.
From its offices in the Silicon Valley California, GSS delivers services and solutions
to customers worldwide through a network of mobile consultants and qualified
GSS Affiliates. With industry certified professionals on staff, GSS
is a Qualified
Lotus Business Partner, a Certified
Microsoft Solution Provider (MCSP), a Principal Partner in the Sun Partner Advantage program and a member of the Sun Software Partner Council, as well as a member of key industry organizations.