Symantec NBU Software Routine Maintenance, V1.0.4
NetBackup: Routine Maintenance
Contents
Chapter 1 About the Maintenance Checks
Chapter 2 Adding the NBU Environment Variables
Chapter 3 Locations of the NBU Service Startup and Shutdown Scripts
Chapter 4 Starting and Stopping NBU
Chapter 5 Routine Health Checks and Sample Output
Chapter 6 Moving Tapes Out of and Into the Library
6.1 Removing Tapes from the Tape Library
6.2 Adding Tapes to the Tape Library
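Chapter 1 About the Maintenance Checks
For a business-critical system, data is the core of the entire operation. If data is unexpectedly lost or corrupted because of hardware failure, aging or damaged storage media, operator error, or unforeseeable external factors, the impact on the business can be incalculable. The integrity and reliability of the data storage system, and its overall running state, therefore deserve close attention. The results of a full check should feed into a complete set of tuning and optimization recommendations, so that major losses are avoided even under extreme conditions.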
Chapter 2 Adding the NBU Environment Variables
1. Unix systems
Edit /etc/profile and add:
PATH=$PATH:/usr/openv/netbackup/bin
PATH=$PATH:/usr/openv/netbackup/bin/admincmd
PATH=$PATH:/usr/openv/netbackup/bin/goodies
PATH=$PATH:/usr/openv/volmgr/bin
export PATH
MANPATH=$MANPATH:/usr/openv/man/
export MANPATH
2. Linux systems
Edit ~/.bash_profile of the user who administers NBU (or /etc/profile to apply the change to all users) and add:
PATH=$PATH:/usr/openv/netbackup/bin
PATH=$PATH:/usr/openv/netbackup/bin/admincmd
PATH=$PATH:/usr/openv/netbackup/bin/goodies
PATH=$PATH:/usr/openv/volmgr/bin
export PATH
MANPATH=$MANPATH:/usr/openv/man/
export MANPATH
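3. Windows systems
Open System in Control Panel.
On the Advanced tab, click Environment Variables.
Add a system variable (NETBACKUP, as referenced in the path below) and set it to the NBU installation path, as in the example shown in the figure.
Edit the Path system variable and append the following:
;%NETBACKUP%\bin;%NETBACKUP%\bin\admincmd;%NETBACKUP%\bin\goodies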
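For the Unix and Linux cases, appending to PATH unconditionally leaves duplicate entries whenever the profile is sourced more than once. The following is a minimal sketch of an idempotent variant. The helper name add_to_path is hypothetical and introduced only for illustration; it is not something NBU provides. The directories are the ones listed in this chapter.

```shell
# Append a directory to PATH only if it is not already listed.
# add_to_path is a hypothetical helper name, not part of NBU.
add_to_path() {
    case ":$PATH:" in
        *":$1:"*) ;;              # already present, do nothing
        *) PATH="$PATH:$1" ;;     # otherwise append it
    esac
}

# The NBU directories named in this chapter.
for dir in /usr/openv/netbackup/bin \
           /usr/openv/netbackup/bin/admincmd \
           /usr/openv/netbackup/bin/goodies \
           /usr/openv/volmgr/bin; do
    add_to_path "$dir"
done
export PATH
```

Sourcing the profile a second time then leaves PATH unchanged.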
Chapter 3 Locations of the NBU Service Startup and Shutdown Scripts
1. AIX
/etc/rc.netbackup.aix
/etc/rc.client.netbackup
2. Alpha Tru64
/sbin/rc3.d/S77netbackup
/sbin/rc0.d/K01netbackup
/sbin/init.d/netbackup
3. HP-UX
/sbin/rc2.d/S777netbackup
/sbin/rc1.d/K001netbackup
/sbin/init.d/netbackup
/sbin/rc1.d/K001nbclient
/sbin/rc2.d/S951nbclient
/sbin/init.d/nbclient
4. Linux Red Hat
/etc/rc.d/init.d/netbackup
/etc/rc.d/rc0.d/K01netbackup
/etc/rc.d/rc1.d/K01netbackup
/etc/rc.d/rc2.d/S77netbackup
/etc/rc.d/rc3.d/S77netbackup
/etc/rc.d/rc5.d/S77netbackup
/etc/rc.d/rc6.d/K01netbackup
/etc/rc.d/init.d/nbclient
/etc/rc.d/rc0.d/K01nbclient
/etc/rc.d/rc1.d/K01nbclient
/etc/rc.d/rc2.d/S95nbclient
/etc/rc.d/rc3.d/S95nbclient
/etc/rc.d/rc5.d/S95nbclient
/etc/rc.d/rc6.d/K01nbclient
5. Linux SuSE
/etc/init.d/netbackup
/etc/init.d/rc0.d/K01netbackup
/etc/init.d/rc2.d/S77netbackup
/etc/init.d/rc3.d/S77netbackup
/etc/init.d/rc5.d/S77netbackup
/etc/init.d/rc6.d/K01netbackup
/etc/init.d/nbclient
/etc/init.d/rc0.d/K01nbclient
/etc/init.d/rc2.d/S95nbclient
/etc/init.d/rc3.d/S95nbclient
/etc/init.d/rc5.d/S95nbclient
/etc/init.d/rc6.d/K01nbclient
Chapter 4 Starting and Stopping NBU
1. Stopping NBU on the master server:
/usr/openv/netbackup/bin/bp.kill_all
or
/usr/openv/netbackup/bin/goodies/netbackup stop
2. Starting NBU on the master server:
/usr/openv/netbackup/bin/bp.start_all
or
/usr/openv/netbackup/bin/goodies/netbackup start
3. Stopping NBU on a media server:
/usr/openv/netbackup/bin/bp.kill_all
or
/usr/openv/netbackup/bin/goodies/netbackup stop
4. Starting NBU on a media server:
/usr/openv/netbackup/bin/bp.start_all
or
/usr/openv/netbackup/bin/goodies/netbackup start
5. Checking with bpps that the processes have started:
/usr/openv/netbackup/bin/bpps -x
On Windows:
bpup -v (start the NBU services)
bpdown -v (stop the NBU services)
bpps (list the NBU processes)
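After a start, the bpps listing still has to be read by eye. As a rough sketch, the filter below reports which daemon names are absent from a process listing. It assumes each daemon appears in the listing with its full path (for example /usr/openv/netbackup/bin/bprd), which is an assumption about the bpps output format rather than something stated in this document. The function name missing_daemons is hypothetical.

```shell
# missing_daemons NAME...
# Reads a process listing on stdin and prints every NAME that does not
# appear as the last path component of a command in that listing.
# Assumption: daemons are listed with a full path such as /.../bin/bprd.
missing_daemons() {
    listing=$(cat)                       # read the listing once
    for name in "$@"; do
        # match "/name" followed by a space or the end of the line
        printf '%s\n' "$listing" | grep -Eq "/$name( |\$)" || echo "$name"
    done
}
```

On an NBU host it could be called as: /usr/openv/netbackup/bin/bpps -x | missing_daemons bprd bpdbm ltid vmd. The daemon names in that call are only examples; which daemons should be running depends on the role of the host and the NBU version.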
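Chapter 5 Routine Health Checks and Sample Output
The planning, installation, and configuration work on an NBU project must be carried out rigorously, with the small technical details handled properly. This check found nothing that seriously affects the stable operation of the system. The checks and their scope are listed below:
- Process check
bpps -x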
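- Backup status check
A blue figure icon indicates success, yellow indicates partial success, red indicates failure, and green indicates a backup in progress.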
- Backup catalog integrity check (last 3 months):
bpcatlist -online -since-months 3
This command lists the records of all backup operations held in the NBU catalog.
Sample output:
Backupid         Backup Date           Files  Size   Sched                       Policy        Catarcid  S  C  Files file
dms1_1195384006  Nov 18 11:06:46 2007  1      288k   Default-Application-Backup  dms1rmanfull  0         1  0  dms1rmanfull_1195384006_UBAK.f
dms1_1195383991  Nov 18 11:06:31 2007  1      11.0M  Default-Application-Backup  dms1rmanfull  0         1  0  dms1rmanfull_1195383991_UBAK.f
dms1_1195383802  Nov 18 11:03:22 2007  1      5.0G   Default-Application-Backup  dms1rmanfull  0         1  0  dms1rmanfull_1195383802_UBAK.f
dms1_1195383794  Nov 18 11:03:14 2007  1      8.0G   Default-Application-Backup  dms1rmanfull  0         1  0  FULL.f
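- Host global configuration check:
bpconfig -U
The bpconfig command displays the NetBackup global configuration attributes. These attributes affect the operation of all policies and clients. This configuration meets the project requirements.
Sample output: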
Admin Mail Address:
Job Retry Delay: 30 minutes
Max Simultaneous Jobs/Client: 99
Backup Tries: 2 time(s) in 12 hour(s)
Keep Error/Debug Logs: 28 days
Max drives this master: 0
Keep TrueImageRecovery Info: 1 days
Compress Image DB Files: (not enabled)
Media Mount Timeout: 0 minutes (unlimited)
Shared Media Mount Timeout:0 minutes (unlimited)
Display Reports: 24 hours ago
Preprocess Interval: 4 hours (default)
Maximum Backup Copies: 2
Image DB Cleanup Interval: 12 hours
Policy Update Interval: 10 minutes
- Backup job check
bpdbjobs -summary -U
bpdbjobs interacts with the jobs database. Use it to print the entire jobs database, a summary of the database, and similar information.
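- Backup error event check
bperror -U -d mm/dd/yyyy -e mm/dd/yyyy
bperror displays information from the NetBackup error catalog. The -d and -e options give the start and end dates of the period to report. The sample output has the layout that bperror produces when the -backstat option is added.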
STATUS CLIENT POLICY SCHED SERVER TIME COMPLETED
6 dms1 dms1arch Full nbu_master 11/06/2007 08:12:55
(the backup failed to back up the requested files)
6 dms2 dms2arch Full nbu_master 11/06/2007 08:19:09
(the backup failed to back up the requested files)
0 dms1 dms1test dms1test nbu_master 11/06/2007 08:24:40
0 dms2 dms2test dms2test nbu_master 11/06/2007 08:25:09
0 dms2 dms2arch Default-Ap nbu_master 11/06/2007 08:51:23
0 dms2 dms2arch Default-Ap nbu_master 11/06/2007 08:51:23
0 dms2 dms2arch Default-Ap nbu_master 11/06/2007 08:53:54
0 dms2 dms2arch Default-Ap nbu_master 11/06/2007 08:55:15
0 dms2 dms2arch Full nbu_master 11/06/2007 08:55:42
0 dms2 dms2archlog Default-Ap nbu_master 11/06/2007 09:02:11
0 dms2 dms2archlog Default-Ap nbu_master 11/06/2007 09:02:27
0 dms2 dms2archlog Default-Ap nbu_master 11/06/2007 09:03:37
0 dms2 dms2archlog dms2archlo nbu_master 11/06/2007 09:04:03
0 dms1 dms1arch Default-Ap nbu_master 11/06/2007 09:15:09
0 dms1 dms1arch Default-Ap nbu_master 11/06/2007 09:15:13
0 dms1 dms1arch Default-Ap nbu_master 11/06/2007 09:15:43
0 dms1 dms1arch Full nbu_master 11/06/2007 09:16:10
0 dms1 dms1rmanfull Default-Ap nbu_master 11/06/2007 09:22:21
0 dms1 dms1rmanfull Default-Ap nbu_master 11/06/2007 09:25:21
0 dms1 dms1rmanfull Default-Ap nbu_master 11/06/2007 09:25:35
0 dms1 dms1rmanfull Default-Ap nbu_master 11/06/2007 09:26:13
0 dms1 dms1rmanfull Default-Ap nbu_master 11/06/2007 09:26:47
0 dms1 dms1rmanfull Default-Ap nbu_master 11/06/2007 09:27:54
0 dms1 dms1rmanfull rmanfull nbu_master 11/06/2007 09:28:21
0 nbu_master NBU-Catalogbacku Full nbu_master 11/07/2007 00:01:23
0 nbu_master NBU-Catalogbacku Full nbu_master 11/07/2007 00:01:54
0 nbu_master NBU-Catalogbacku Full unknown 11/07/2007 00:01:58
0 nbu_master NBU-Catalogbacku Full nbu_master 11/08/2007 00:03:48
0 nbu_master NBU-Catalogbacku Full nbu_master 11/08/2007 00:04:40
0 nbu_master NBU-Catalogbacku Full unknown 11/08/2007 00:04:42
0 nbu_master NBU-Catalogbacku Full nbu_master 11/09/2007 00:01:36
0 nbu_master NBU-Catalogbacku Full nbu_master 11/09/2007 00:02:22
0 nbu_master NBU-Catalogbacku Full unknown 11/09/2007 00:02:22
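In a long listing like this one, the few non-zero status lines are easy to miss. The sketch below filters them out, assuming the column layout of the sample above (status first, then client and policy) with one job per line. The function name failed_jobs is hypothetical.

```shell
# failed_jobs
# Reads bperror-style status lines on stdin and prints
# "client policy status" for every job whose status is not 0.
# Header lines and the parenthesised explanation lines are skipped
# because their first field is not a plain number.
failed_jobs() {
    awk '$1 ~ /^[0-9]+$/ && $1 != 0 { print $2, $3, $1 }'
}
```

For the sample output above, this would report the two status 6 jobs (dms1 dms1arch and dms2 dms2arch) and nothing else.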
- NBU configuration check
bpgetconfig -L
bpgetconfig is a helper program for retrieving configuration information.
Client/Master = Master
NetBackup Client Platform = RS6000, AIX5
NetBackup Client Protocol Level = 6.0.0.0.4.4
Product = NetBackup
Version Name = 6.0
Version Number = 600000
NetBackup Installation Path = /usr/openv/netbackup/bin
Client OS/Release = AIX 5.3
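- Backup image information kept for emergency recovery
bpimagelist -U
bpimagelist reports, in the specified format, the catalog images or removable media that match the attributes passed as command options.
Note: use the -policy and -st options to find the media IDs that hold the full and incremental images of critical data.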
- Tape drive cleaning status check
tpclean -L
Sample output:
Drive Name            Type     Mount Time  Frequency  Last Cleaned      Comment
**********            ****     **********  *********  ****************  *******
HP.ULTRIUM3-SCSI.001  hcart3*  0.1         0          N/A
HP.ULTRIUM3-SCSI.000  hcart3*  0.0         0          N/A
IBM.ULTRIUM-TD2.003   hcart2*  0.3         0          N/A
IBM.ULTRIUM-TD2.001   hcart2*  0.3         0          N/A
IBM.ULTRIUM-TD2.000   hcart2*  0.3         0          N/A
IBM.ULTRIUM-TD2.002   hcart2*  0.2         0          N/A
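- Tape media health check
bpmedialist
bpmedialist queries one or more NetBackup media catalogs and produces a NetBackup media status report. The check found that some tape media have been frozen. This does not affect normal backups, and the media state can be reset at a convenient time (for example, on a scheduled maintenance day). If read/write errors continue to appear, replace the tape.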
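- Check that enough tapes are still available
available_media
Some tape media have been frozen. This does not affect normal backups, and the media state can be reset at a convenient time (for example, on a scheduled maintenance day). If read/write errors continue to appear, replace the tape.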
media   media   robot  robot  robot  side/  ret    size      status
ID      type    type   #      slot   face   level  KBytes
----------------------------------------------------------------------------
CE1_FL_POOL_1 pool
000051  HCART3  ACS    0      -      -      9      44562752  ACTIVE
000052  HCART3  ACS    0      -      -      -      -         AVAILABLE
CE1_FL_POOL_2 pool
000053  HCART3  ACS    0      -      -      9      44562752  ACTIVE
000054  HCART3  ACS    0      -      -      -      -         AVAILABLE
CE1_L0_POOL_1 pool
000036  HCART3  ACS    0      -      -      9      67703936  ACTIVE
000035  HCART3  ACS    0      -      -      -      -         AVAILABLE
CE1_L0_POOL_2 pool
000038  HCART3  ACS    0      -      -      9      67703936  ACTIVE
000037  HCART3  ACS    0      -      -      -      -         AVAILABLE
CE1_L1_POOL_1 pool
000055  HCART3  ACS    0      -      -      9      21280     ACTIVE
000056  HCART3  ACS    0      -      -      9      9696640   ACTIVE
000039  HCART3  ACS    0      -      -      9      0         FROZEN
000040  HCART3  ACS    0      -      -      9      0         FROZEN
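To answer the question of whether enough tapes remain, it helps to tally the status column. The following sketch counts media per status. It assumes one media record per line with nine whitespace-separated fields and a numeric robot number in the fourth field, as in the sample above; records in any other layout are ignored. The function name media_status_counts is hypothetical.

```shell
# media_status_counts
# Reads available_media-style output on stdin and prints one
# "STATUS count" line per status value, sorted by status name.
# Header, separator, and pool title lines are skipped because they
# do not have nine fields with a numeric fourth field.
media_status_counts() {
    awk 'NF == 9 && $4 ~ /^[0-9]+$/ { count[$NF]++ }
         END { for (s in count) print s, count[s] }' | sort
}
```

A low AVAILABLE count or a growing FROZEN count is the signal to add or replace tapes.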
- Health check of all media servers in a SAN environment
vmdareq -display
Sample output:
HP.ULTRIUM3-SCSI.000 - AVAILABLE
cec164 SCAN_HOST UP
cec163 UP
cec106 UP
cec104 UP
HP.ULTRIUM3-SCSI.001 - AVAILABLE
cec164 UP
cec163 UP
cec106 SCAN_HOST UP
cec104 UP
IBM.ULTRIUM-TD2.000 - AVAILABLE
cec164 UP
cec163 UP
cec106 SCAN_HOST UP
cec104 UP
IBM.ULTRIUM-TD2.001 - AVAILABLE
cec164 SCAN_HOST UP
cec163 UP
cec106 UP
cec104 UP
IBM.ULTRIUM-TD2.002 - AVAILABLE
cec164 SCAN_HOST UP
cec163 UP
cec106 UP
cec104 UP
IBM.ULTRIUM-TD2.003 - AVAILABLE
cec164 SCAN_HOST UP
cec163 UP
cec106 UP
cec104 UP
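With several drives shared by several hosts, a single host that is not UP is hard to spot. The sketch below lists such hosts per drive. It assumes the layout of the sample above: a drive line of the form "name - state" followed by one line per host ending in that host's state. The function name hosts_not_up is hypothetical, and the state value in the test data (DOWN) is an assumed example.

```shell
# hosts_not_up
# Reads vmdareq-style output on stdin and prints "drive host state"
# for every host line whose last field is not UP.
hosts_not_up() {
    awk '$2 == "-" { drive = $1; next }          # drive header line
         NF > 0 && $NF != "UP" { print drive, $1, $NF }'
}
```

For the sample output above, it would print nothing, since every host is UP for every drive.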
- Tape library device status check
tpconfig -l
Sample output:
Type   Num  Index  Type    DrNum  Status  Comment  Name                  Path
robot  0    -      TLD     -      -       -        -                     cec106
drive  -    2      hcart2  1      UP      -        IBM.ULTRIUM-TD2.003   /dev/rmt2.1
drive  -    3      hcart2  2      UP      -        IBM.ULTRIUM-TD2.001   /dev/rmt3.1
drive  -    4      hcart2  3      UP      -        IBM.ULTRIUM-TD2.000   /dev/rmt4.1
drive  -    5      hcart2  4      UP      -        IBM.ULTRIUM-TD2.002   /dev/rmt5.1
robot  1    -      TLD     -      -       -        -                     cec106
drive  -    0      hcart3  1      UP      -        HP.ULTRIUM3-SCSI.001  /dev/rmt0.1
drive  -    1      hcart3  2      UP      -        HP.ULTRIUM3-SCSI.000  /dev/rmt1.1
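The device listing can be reduced to the drives that need attention. This sketch prints every drive whose status is not UP, assuming each device occupies a single line with the status in the sixth field and the drive name in the eighth, and that the comment field contains no spaces. The function name drives_not_up is hypothetical, and DOWN in the test data is an assumed example value.

```shell
# drives_not_up
# Reads tpconfig-style device lines on stdin and prints
# "drive_name status" for every drive whose status is not UP.
drives_not_up() {
    awk '$1 == "drive" && $6 != "UP" { print $8, $6 }'
}
```

Empty output means all configured drives are UP.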
- Check all policy definitions
bppllist -allpolicies -U
bppllist lists the policies in the NetBackup database.
Sample output:
Policy Name: CE1_FL_POLICY
Policy Type: Standard
Active: yes
Effective date: 12/28/2006 10:01:11
Client Compress: no
Follow NFS Mounts: no
Cross Mount Points: no
Collect TIR info: no
Block Incremental: no
Mult. Data Streams: no
Client Encrypt: no
Checkpoint: no
Policy Priority: 0
Max Jobs/Policy: Unlimited
Disaster Recovery: 0
Collect BMR info: no
Residence: nbu_master-hcart3-robot-acs-0
Volume Pool: CE1_FL_POOL_1
Keyword: (none specified)
HW/OS/Client: RS6000 AIX5 dms1
RS6000 AIX5 dms2
Include: (none defined)
Schedule: CE1_FL_SCH
Type: User Backup
Maximum MPX: 1
Synthetic: 0
PFI Recovery: 0
Retention Level: 9 (infinity) 9 (infinity)
Number Copies: 2
Fail on Error: 0 0
Residence: nbu_master-hcart3-robot-acs-0 nbu_master-hcart3-robot-acs-0
Volume Pool: CE1_FL_POOL_1 CE1_FL_POOL_2
Daily Windows:
Sunday 00:00:00 --> Saturday 23:59:59
------------------------------------------------------------
- Storage unit configuration check
bpstulist -U -show_available
The bpstulist command displays the attributes of NetBackup storage units or storage unit groups.
Sample output:
Label: nbu_master-hcart3-robot-acs-0
Storage Unit Type: Media Manager
Host Connection: nbu_master
Number of Drives: 3
On Demand Only: no
Density: hcart3 (20)
Robot Type/Number: ACS (1) / 0
Max Fragment Size: 1048576
Max MPX/drive: 1
Label: ets1-hcart3-robot-acs-0
Storage Unit Type: Media Manager
Host Connection: ets1
Number of Drives: 3
On Demand Only: no
Density: hcart3 (20)
Robot Type/Number: ACS (1) / 0
Max Fragment Size: 1048576
Max MPX/drive: 1
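- Decide, based on actual conditions and available time, whether to verify the backup data
bpverify
bpverify verifies the contents of one or more backups by reading the backup volumes and comparing their contents with the NetBackup catalog. It does not compare the volume data with the contents of the client disk. It reads every block in the image to confirm that the volume is readable.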
- Volume pool consistency check
vmpool -listall
Lists the volume pools. The check found that they meet the requirements.
Sample output:
================================================================================
pool number: 0
pool name: None
description: the None pool
pool host: ANYHOST
pool user: ANY
pool group: NONE
================================================================================
pool number: 1
pool name: NetBackup
description: the NetBackup pool
pool host: ANYHOST
pool user: 0 (root)
pool group: NONE
================================================================================
pool number: 2
pool name: DataStore
description: the DataStore pool
pool host: ANYHOST
pool user: 0 (root)
pool group: NONE
================================================================================
vmcheckxxx
vmcheckxxx reports the media contents of the tape library and can optionally compare them with the volume configuration.
- Comprehensive NBU system check:
support
The support command checks the overall running state of the system.
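The checks in this chapter are run one at a time. As a rough sketch, they could be collected into one report with a small wrapper. The function names run_check and daily_checks are hypothetical, as are the labels; only commands from this chapter that need no site-specific arguments are included, and the sketch assumes they are on PATH as set up in Chapter 2. A command that is not found is reported as skipped rather than treated as an error.

```shell
# run_check LABEL COMMAND [ARGS...]
# Prints a banner, then the command's output, or a note if the
# command is not in PATH (for example, on a host without NBU).
run_check() {
    label=$1
    shift
    echo "=== $label: $* ==="
    if command -v "$1" >/dev/null 2>&1; then
        "$@" 2>&1
    else
        echo "(skipped: $1 not found in PATH)"
    fi
}

# daily_checks
# Runs a subset of the health checks described in this chapter.
daily_checks() {
    run_check "Processes"       bpps -x
    run_check "Global config"   bpconfig -U
    run_check "Job summary"     bpdbjobs -summary -U
    run_check "Drive cleaning"  tpclean -L
    run_check "Available media" available_media
    run_check "Devices"         tpconfig -l
    run_check "Volume pools"    vmpool -listall
}
```

The report can then be saved per day, for example with daily_checks > /tmp/nbu_check_$(date +%Y%m%d).log, where the log location is only an illustration.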
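Chapter 6 Moving Tapes Out of and Into the Library
6.1 Removing Tapes from the Tape Library
Open the NBU administration console and select MEDIA MANAGEMENT. In the PTL0011-ZH-ARC-OFL or PTL0011-XX-ARC-OFL volume pool, select the tapes whose status is FULL and right-click them:
From the menu, choose EJECT VOLUMES FROM ROBOT:
Click the EJECT button to eject the tapes to the library's media access port, then remove them.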
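6.2 Adding Tapes to the Tape Library
This operation can be used to:
1. Add new tapes to the tape library. The barcode label of a new tape must not duplicate that of any existing tape.
2. Add offline-stored tapes that contain data to be restored.
For offline-stored tapes, the restore interface can be used to look up the tape IDs. Open the NBU Backup, Archive, and Restore interface and select the data to restore:
Click the Preview button:
Place the tapes with the listed IDs, or the new tapes, in the library's media access port.
In the NBU console, select Device and right-click to display the menu:
Choose Inventory Robot:
Select the correct tape library, choose the Update volume configuration operation, and select Empty media access port prior to update. Click the Start button.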