溫馨提示×

溫馨提示×

您好,登錄后才能下訂單哦!

密碼登錄×
登錄注冊×
其他方式登錄
點擊 登錄注冊 即表示同意《億速云用戶服務(wù)條款》

NetApp存儲無法開機(jī)問題處理-(初始化重裝系統(tǒng))

發(fā)布時間:2020-07-08 14:28:13 來源:網(wǎng)絡(luò) 閱讀:4582 作者:qingfenghaha 欄目:系統(tǒng)運(yùn)維

測試環(huán)境:    
原有存儲是兩個獨立控制器+磁盤柜,目前是一個控制器+磁盤柜。開機(jī)啟動時,先開啟擴(kuò)展柜,一分鐘后開啟控制器。發(fā)現(xiàn)系統(tǒng)起不來,經(jīng)過多次嘗試失敗后,決定通過維護(hù)模式進(jìn)入系統(tǒng)進(jìn)行查看。(類似于Windows7的維護(hù)模式一樣)

問題處理:    
開機(jī)boot啟動項,按Ctrl+C命令中斷正常啟動,進(jìn)入到boot menu菜單。    
Starting AUTOBOOT press Ctrl-C to abort...    
Loading X86_64/freebsd/p_w_picpath2/kernel:0x200000/10088648 0xb9f0c8/4301024 Entry at 0x80271e20    
Loading X86_64/freebsd/p_w_picpath2/platform.ko:0xfba000/1990365 0x11a0000/296352 0x11e85a0/273360    
Starting program at 0x80271e20    
NetApp Data ONTAP 8.3.1P2    
Copyright (C) 1992-2015 NetApp.    
All rights reserved.    
Checking boot device filesystem    
** /dev/da0s1    
** Phase 1 - Read and Compare FATs    
** Phase 2 - Check Cluster Chains    
** Phase 3 - Checking Directories    
** Phase 4 - Checking for Lost Files    
69 files, 1011584 free (31612 clusters)    
MARK FILE SYSTEM CLEAN? yes    
MARKING FILE SYSTEM CLEAN    
Retry #1 of 5: /sbin/fsck_msdosfs /dev/da0s1    
Retry #2 of 5: /sbin/fsck_msdosfs /dev/da0s1    
Repaired boot device filesystem    
*******************************    
*                             *    
* Press Ctrl-C for Boot Menu. *    
*                             *    
*******************************     
^CBoot Menu will be available.

WARNING:  The battery is unfit to retain data during a power  
          outage.  This is likely because the battery is    
          discharged but could be due to other temporary    
          conditions.    
          When the battery is ready, the boot process will    
          complete and services will be engaged.    
          To override this delay, press 'c' followed by 'Enter'    
c

CAUTION: Using this appliance without NVRAM  
         battery backup coupled with a power    
         failure condition CAN CAUSE DATA LOSS.    
Are you sure you want to continue (y or n)? y    
Proceeding without NVRAM battery backup.

Please choose one of the following:

(1) Normal Boot.        #正常啟動    
(2) Boot without /etc/rc.   #啟動存儲時,不執(zhí)行/etc/rc設(shè)置參數(shù)    
(3) Change password.      #如果忘記了超級用戶密碼,可以在此修改    
(4) Clean configuration and initialize all disks. #清除配置,初始化所有的磁盤    
(5) Maintenance mode boot.  #進(jìn)入維護(hù)模式,當(dāng)系統(tǒng)進(jìn)不去的時候可以嘗試用維護(hù)模式進(jìn)入    
(6) Update flash from backup config. #從備份配置中升級flash    
(7) Install new software first.    #安裝新的軟件    
(8) Reboot node.         #重啟節(jié)點    
Selection (1-8)? 5    
ixgbe: e1a: ** JUMBOMBUF DEBUG ** switching to large buffers(9k -> 3k): (sz = 5120)!    
ixgbe: e1b: ** JUMBOMBUF DEBUG ** switching to large buffers(9k -> 3k): (sz = 5120)!    
ixgbe: e2a: ** JUMBOMBUF DEBUG ** switching to large buffers(9k -> 3k): (sz = 5120)!    
ixgbe: e2b: ** JUMBOMBUF DEBUG ** switching to large buffers(9k -> 3k): (sz = 5120)!    
Ipspace "iwarp-ipspace" created    
WAFL CPLEDGER is enabled. Checklist = 0x7ff841ff    
add host 127.0.10.1: gateway 127.0.20.1    
5    
    You have selected the maintenance boot option:    
    the system has booted in maintenance mode allowing the    
    following operations to be performed:    
    ?                     acorn            
    acpadmin              aggr             
    cna_flash             disk             
    disk_latency          disk_list        
    disk_mung             disk_shelf       
    diskcopy              disktest         
    dumpblock             environment      
    fcadmin               fcstat           
    fctest                fru_led          
    ha-config             halt             
    help                  ifconfig         
    key_manager           led_off          
    led_on                nv8              
    raid_config           sasadmin         
    sasstat               scsi             
    sesdiag               sldiag           
    storage               stsb             
    sysconfig             systemshell      
    ucadmin               version          
    vmservices            vol              
    vol_db                vsa              
    xortest          
    Type "help <command>" for more details.

    In a High Availablity configuration, you MUST ensure that the    
    partner node is (and remains) down, or that takeover is manually    
    disabled on the partner node, because High Availability    
    software is not started or fully enabled in Maintenance mode.

    FAILURE TO DO SO CAN RESULT IN YOUR FILESYSTEMS BEING DESTROYED

    NOTE: It is okay to use 'show/status' sub-commands such as  
    'disk show or aggr status' in Maintenance mode while the partner is up    
Continue with boot? y    
y    
Ipspace "acp-ipspace" created    
original max threads=40, original heap size=41943040    
bip_nitro Virtual Size Limit=79455027 Bytes    
bip_nitro: user memory=724406272, actual max threads=41, actual heap size=43201331    
WARNING: Giving up waiting for mroot    
Tue Feb 14 07:59:49 UTC 2017    
*> ? #可以看到在維護(hù)模式下支持的命令參數(shù)    
?                   disktest            key_manager         stsb               
acorn               dumpblock           led_off             sysconfig          
acpadmin            environment         led_on              systemshell        
aggr                fcadmin             nv8                 ucadmin            
cna_flash           fcstat              raid_config         version            
disk                fctest              sasadmin            vmservices         
disk_latency        fru_led             sasstat             vol                
disk_list           ha-config           scsi                vol_db             
disk_mung           halt                sesdiag             vsa                
disk_shelf          help                sldiag              xortest            
diskcopy            ifconfig            storage            
*> disk show    
Local System ID: 1575136460

  DISK       OWNER                    POOL   SERIAL NUMBER         HOME                    DR HOME    
------------ -------------            -----  -------------         -------------           -------------           
0a.10.6      sz-3240-02(1575136687)    Pool0  LXW6RH4M              sz-3240-02(1575136687)                        
0b.10.7      sz-3240-02(1575136687)    Pool0  LXW63XYM              sz-3240-02(1575136687)                        
0a.10.2      sz-3240-01(1575136460)    Pool0  LXW72ZGM              sz-3240-01(1575136460)                        
0b.10.5      sz-3240-01(1575136460)    Pool0  LXW1W02M              sz-3240-01(1575136460)                        
0b.10.11     sz-3240-02(1575136687)    Pool0  LXW6364M              sz-3240-02(1575136687)                        
0a.10.8      sz-3240-02(1575136687)    Pool0  LXV3HE7L              sz-3240-02(1575136687)                        
0b.10.9      sz-3240-02(1575136687)    Pool0  LXW5YNSM              sz-3240-02(1575136687)                        
0a.10.4      sz-3240-01(1575136460)    Pool0  LXWT76HL              sz-3240-01(1575136460)                        
0b.10.3      sz-3240-01(1575136460)    Pool0  LXW6ELRM              sz-3240-01(1575136460)                        
0a.10.10     sz-3240-02(1575136687)    Pool0  LXW6DTTM              sz-3240-02(1575136687)                        
0b.10.1      sz-3240-01(1575136460)    Pool0  LXW6R84M              sz-3240-01(1575136460)                        
0a.10.0      sz-3240-01(1575136460)    Pool0  LXV3GV4L              sz-3240-01(1575136460)                        
由上圖可以看到,存儲12塊磁盤被平均分配到了兩個控制器上,由于目前只有一個控制器,所以很有可能系統(tǒng)在另外一個控制器上,而另外一個控制器缺少,導(dǎo)致開機(jī)無法啟動。    
現(xiàn)在手工把所有的磁盤都分配到當(dāng)前控制器上。      
*> disk reassign -s 1575136687 -d 1575136460 
#把1575136687控制器上的磁盤都重新分配給1575136460控制器    
#reassign {-s <old_sysid>} [-d <new_sysid>] [-p <partner_sysid>]- reassign disks from old filer    
Partner node must not be in Takeover mode during disk reassignment from maintenance mode.    
Serious problems could result!!    
Do not proceed with reassignment if the partner is in takeover mode. Abort reassignment (y/n)? n    
After the node becomes operational, you must perform a takeover and giveback of the HA partner node to ensure disk reassignment is successful.    
Do you want to continue (y/n)? y    
Disk ownership will be updated on all disks previously belonging to Filer with sysid 1575136687.    
Do you want to continue (y/n)? y    
Cannot do remote rescan. Use 'run local disk show' on the console of sz-3240-01 for it to scan the newly assigned disks    
Feb 14 08:04:52 [sz-3240-01:diskown.RescanMessageFailed:warning]: Could not send rescan message to sz-3240-01. Use the "disk show" command in nodeshell of sz-3240-01 for it to scan the newly inserted disks.    
*> disk show                               
Local System ID: 1575136460

  DISK       OWNER                    POOL   SERIAL NUMBER         HOME                    DR HOME    
------------ -------------            -----  -------------         -------------           -------------           
0a.10.6   sz-3240-01(1575136460)    Pool0  LXW6RH4M              sz-3240-01(1575136460)                        
0b.10.7   sz-3240-01(1575136460)    Pool0  LXW63XYM              sz-3240-01(1575136460)                        
0a.10.2   sz-3240-01(1575136460)    Pool0  LXW72ZGM              sz-3240-01(1575136460)                        
0b.10.5   sz-3240-01(1575136460)    Pool0  LXW1W02M              sz-3240-01(1575136460)                        
0b.10.11  sz-3240-01(1575136460)    Pool0  LXW6364M              sz-3240-01(1575136460)                        
0a.10.8   sz-3240-01(1575136460)    Pool0  LXV3HE7L              sz-3240-01(1575136460)                        
0b.10.9   sz-3240-01(1575136460)    Pool0  LXW5YNSM              sz-3240-01(1575136460)                        
0a.10.4   sz-3240-01(1575136460)    Pool0  LXWT76HL              sz-3240-01(1575136460)                        
0b.10.3   sz-3240-01(1575136460)    Pool0  LXW6ELRM              sz-3240-01(1575136460)                        
0a.10.10  sz-3240-01(1575136460)    Pool0  LXW6DTTM              sz-3240-01(1575136460)                        
0b.10.1   sz-3240-01(1575136460)    Pool0  LXW6R84M              sz-3240-01(1575136460)                        
0a.10.0   sz-3240-01(1575136460)    Pool0  LXV3GV4L              sz-3240-01(1575136460)                        
現(xiàn)在所有的磁盤都已經(jīng)劃分到現(xiàn)有控制器下了,接下來重新安裝存儲操作系統(tǒng):    
*> halt    
Waiting for PIDS:  624.    
Terminated    
Uptime: 8m46s    
System halting...    
Phoenix TrustedCore(tm) Server    
Copyright 1985-2006 Phoenix Technologies Ltd.    
All Rights Reserved    
BIOS version: 5.3.0    
Portions Copyright (c) 2007-2014 NetApp, Inc. All Rights Reserved

CPU = 1 Processors Detected, Cores per Processor = 4  
Intel(R) Xeon(R) CPU           L5410  @ 2.33GHz    
Testing RAM    
512MB RAM tested    
8192MB RAM installed    
6144 KB L2 Cache    
System BIOS shadowed    
USB 2.0: MICRON eUSB DISK    
BIOS is scanning PCI Option ROMs, this may take a few seconds...    
+++++++++++++++++++    
Boot Loader version 3.6    
Copyright (C) 2000-2003 Broadcom Corporation.    
Portions Copyright (C) 2002-2014 NetApp, Inc. All Rights Reserved.    
CPU Type: Intel(R) Xeon(R) CPU           L5410  @ 2.33GHz    
機(jī)器起來后,要手工啟動存儲的系統(tǒng):    
LOADER-A> boot_ontap    
Loading X86_64/freebsd/p_w_picpath2/kernel:0x200000/10088648 0xb9f0c8/4301024 Entry at 0x80271e20    
Loading X86_64/freebsd/p_w_picpath2/platform.ko:0xfba000/1990365 0x11a0000/296352 0x11e85a0/273360    
Starting program at 0x80271e20    
NetApp Data ONTAP 8.3.1P2    
Copyright (C) 1992-2015 NetApp.    
All rights reserved.    
*******************************    
*                             *    
* Press Ctrl-C for Boot Menu. *    
*                             *    
*******************************     
^CBoot Menu will be available.

WARNING:  The battery is unfit to retain data during a power  
          outage.  This is likely because the battery is    
          discharged but could be due to other temporary    
          conditions.    
          When the battery is ready, the boot process will    
          complete and services will be engaged.    
          To override this delay, press 'c' followed by 'Enter'    
c

CAUTION: Using this appliance without NVRAM  
         battery backup coupled with a power    
         failure condition CAN CAUSE DATA LOSS.    
Are you sure you want to continue (y or n)? y    
Proceeding without NVRAM battery backup.

Please choose one of the following:

(1) Normal Boot.  
(2) Boot without /etc/rc.    
(3) Change password.    
(4) Clean configuration and initialize all disks.    
(5) Maintenance mode boot.    
(6) Update flash from backup config.    
(7) Install new software first.    
(8) Reboot node.    
Selection (1-8)? 4    
ixgbe: e1a: ** JUMBOMBUF DEBUG ** switching to large buffers(9k -> 3k): (sz = 5120)!    
ixgbe: e1b: ** JUMBOMBUF DEBUG ** switching to large buffers(9k -> 3k): (sz = 5120)!    
ixgbe: e2a: ** JUMBOMBUF DEBUG ** switching to large buffers(9k -> 3k): (sz = 5120)!    
ixgbe: e2b: ** JUMBOMBUF DEBUG ** switching to large buffers(9k -> 3k): (sz = 5120)!    
Ipspace "iwarp-ipspace" created    
WAFL CPLEDGER is enabled. Checklist = 0x7ff841ff    
add host 127.0.10.1: gateway 127.0.20.1    
Zero disks, reset config and install a new file system?:    
Please answer yes or no    
Zero disks, reset config and install a new file system?: yes    
This will erase all the data on the disks, are you sure?: y    
Rebooting to finish wipeconfig request.    
Waiting for PIDS:  615.    
Skipped backing up /var file system to CF.    
Terminated    
.    
Uptime: 3m13s    
System rebooting...

Phoenix TrustedCore(tm) Server    
Copyright 1985-2006 Phoenix Technologies Ltd.    
All Rights Reserved    
BIOS version: 5.3.0    
Portions Copyright (c) 2007-2014 NetApp, Inc. All Rights Reserved

CPU = 1 Processors Detected, Cores per Processor = 4  
Intel(R) Xeon(R) CPU           L5410  @ 2.33GHz    
Testing RAM    
512MB RAM tested    
8192MB RAM installed    
6144 KB L2 Cache    
System BIOS shadowed    
USB 2.0: MICRON eUSB DISK    
BIOS is scanning PCI Option ROMs, this may take a few seconds...    
+++++++++++++++++++    
Boot Loader version 3.6    
Copyright (C) 2000-2003 Broadcom Corporation.    
Portions Copyright (C) 2002-2014 NetApp, Inc. All Rights Reserved.

CPU Type: Intel(R) Xeon(R) CPU           L5410  @ 2.33GHz

Starting AUTOBOOT press Ctrl-C to abort...  
Loading X86_64/freebsd/p_w_picpath2/kernel:0x200000/10088648 0xb9f0c8/4301024 Entry at 0x80271e20    
Loading X86_64/freebsd/p_w_picpath2/platform.ko:0xfba000/1990365 0x11a0000/296352 0x11e85a0/273360    
Starting program at 0x80271e20    
NetApp Data ONTAP 8.3.1P2    
Copyright (C) 1992-2015 NetApp.    
All rights reserved.    
*******************************    
*                             *    
* Press Ctrl-C for Boot Menu. *    
*                             *    
*******************************     
Wipe filer procedure requested.

WARNING:  The battery is unfit to retain data during a power  
          outage.  This is likely because the battery is    
          discharged but could be due to other temporary    
          conditions.    
          When the battery is ready, the boot process will    
          complete and services will be engaged.    
          To override this delay, press 'c' followed by 'Enter'    
c

CAUTION: Using this appliance without NVRAM  
         battery backup coupled with a power    
         failure condition CAN CAUSE DATA LOSS.    
Are you sure you want to continue (y or n)? y    
Proceeding without NVRAM battery backup.    
ixgbe: e1a: ** JUMBOMBUF DEBUG ** switching to large buffers(9k -> 3k): (sz = 5120)!    
ixgbe: e1b: ** JUMBOMBUF DEBUG ** switching to large buffers(9k -> 3k): (sz = 5120)!    
ixgbe: e2a: ** JUMBOMBUF DEBUG ** switching to large buffers(9k -> 3k): (sz = 5120)!    
ixgbe: e2b: ** JUMBOMBUF DEBUG ** switching to large buffers(9k -> 3k): (sz = 5120)!    
original max threads=40, original heap size=41943040    
bip_nitro Virtual Size Limit=80844390 Bytes    
bip_nitro: user memory=742682624, actual max threads=42, actual heap size=44459622    
Ipspace "iwarp-ipspace" created    
WAFL CPLEDGER is enabled. Checklist = 0x7ff841ff    
add host 127.0.10.1bootarg.bootmenu.selection is |4a|    
: gateway 127.0.20.1    
..............................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................    
接下來就漫長的等待了,初始化的時候是所有的硬盤同時做條帶化,與硬盤數(shù)目多少無關(guān),只與硬盤容量和轉(zhuǎn)數(shù)相關(guān)。    
重啟完成后,就會進(jìn)入到初始化配置界面,包括集群設(shè)置、IP地址設(shè)置等等(后面會介紹,盡情期待)

向AI問一下細(xì)節(jié)

免責(zé)聲明:本站發(fā)布的內(nèi)容(圖片、視頻和文字)以原創(chuàng)、轉(zhuǎn)載和分享為主,文章觀點不代表本網(wǎng)站立場,如果涉及侵權(quán)請聯(lián)系站長郵箱:is@yisu.com進(jìn)行舉報,并提供相關(guān)證據(jù),一經(jīng)查實,將立刻刪除涉嫌侵權(quán)內(nèi)容。

AI