Rebuilding failed after replacing broken disk on RAID 1

Post by Moon1216 » 2014/03/10 08:15:48

Hello!
I would like some advice on problems I'm having with rebuilding RAID 1 disks.

Problem 1)
I found a failed disk on server A and replaced it. Since the disks were configured as RAID 1, I thought the array would rebuild automatically.
The rebuild started and seemed okay up to 20%, but after it reached 20%, as shown below,

# mpt-status -n
ioc:0 vol_id:0 type:IM raidlevel:RAID-1 num_disks:2 size(GB):135 state: DEGRADED flags: ENABLED RESYNC_IN_PROGRESS
ioc:0 phys_id:0 scsi_id:1 vendor:IBM-ESXS product_id:ST9146802SS revision:B526 size(GB):136 state: ONLINE flags: NONE sync_state: 20 ASC/ASCQ:0x11/0x00 SMART ASC/ASCQ:0xff/0xff
ioc:0 phys_id:1 scsi_id:4 vendor:IBM-ESXS product_id:ST9600205SS revision:B552 size(GB):558 state: ONLINE flags: OUT_OF_SYNC sync_state: 20 ASC/ASCQ:0x11/0x00 SMART ASC/ASCQ:0xff/0xff
scsi_id:0 20%
scsi_id:1 20%

it went back to 0% and started rebuilding again :(

# mpt-status -n
ioc:0 vol_id:0 type:IM raidlevel:RAID-1 num_disks:2 size(GB):135 state: DEGRADED flags: ENABLED RESYNC_IN_PROGRESS
ioc:0 phys_id:0 scsi_id:1 vendor:IBM-ESXS product_id:ST9146802SS revision:B526 size(GB):136 state: ONLINE flags: NONE sync_state: 0 ASC/ASCQ:0x11/0x00 SMART ASC/ASCQ:0xff/0xff
ioc:0 phys_id:1 scsi_id:4 vendor:IBM-ESXS product_id:ST9600205SS revision:B552 size(GB):558 state: ONLINE flags: OUT_OF_SYNC sync_state: 0 ASC/ASCQ:0x11/0x00 SMART ASC/ASCQ:0xff/0xff
scsi_id:0 0%
scsi_id:1 0%

I looked at the logs and can see this message: mptbase: ioc0: Initiating recovery
I'm not sure why the rebuild is failing. Could someone help with this issue?
Here is my environment.
- Server : IBM HS21
- Disks : 146GB / 600GB (different sizes)
- OS : CentOS release 5.5 (Final)
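
To pin down exactly when the resync restarts, the progress can be logged over time and compared against the timestamps of the mptbase messages in the kernel log. A minimal sketch, assuming the `mpt-status -n` output keeps the field layout shown above; the function name `sync_states` is only illustrative and is not part of mpt-status.

```shell
#!/bin/sh
# Read `mpt-status -n` output on stdin and print "scsi_id sync_state"
# for each physical disk line. Assumes the field layout shown above
# (scsi_id:N followed by vendor:..., and a "sync_state: N" field).
sync_states() {
    sed -n 's/.*scsi_id:\([0-9]*\) vendor.*sync_state: \([0-9]*\).*/\1 \2/p'
}
```

It would be used as `mpt-status -n | sync_states`, for example once a minute together with `date`, so that the moment the percentage drops back to 0 can be matched to a log entry.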


Problem 2)
Server B has almost the same problem as in problem 1.
After replacing one broken disk, this server can't find a boot image, even though the disks were in RAID 1 and the other disk was okay.
Do I have to reinstall the OS in this case? Is there any way to fix it without reinstalling the OS?

Thanks a lot!

Re: Rebuilding failed after replacing broken disk on RAID 1

Post by roklebor » 2014/03/10 17:10:18

As for server B:

Boot into a LiveCD, configure the network if necessary, then run the command below and post the result here.

dd if=/dev/sda bs=1 skip=$((16#140)) count=192 2>/dev/null | xxd -g1
*EDIT: I adjusted the "skip" and "count" values in the command.
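
For reference, `16#140` is bash notation for hexadecimal 140, which is 320, so the command dumps bytes 320 through 511 of the first sector. In a classic MBR layout that range covers the end of the boot code, the partition table (bytes 446 to 509) and the boot signature in the last two bytes. A quick sketch that checks only the signature, assuming a classic MBR and that `od` is available on the LiveCD; the function name `mbr_signature` is hypothetical.

```shell
#!/bin/sh
# Print the two bytes at offsets 510-511 of a disk or image file as hex.
# A classic MBR ends in "55aa". This only reads; nothing is written.
mbr_signature() {
    dd if="$1" bs=1 skip=510 count=2 2>/dev/null | od -An -tx1 | tr -d ' \n'
}
```

For example, `mbr_signature /dev/sda` should print 55aa if the first sector still carries a boot signature. It says nothing about whether the boot code or partition table are intact, which is what the full dump is for.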
