Exadata Hard disk or Flash card replacement
To turn off storage cell
first we need to take grid disk offline
Login as celladmin or root and check the status of grid disks using
cellcli -e list griddisk attributes name,asmmodestatus,asmdeactivationoutcome
It should show Online and yes
If it shows dropped then it means there is a bad drive
Replace the drive first
Let it rebalance
To check rebalance status
select INST_ID, OPERATION, STATE, POWER, SOFAR, EST_WORK, EST_RATE, EST_MINUTES from GV$ASM_OPERATION;
Once rebalance complete
run verify the status again
cellcli -e list griddisk attributes name,asmmodestatus,asmdeactivationoutcome
If everything is online and yes
Then take grid disks offline
cellcli -e alter griddisk all inactive
cellcli -e list griddisk
Now you should see offline
Now you can reboot the server
Note: See how long your maintenance takes place
If it is more than 3.6 hours please make sure you change the settings on asm diskgroup so that it will just sync back everything once it comes online
To check the configured time
select dg.name,a.value from v$asm_diskgroup
dg, v$asm_attribute a where dg.group_number=a.group_number and
a.name='disk_repair_time';
To change the time.. Do this to all the diskgroups
ALTER DISKGROUP DATA SET ATTRIBUTE 'DISK_REPAIR_TIME'='8.5H';
After maintenance
Make all grid disks active
cellcli -e alter griddisk all active
check status
cellcli -e list griddisk attributes name, asmmodestatus
You might see some of the disks are syncing.. Give it some time and re check it should be done fairly soon..
There will not be db downtime if you have asm redundancy set to normal or high..
To turn off storage cell
first we need to take grid disk offline
Login as celladmin or root and check the status of grid disks using
cellcli -e list griddisk attributes name,asmmodestatus,asmdeactivationoutcome
It should show Online and yes
If it shows dropped then it means there is a bad drive
Replace the drive first
Let it rebalance
To check rebalance status
select INST_ID, OPERATION, STATE, POWER, SOFAR, EST_WORK, EST_RATE, EST_MINUTES from GV$ASM_OPERATION;
Once rebalance complete
run verify the status again
cellcli -e list griddisk attributes name,asmmodestatus,asmdeactivationoutcome
If everything is online and yes
Then take grid disks offline
cellcli -e alter griddisk all inactive
cellcli -e list griddisk
Now you should see offline
Now you can reboot the server
Note: See how long your maintenance takes place
If it is more than 3.6 hours please make sure you change the settings on asm diskgroup so that it will just sync back everything once it comes online
To check the configured time
select dg.name,a.value from v$asm_diskgroup
dg, v$asm_attribute a where dg.group_number=a.group_number and
a.name='disk_repair_time';
To change the time.. Do this to all the diskgroups
ALTER DISKGROUP DATA SET ATTRIBUTE 'DISK_REPAIR_TIME'='8.5H';
After maintenance
Make all grid disks active
cellcli -e alter griddisk all active
check status
cellcli -e list griddisk attributes name, asmmodestatus
You might see some of the disks are syncing.. Give it some time and re check it should be done fairly soon..
There will not be db downtime if you have asm redundancy set to normal or high..
No comments:
Post a Comment