Thursday, September 19, 2013

Exadata: How to Run "ipmitool bmc reset cold" on a Particular Cell (OR) to reset ILOM ?

Applies to Oracle Engineered Systems.

1. Please try to run "ipmitool bmc reset cold" on this particular cell.

Please run ipmitool bmc reset cold for all the cells or just on the cell where the temperature is more.

After ILOM is started up again, check the ILOM alert log for a record of this cold reset.  This reset should get the new thresholds to be used instead of the system default thresholds.

e.g.,

[root@cel03 ~]# date
Thu Jul 19 09:15:54 IDT 2012

[root@cel03 ~]# uname -a
Linux cel03.co.il 2.6.18-274.18.1.0.1.el5 #1 SMP Thu Feb 9
19:07:16 EST 2012 x86_64 x86_64 x86_64 GNU/Linux

[root@cel03 ~]# ipmitool bmc reset cold
Sent cold reset command to MC

[root@cel03 ~]# cellcli
CellCLI: Release 11.2.3.1.0 - Production on Thu Jul 19 09:23:56 IDT 2012
...
Cell Efficiency Ratio: 6

CellCLI> list threshold where name = 'CL_TEMP.cl_threshold' detail
name:               CL_TEMP.cl_threshold
comparison:         >
critical:           37.0
warning:           35.0



Reference / Read More:

ILOM reference is located at:
http://docs.oracle.com/cd/E19860-01/E21549/index.html

Please monitor the system for temperature alerts after this reset to see if alerts continue to trigger regardless of the current settings.


2. Please refer to Temperature and Humidity Requirements, and Ventilation and Cooling Requirements sections in chapter 2 Site Requirements for Oracle Exadata Database Machine and Oracle Exadata Storage Expansion Rack.

Oracle® Exadata Database Machine Owner's Guide
11g Release 2 (11.2)
Part Number E13874-24

3. Following commands will reboot/recycle cell.

First, try cold reset method:

#ipmitool bmc reset cold

If above method fails to reset, try different path/interface method:

#ipmitool sunoem cli 'reset -script /SP'

If both the above methods fails to bring the ILOM back to normal, then MS will send cell alert/ASR message. 

Info: "ILOM has stopped responding, and did not reset after issuing reset commands"


You can try to reset the ILOM manually.

Action:

"Manual intervention is necessary to power cycle the ILOM. Use SSH to connect to the ILOM from this cell or another machine.
At the ILOM prompt, enter 'reset /SP'.  
If unable to connect using SSH, then try resetting ilomserver by login to ILOM/Remote console (Go to tab Maintenance -> ResetSP -> and click on 'ResetSP' button).
If that also doesn't help, then unplug the ILOM power supply. This action power cycles the server as well as the ILOM."

You can also reset the ILOM in any of the following ways:

1) On the cell:

  CELLCLI> alter cell restart bmc

2) On the compute node:

  ipmitool bmc reset cold

3) Reset SP in Web interface:

  Maintenance -> Reset SP

Thanks.

1 comment:

  1. Casino Site Review - Lucky Club
    The online casino site is filled with players from around the world and is licensed by the Gambling Commission. The platform provides a whole new range luckyclub of casino Bonus: 100% up to €100Withdrawal: Within 24 hoursMinimum Deposit: €20

    ReplyDelete