Please see my other blog for Oracle EBusiness Suite Posts - EBMentors

Search This Blog

Note: All the posts are based on practical approach avoiding lengthy theory. All have been tested on some development servers. Please don’t test any post on production servers until you are sure.

Thursday, November 27, 2014

Exadata: Using Integrated Lights Out Manager (ILOM) CLI

What is ILOM?
Oracle ILOM is integrated service processor hardware and software that is preinstalled on Oracle servers, including the storage and database servers in Oracle Exadata. The service processor runs its own embedded operating system and has a dedicated Ethernet port to provide out-of-band server monitoring and management capabilities. Oracle ILOM can be accessed via a browser-based web interface or a command-line interface, and it also provides an SNMP interface and IPMI support.
The Oracle ILOM service processor boots automatically when a power cable is connected to your Oracle Sun server. Oracle ILOM is shipped from the factory with a pre configured root user account and a password. You should use the preconfigured account for initial login and user account setup. 
To prevent unauthorized access to your system, you should change the preconfigured root account and password on each service processor (SP) or chassis monitoring module (CMM).


What is ILOM used for?
ILOM enables you to actively manage and monitor nodes independently of the operating system; providing you with a reliable way to manage Exadata without direct access.
  • Using ILOM, you can proactively manage the following:
  • Identify hardware error and faults
  • Remotely control the power of the node
  • View the graphical and non-graphical console of the host
  • View current status of sensors and indicators of the system
  • Identify the hardware configuration of the system
  • Receive alerts that are generated about system events
Addresses for the ILOM
Addresses for any of the Exadata components (eg; SCAN, name server,compute or cell nodes) can be found in the “dbm.dat” file. This file is located under /opt/oracle.Support/onecommand. In order to access the “dbm.dat” file, you will need root access.
$ cd /opt/oracle.Support/onecommand
$ cat dbm.dat

# Iloms

ILOM "Ilom" pk3-iub-rp-ilo01 192.168.159.16
ILOM "Ilom" pk3-iub-rp-ilo02 192.168.159.17
ILOM "Ilom" pk3-iub-cel-ilo01 192.168.159.18
ILOM "Ilom" pk3-iub-cel-ilo02 192.168.159.19
ILOM "Ilom" pk3-iub-cel-ilo03 192.168.159.20


Usage Reference 
ILOM uses a hierarchical namespace. It contains a predefined tree /SP namespace manages the ILOM. e.g. to manage users and clock settings. The SP namespace is generic over all platforms using the ILOM. /SYS namespace manages the host system. e.g. change the host state, read sensor information, and access other information for managed system hardware. The SYS namespace is unique to each platform.

1- Adding SSH TO ILOM server (From DB node), Connecting to  ILOM
[root@pk3-iub-rp-od01 ~]# ssh root@192.168.159.16
The authenticity of host '192.168.159.16 (192.168.159.16)' can't be established.
RSA key fingerprint is 84:99:65:92:e0:a7:b9:75:68:f1:ce:ac:97:5a:7e:29.
Are you sure you want to continue connecting (yes/no)? yes
Warning: Permanently added '192.168.159.16' (RSA) to the list of known hosts.
Password:

Oracle(R) Integrated Lights Out Manager

Version 3.1.2.10 r74387

Copyright (c) 2012, Oracle and/or its affiliates. All rights reserved.

->

2- You can version details by running the version command from the shell prompt  

-> version
SP firmware 3.1.2.10
SP firmware build number: 74387
SP firmware date: Tue Jun 19 15:08:47 EDT 2012
SP filesystem version: 0.1.23

3- Display help information about commands and targets
-> help
The help command is used to view information about commands and targets

Usage: help [-format wrap|nowrap] [-o|-output terse|verbose]
[|legal|targets|| ]

Special characters used in the help command are
[]   encloses optional keywords or options
<>   encloses a description of the keyword
     (If <> is not present, an actual keyword is indicated)
|    indicates a choice of keywords or options

help              displays description if this target and its
properties
help   displays description of this property of this target
help targets               displays a list of targets
help legal                 displays the product legal notice

Commands are:
cd
create
delete
dump
exit
help
load
reset
set
show
start
stop
version

->

4- Creating your own user for ILOM
-> create /SP/users/orailom
Creating user...
Enter new password: *******
create: Password length must be between 8 and 16 characters
Enter new password: ********
Enter new password again: ********  (welcome1)

Created /SP/users/orailom
5- Grant the role
-> set /SP/users/orailom role=aucro
Set 'role' to 'aucro'

Note – aurco is equivalent to the setting the Administrator role profile option in the web interface.

- To monitor the system fault state from a server SP
-> show /SYS fault_state

  /SYS
    Properties:
        fault_state = OK

6- To view a tabular output of faulty system targets, properties, and values

-> show faulty
Target              | Property               | Value
--------------------+------------------------+----------------------

->


7- To clear the state of a faulty component (after fixing or replacing the faulted component

-> set component_path clear_fault_action=true
Are you sure you want to clear component_path (y/n)? y
Set ‘clear_fault_action’ to ‘true’

Where component_path is one of the following faulted components:
■ Host CPU (/SYS/MB/P#)
■ Memory Riser (/SYS/MB/P0/MR#)
■ DIMM (/SYS/MB/P0/MR0/D#)
■ Motherboard (/SYS/MB)
■ Fan module (/SYS/FM#)
■ Power supply (/SYS/PS#)
■ CMM (/CH/CMM)
■ NEM (/CH/NEM#)
■ PCI card (SYS/MB/PCIE#)

Example

-> set /SYS/MB/P0 clear_fault_action=true
Are you sure you want to clear /SYS/MB/P0 (y/n)? y
Set ‘clear_fault_action’ to ‘true’

8- To view the system event log entries from the CLI
-> show /SP/logs/event/list

Event
ID     Date/Time                 Class     Type      Severity
-----  ------------------------  --------  --------  --------
229    Thu Nov  6 13:29:26 2014  IPMI      Log       minor
       ID =   9b : 11/06/2014 : 13:29:26 : System Firmware Progress : SMI Handle
       r : System boot initiated : Asserted
228    Thu Nov  6 13:28:54 2014  IPMI      Log       minor
       ID =   9a : 11/06/2014 : 13:28:54 : System Firmware Progress : SMI Handle
       r : Option ROM initialization : Asserted

9- To clear all system event log entries
-> set /SP/logs/event clear=true

10- To view the total system power consumptions from the CLI

-> show /SP/powermgmt actual_power

  /SP/powermgmt
    Properties:
        actual_power = 222

OR

-> show /SYS/VPS

 /SYS/VPS
    Targets:
        history

    Properties:
        type = Power Unit
        ipmi_name = /SYS/VPS
        class = Threshold Sensor
        value = 210.750 Watts
        upper_nonrecov_threshold = N/A
        upper_critical_threshold = N/A
        upper_noncritical_threshold = 750.000 Watts
        lower_noncritical_threshold = N/A
        lower_critical_threshold = N/A
        lower_nonrecov_threshold = N/A
        alarm_status = cleared

    Commands:
        cd
        show

11- To view the sum of power allocated to all components in the system
-> show /SP/powermgmt/budget

 /SP/powermgmt/budget
    Targets:

    Properties:
        activation_state = disabled
        status = ok
        powerlimit = 740 (watts)
        timelimit = default (10 seconds)
        violation_actions = none
        min_powerlimit = 128
        pendingpowerlimit = 740 (watts)
        pendingtimelimit = default
        pendingviolation_actions = none
        commitpending = (Cannot show property)

    Commands:
        cd
        set
        show

12- To reboot the host server power state
-> reset /SYS

13- To immediately turn off the power to the remote host server,
-> stop -force /SYS

14- To shut down the OS gracefully prior to powering off the host server
-> stop /SYS

15- To turn on full power to the remote host server
-> start /SYS

16- To reset the Oracle ILOM server SP
If you need to reset your Oracle ILOM service processor (SP), you can do so without affecting the host OS. However, resetting the SP disconnects your current Oracle ILOM session and renders the SP unmanageable during reset.

-> reset /SP

No comments: