PFW1548

This MAP provides guidance on how to repair a platform firmware hang condition that indicates a boot device problem. These conditions are indicated by a CA00E1Fx code (displayed on the HMC Virtual Control panel) that remains for longer than 10 minutes.

PFW1548 Section-1

Procedure

  1. Did you wait a maximum of 10 minutes for the progress code to change?
    • Yes, continue with the next step.
    • No, wait for the progress code to change, and then continue with the next step.
  2. Use Service Management > Manage Serviceable Events to check whether you have a serviceable event to repair for the failing CEC enclosure. Did you find a serviceable event to repair?
    • Yes, repair the problem by using the serviceable event.
    • No, continue with the next step.
  3. Use the Service Processor ASMI menu to check for event logs that indicate a problem:
    1. Access the ASMI menu. See MAP6F00 Accessing the ASMI.
    2. Select System Service Aids > Error/Event logs.
    3. View any error logs from the current UTC time.
  4. Are there any service processor error logs that indicate a failing FRU?
    • Yes, replace the FRUs using the HMC (use the Storage Facility Management > storage facility > Exchange Parts > Exchange CEC Components > Show FRUs > Exchange FRU option).
    • No, continue with the next step.
  5. Attempt to identify a root cause by performing a slow boot of the CEC enclosure. Use the following steps:
    1. Quiesce all the storage facility LPARs for the failing CEC enclosure as follows:
      1. From the navigation area, click Storage Facility Management > storage facility > SF image.
      2. From the bottom Task area, click Service Utilities > Change/Show LPAR State.
      3. On the Server Control Panel, click Quiesce LPAR.
    2. Power off the failing CEC enclosure as follows:
      1. From the navigation area, click Storage Facility Management > storage facility > Server View > server.
      2. From the bottom Task area, click Service Utilities > Storage System Power control.
      3. On the System Power Control window, click Power Off System.
      4. Wait for the CEC enclosure to power down.
    3. Perform a slow boot of the CEC enclosure. Go to MAP4820 Performing a slow boot of a CEC enclosure.
    4. Monitor the power on and boot of the CEC enclosure using the CEC enclosure control panel and status panels for servers and partitions.
    5. View the serviceable events.
  6. Did the CEC enclosure boot up to normal completion?
    • Yes, go to step 9.
    • No, continue with the next step.
  7. Did the slow boot of the CEC enclosure fail with a serviceable event?
    • Yes, repair the problem by using the serviceable event.
    • No, continue with the next step.
  8. Did the slow boot of the CEC enclosure fail with an SRC displayed on the CEC enclosure control panel or the HMC Virtual Operator panel?
    • Yes, go to Messages and codes, look up the SRC, and take the recommended action.
    • No, contact your next level of support.
  9. Retest the CEC enclosure boot up in normal mode. Perform the following steps:
    1. Power off the CEC enclosure as follows:
      1. From the navigation area, click Storage Facility Management > storage facility > Server View > server.
      2. From the bottom Task area, click Service Utilities > Storage System Power control.
      3. On the System Power Control window, click Power Off System.
      4. Wait for the CEC enclosure to power down.
    2. Power on the CEC enclosure as follows:
      • On the System Power Control window, click Power On System to Ready.
      • Monitor the power on and boot of the CEC enclosure by using the CEC enclosure control panel and status panels for servers and partitions.
  10. Did the CEC enclosure boot up to normal completion?
    • Yes, go to step 11.
    • No, if a serviceable event was created, repair the problem by using the serviceable event. If the same SRN error occurs, contact your next level of support.
  11. Resume the LPAR(s) on the CEC enclosure as follows:
    1. From the navigation area, click Storage Facility Management > storage facility > SF image.
    2. From the bottom Task area, click Service Utilities > Change/Show LPAR State.
    3. On the Server Control Panel, click Resume LPAR.
    4. If the Resume was successful, then this is the end of the repair.