MAP4070 CEC memory problem

CEC memory has been detected as not being fully available.

MAP4070 Section-1

Procedure

  1. Are you in the middle of a memory upgrade MES that changes the amount of memory in the CEC enclosures?
    • Yes, go to the next step.
    • No, go to step 4.
  2. Have you just finished upgrading the memory in the first CEC enclosure?
    • Yes, go to the next step.
    • No, go to step 4.
  3. The memory size in the first CEC has been upgraded and the memory size in the second CEC has not. There is a memory mismatch between the two CECs.
    Find the SRC in the serviceable event that sent you here.
    • Read the MES to determine whether a memory mismatch error is expected and should be ignored.
    • Read the SRC definition to see whether it says that there is a memory mismatch. This might be normal.
    • Display open serviceable events to see whether there are any other serviceable events that include memory DIMMs or a memory card in the FRU list. If there are, use that serviceable event to repair the problem.
    • Call the next level of support for help.
  4. Is the CEC being repaired in a 242x Model 961?
    • Yes, go to MAP4070 Section-2.
    • No, go to MAP4070 Section-3.

MAP4070 Section-2

About this task

You are isolating a memory problem for a 242x Model 961.

Procedure

  1. Display open serviceable events for the CEC enclosure. Other than the serviceable event that sent you to this MAP, is there any other related open serviceable event that lists the memory DIMMs or memory cards as FRUs?
    • Yes, exit this MAP and repair the related serviceable event. If the repair is successful, remember to close the serviceable event that sent you here. If you were doing a memory upgrade MES, exit this MAP and continue the MES now.
    • No, go to the next step.
  2. Does the serviceable event FRU list that sent you here include one or more memory DIMM locations to be replaced?
    • Yes, exit this MAP and return to the FRU list. Replace the listed memory DIMMs. If it still fails, return here and continue to the next step.
    • No, go to the next step.
  3. Log in to the ASM menu for the failing CEC enclosure. Type admin in the User ID field and admin2107 in the Password field. If the login fails, log in as admin with a password of admin210. See MAP6F10 Accessing the ASMI using the management console.
  4. Display the error and event logs to determine any related problems.
    1. Select System Service Aids > Error/Event logs.
    2. Display the log details and look for information that is related to memory DIMM errors or locations.
    3. Are there any logs that identify a failing memory DIMM or memory card?
    • Yes, if you can determine the failing DIMMs, exit this MAP and use Storage Facility Management > storage facility > Exchange Parts to replace the DIMMs.
    • No, go to the next step.
  5. Display the memory deconfiguration status.
    1. Select System Configuration > Hardware Deconfiguration > Memory Deconfiguration. The total memory, configured memory, and deconfigured memory are displayed.
    2. Is there any deconfigured memory?
    • Yes, go to step 7.
    • No. It is possible to have no deconfigured memory and still be missing memory capacity. For example, if one or more memory DIMMs were left unplugged when the CEC was powered on, there might be no deconfigured memory, but the total memory and configured memory capacities would be less than normal. The CEC does not know how much memory capacity it should have and displays only what it finds during power-on. Go to the next step.
  6. Compare the total memory and configured memory capacities for the failing CEC enclosure to the working CEC enclosure.
    Are there any differences between the capacities of the CEC enclosures?
    • Yes, go to the next step.
    • No, exit this MAP and contact your next level of support. The serviceable event that sent you here indicates that a memory problem was detected, yet all memory appears to be available.
  7. Display the details.
    1. Click the radio button of the Processing Unit with the deconfigured memory.
    2. Click Continue. The status for each memory DIMM is displayed.
  8. See Table 1 for an example of the deconfigured memory details screen.
    Table 1. Deconfigured memory details (example) (Model 961)
    Memory DIMM  Location code                Size     State       Error type  Change settings
    0            U78AA.001.WIH002V-P1-C18-C1  8192 MB  Configured  None (0)    Configured
  9. Are any memory DIMMs deconfigured?
    • Yes, go to the next step.
    • No, exit this MAP and contact your next level of support. The serviceable event that sent you here indicates that a memory problem was detected, yet all memory appears to be available.
  10. Replace the memory DIMM FRUs for the location codes that are listed for the deconfigured memory banks. See Figure 1 and Figure 2.
    Figure 1. Memory module locations on the memory card (Un-P1-Cx) (Model 961)
    Memory module location is P1-Cx-Cy, where:
    x = Memory card location 15, 16, 17, or 18
    y = Memory DIMM slot 1 through 4 or 7 through 10, as shown
    Notes:
    • The first pair of memory modules is plugged into memory module slots P1-Cx-C1 and P1-Cx-C3.
    • The second pair of memory modules is plugged into memory module slots P1-Cx-C8 and P1-Cx-C10.
    • The third pair of memory modules is plugged into memory module slots P1-Cx-C2 and P1-Cx-C4.
    • The fourth pair of memory modules is plugged into memory module slots P1-Cx-C7 and P1-Cx-C9.
    Figure 2. CEC enclosure location codes (top) (Model 961)
  11. The possible failing FRUs are the memory DIMMs or the memory card.
    Is the FRU to be replaced listed in the serviceable event FRU list?
    1. Click Storage Facility Management > storage facility > Exchange Parts.
      Note: Setting the state of memory capacity back to Configured, or attempting a pseudo repair, is not recommended. For example, do not leave the original failing FRU installed. If you choose to do this to further isolate the failure, you must move each memory DIMM from its deconfigured slot to a different slot. On power-up, the firmware checks the serial number of the DIMM in each slot and does not configure the DIMM if the serial number in that slot has not changed.
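
The slot naming and pairing rules in step 10 (location P1-Cx-Cy, memory cards 15 through 18, and the four plug-order pairs) can be sketched as a small lookup. This is an illustration only: the function and constant names are hypothetical and are not part of any service tool. It assumes the location code has the form shown in Table 1.

```python
# Illustrative sketch only: these names are hypothetical, not part of any service tool.
MEMORY_CARDS = (15, 16, 17, 18)                 # x in P1-Cx-Cy
PLUG_ORDER = ((1, 3), (8, 10), (2, 4), (7, 9))  # slot pairs, first through fourth


def dimm_pair_locations(unit, card, pair_number):
    """Return the two location codes of a DIMM pair on a Model 961 memory card."""
    if card not in MEMORY_CARDS:
        raise ValueError(f"memory card location must be one of {MEMORY_CARDS}")
    if not 1 <= pair_number <= len(PLUG_ORDER):
        raise ValueError("pair number must be 1 through 4")
    slots = PLUG_ORDER[pair_number - 1]
    return tuple(f"{unit}-P1-C{card}-C{slot}" for slot in slots)


def pair_for_location(location):
    """Return (card, pair_number) for a location code ending in -P1-Cx-Cy."""
    unit, planar, card_part, slot_part = location.rsplit("-", 3)
    if planar != "P1" or card_part[:1] != "C" or slot_part[:1] != "C":
        raise ValueError(f"not a memory DIMM location code: {location}")
    card, slot = int(card_part[1:]), int(slot_part[1:])
    if card not in MEMORY_CARDS:
        raise ValueError(f"C{card} is not a memory card location")
    for pair_number, slots in enumerate(PLUG_ORDER, start=1):
        if slot in slots:
            return card, pair_number
    raise ValueError(f"slot C{slot} is not a memory DIMM slot")
```

For the Table 1 example, pair_for_location("U78AA.001.WIH002V-P1-C18-C1") returns (18, 1): the DIMM is on memory card C18 and belongs to the first pair, whose other slot is P1-C18-C3.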

MAP4070 Section-3

About this task

You are isolating a memory problem for a Model 98x.

Procedure

  1. Display open serviceable events for the CEC enclosure. Other than the serviceable event that sent you to this MAP, is there any other related open serviceable event that lists the memory DIMMs or system processor cards as FRUs?
    • Yes, exit this MAP and repair the related serviceable event. If the repair is successful, remember to close the serviceable event that sent you here. If you were doing a memory upgrade MES, exit this MAP and continue the MES now.
    • No, go to the next step.
  2. Does the serviceable event FRU list that sent you here include one or more memory DIMM locations to be replaced?
    • Yes, exit this MAP and return to the FRU list. Replace the listed memory DIMMs. If it still fails, return here and continue to the next step.
    • No, go to the next step.
  3. Log in to the ASM menu for the failing CEC enclosure. Type admin in the User ID field and admin2107 in the Password field. If the login fails, log in as admin with a password of admin210. See MAP6F10 Accessing the ASMI using the management console.
  4. Display the error and event logs to determine any related problems.
    1. Select System Service Aids > Error/Event logs.
    2. Display the log details and look for information that is related to memory DIMM errors or locations.
    3. Are there any logs that identify a failing memory DIMM or system processor card?
    • Yes, if you can determine the failing DIMMs, exit this MAP and use Storage Facility Management > storage facility > Exchange Parts to replace the DIMMs.
    • No, go to the next step.
  5. Display the memory deconfiguration status.
    1. Select System Configuration > Hardware Deconfiguration > Memory Deconfiguration. The total memory, configured memory, and deconfigured memory are displayed.
    2. Is there any deconfigured memory?
    • Yes, go to step 7.
    • No. It is possible to have no deconfigured memory and still be missing memory capacity. For example, if one or more memory DIMMs were left unplugged when the CEC was powered on, there might be no deconfigured memory, but the total memory and configured memory capacities would be less than normal. The CEC does not know how much memory capacity it should have and displays only what it finds during power-on. Go to the next step.
  6. Compare the total memory and configured memory capacities for the failing CEC enclosure to the working CEC enclosure.
    Are there any differences between the capacities of the CEC enclosures?
    • Yes, go to the next step.
    • No, exit this MAP and contact your next level of support. The serviceable event that sent you here indicates that a memory problem was detected, yet all memory appears to be available.
  7. Display the details.
    1. Click the radio button of the Processing Unit with the deconfigured memory.
    2. Click Continue. The status for each memory bank is displayed.
  8. See Table 2 for an example of the deconfigured memory details screen.
    Table 2. Deconfigured memory details example, Models 98x
    Location code             Size      Memory DIMM  Size     Functional state  Error type  Change settings
    U78C9.001.WZS03K0-P1-C16  16384 MB
                                        0            4096 MB  Configured        None (0x0)  Not Applicable
                                        1            4096 MB  Configured        None (0x0)  Not Applicable
                                        2            4096 MB  Configured        None (0x0)  Not Applicable
                                        3            4096 MB  Configured        None (0x0)  Not Applicable
  9. Are any memory banks deconfigured?
    • Yes, go to the next step.
    • No, exit this MAP and contact your next level of support. The serviceable event that sent you here indicates that a memory problem was detected, yet all memory appears to be available.
  10. Replace the memory DIMM FRUs for the location codes that are listed for the deconfigured memory banks. See Figure 3, Figure 4, and Figure 5.
    Figure 3. CEC enclosure location codes (top), Models 980, 983, 984
    Figure 4. CEC enclosure location codes (top), Models 981, 985, 986
    Figure 5. CEC enclosure location codes (top view) (Models 982, 988)
  11. The possible failing FRUs are the memory DIMMs or the system processor card.
    Is the FRU to be replaced listed in the serviceable event FRU list?
    1. Click Storage Facility Management > storage facility > Exchange Parts.
      Note: Setting the state of memory capacity back to Configured, or attempting a pseudo repair, is not recommended. For example, do not leave the original failing FRU installed. If you choose to do this to further isolate the failure, you must move each memory DIMM from its deconfigured slot to a different slot. On power-up, the firmware checks the serial number of the DIMM in each slot and does not configure the DIMM if the serial number in that slot has not changed.
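
The decision made in steps 5 and 6 of the procedure above can be summarized as a short sketch. It is an illustration of the branching logic only. The MemoryStatus container and triage function are hypothetical names, and the capacities are assumed to be values read by hand from the Memory Deconfiguration screen of each CEC enclosure.

```python
from dataclasses import dataclass


@dataclass
class MemoryStatus:
    """Capacities in MB from the Memory Deconfiguration screen (hypothetical container)."""
    total_mb: int
    configured_mb: int
    deconfigured_mb: int


def triage(failing, working):
    """Return (next_step, reason) following the branches of steps 5 and 6."""
    # Step 5: any deconfigured memory goes straight to the details display.
    if failing.deconfigured_mb > 0:
        return "step 7", "deconfigured memory is reported"
    # Step 6: with nothing deconfigured, missing capacity shows up only as a
    # difference from the working CEC enclosure (for example, an unplugged DIMM).
    if (failing.total_mb, failing.configured_mb) != (working.total_mb, working.configured_mb):
        return "step 7", "capacity differs from the working CEC enclosure"
    return "next level of support", "all memory appears to be available"
```

For example, a failing CEC enclosure that reports 49152 MB total and configured with 0 MB deconfigured, next to a working CEC enclosure that reports 65536 MB, takes the step 6 branch to step 7. The CEC displays only the memory it found at power-on, so the comparison is the only evidence of the missing capacity.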