MAP4968 Recovery actions for special PCIe-related I/O enclosure errors (Model 951)
This MAP calls for SRCs that require special repair actions to be completed by the service representative or the next level of support.
MAP4968 Section-1
Procedure
- Does the FRU list, in the serviceable event, that sent
you here contain a symbolic FRU similar to Invalid-MTMS-cpssebay**?
- Yes, continue at the next step.
- No, go to MAP4968 Section-2.
- When the FRU list contains a symbolic FRU similar to Invalid-MTMS-cpssebay** the location code is invalid and cannot be used to determine the failing I/O enclosure.
-
To determine the cpssebay** value from the symbolic FRU of
Invalid-MTMS-cpssebay**, use the first column of Table 1.
Table 1. Symbolic FRU location to type-location translation Symbolic FRU location code Location Type-location I/O enclosure number cpssebay00 1B1 1400-1B1 0 cpssebay01 1B2 1400-1B2 1 cpssebay02 1B3 1400-1B3 2 cpssebay03 1B4 1400-1B4 3 cpssebay04 2B1 1400-2B1 4 cpssebay05 2B2 1400-2B2 5 cpssebay06 2B3 1400-2B3 6 cpssebay07 2B4 1400-2B4 7 - Determine the location code in Table 1, second column for the symbolic FRU location code in the FRU list.
-
Convert the three-character location code from the prior step to a physical location of the I/O
enclosure in the rack. See Figure 1.
Figure 1. I/O enclosure locations in front of rack 
- Determine the serial number of the I/O enclosure by reading the MTMS label.
-
Open the Advanced System Management (ASM) menu:
- From the navigation area, click Storage Facility Management > Server view.
- From the bottom Task area, click Operations > Launch Advanced System Management (ASM).
- On the launch ASM interface confirmation, click OK.
- The management console web browser is opened, and the ASM login panel is displayed.
- Log in as admin with a password
of admin2107. Notes:
- If you are logged in and not active for 15 minutes, your session expires.
- If you make five invalid login attempts, your user account is locked out for five minutes and none of the other accounts are affected.
- Reset the I/O enclosure MTMS from the ASM menu:
- Expand System Configuration.
- Select Configure IO Enclosures.
- Observe the Type-Model column in the displayed Enclosure Configuration table.
- Find the row that contains the Type-Model determined from step 3.
- Select the radio button for that I/O enclosure.
- Click Change settings.
- Modify the Type-Model field to match and Type-Location field in Table 1 for the I/O enclosure.
- Modify the Serial number field to match the serial number read from the I/O enclosure machine/type/model/serial number label.
- Click Save Settings.
-
Update the HMC microcode objects for the I/O enclosure machine/type/model/serial number by
using a pseudo repair of the PCIe and SPCN card FRU. The update causes the I/O enclosure to be
power-cycled.
- From the navigation area, click Storage Facility Management > storage facility.
- From the Task area, click Exchange Parts > Exchange IO Enclosure and Components.
- Click Show I/O Enclosures and select the enclosure location.
- Click Show FRUS.
- Select I/O Enclosure PCIe/SPCN Card and then click Exchange FRU.
-
When prompted to replace the FRU, do not disconnect the PCIe and SPCN cables from the card. Do not remove
the card.
Continue the repair.
- If the repair is successful, exit this MAP and ensure that any related serviceable events are closed.
- If the repair fails with the same error, replace the I/O enclosure PCIe / SPCN card.
MAP4968 Section-2
Procedure
- Find your SRC in Table 2.
Table 2. Repair actions for special SRCs SRCs that require special repairs SRCs Action BE370012 PCIe I/O enclosure discovery failure (missing I/O enclosure). Go to MAP4968 Section-4. BE38256B PCIe enclosure discovery/configuration failure. Could not initialize path from local server to I/O enclosure. Go to MAP4968 Section-3. BE38256C I/O enclosure FPGA update image corrupted on local server. Contact your next level of support. BE38256D PCIe I/O enclosure FPGA error. Contact your next level of support. BE38256E PCIe I/O enclosure MTMS unknown/invalid. Contact your next level of support. BE38256F PCIe I/O enclosure mis-cabling detected. Go to MAP4968 Section-3. BE382572 Error occurred during I/O enclosure error data collection. Go to MAP4968 Section-3. BE38257B PCIe interface to PCIe I/O enclosure down. Go to MAP4968 Section-3. BE382563 Multi-PCIe link degraded detected on the local server. Contact your next level of support. BE382566 PCIe I/O enclosure discovery/configuration failure. Go to MAP4968 Section-3. BE382567 Invalid server config. Contact your next level of support. BE382574 One LPAR cannot communicate the I/O enclosure; a system failover is required. Go to MAP4968 Section-3. BE382575 PCIe I/O enclosure discovery failure (missing an I/O enclosure). Go to MAP4968 Section-4. Any other SRC Contact your next level of support. - Use the Action column entry to continue the repair.
MAP4968 Section-3
About this task
Important: Both ends of each PCIe cable are displayed in the FRU list. Only the
first cable location code is available to select for repair or replace for each cable in the FRU
list. The subsequent CBLCONT location code shows where a cable continues to connect to, but is not
available to select for repair or replace.
Procedure
- Inspect both ends of each PCIe cable listed
in the FRU list.
- Do not plug or unplug the cable.
-
Refer to Figure 2, Figure 3, and
Figure 4 cabling diagrams based on the number of installed I/O
enclosures in the machine. The CBLCONT location code that is listed is the port on the I/O enclosure
where the cable is supposed to be connected.
Based on the appropriate cable figure, check each end of the cable that is listed on the screen that sent you here to ensure that it is properly plugged into the correct connector.
- Observe the body of the cable to ensure that it is not damaged.
Figure 2. Model 951, two I/O enclosures 
Figure 3. Model 951, four I/O enclosures 
Figure 4. Model 951, eight I/O enclosures 
- Is the PCIe cable properly
plugged and not damaged?
- Yes, go to the next step.
- No, go to step 5
-
The cable is properly plugged and is not damaged. Did
you reach this step after replacing both the I/O enclosure PCIe and SPCN card and the I/O enclosure backplane?- No, go to the next step.
- Yes, a pseudo-repair of the PCIe and SPCN card might recover this condition. Continue with
the following steps:
- Return to the screen that sent you here.
- To the question, "What was the result of using the service procedure from Infocenter?" click Problem not fixed and then click Next.
- To the question, "Did you exchange any parts,?" click No and then click Next.
- To the question, "Did you isolate the problem,"? click Yes and then click Next.
- The current repair action ends, but the serviceable event remains open. Use
the Exchange Parts menu to complete a pseudo-repair of the I/O enclosure PCIe and SPCN card:- Storage Facility Management > > storage facility > > Exchange Parts
Remove I/O enclosure power when instructed to do so in the exchange procedure, but you do not need to uncable or remove the PCIe and SPCN card.
-
The cable is properly plugged and is not damaged. The
I/O enclosure PCIe and SPCN card and the I/O enclosure backplane were not both replaced.- Return to the screen that sent you here.
- To the question, "What was the result of using the service procedure from Infocenter?" click Problem not fixed and then click Next.
- To the question, "Did you exchange any parts?" click No and then click Next.
- To the question, "Did you isolate the problem?" click No and then click Next.
- The next FRU in the list is displayed. Continue the repair by replacing the remaining FRUs until the problem is fixed. Exit this MAP.
-
The cable is incorrectly plugged or damaged. Did a failed IO enclosure
installation lead you to this MAP?
- Yes
- Exit this repair.
- Retry the original MES installation with cables properly connected.
- No, the incorrect plugging of the cable or damage to the cable occurred during a repair.
- Return to the screen that sent you here.
- To the question, "What was the result of using the service procedure from Infocenter?" click Problem not fixed and then click Next.
- To the question, "Did you exchange any parts?" click No and then click Next.
- To the question, "Did you isolate the problem?" click No and then click Next.
- When the next FRU in the list is displayed, pretend that the other FRUs in the previous FRU list are not available onsite to be replaced.
- When asked if the FRU is available to be replaced, answer no. This answer causes each FRU in the
list to be displayed until the incorrectly plugged cable or the damaged cable is displayed.
When the incorrectly plugged cable or the damaged cable is displayed, do a normal FRU replace.
- When the repair is complete, exit this MAP.
- Yes
MAP4968 Section-4
Procedure
- Observe the FRU list in the serviceable event details
that sent you here. It should include one or more of the following
FRUs:
- I/O enclosure PCIe and SPCN card
- I/O enclosure backplane
-
Display open serviceable
events that need repair. Is
there any other serviceable event with either FRUs
determined in step 1 or with other FRUs such as power supply
or fan from this I/O enclosure?
- Yes, exit this MAP and attempt to repair that serviceable event first.
If that repair does not correct this problem, return here and continue with the next step.
If that repair does correct this problem, remember to also close this serviceable event.
- No, go to the next step.
- Yes, exit this MAP and attempt to repair that serviceable event first.
- Inspect both ends of both PCIe cables
that are associated with the I/O enclosure listed in the FRU list,
that is, intended to be connected to this I/O enclosure.
- Do not plug or unplug the cables.
- Refer to Figure 2, Figure 3, and Figure 4 cabling diagrams based on the number of installed I/O enclosures in the machine. Based on the appropriate cable figure, visually check each end of both cables that are intended to be connected to this I/O enclosure to see whether they are properly plugged into the correct connector.
- Observe the body of the cable to ensure that it is not damaged.
- Are the PCIe cables
to the I/O enclosure properly plugged and not damaged?
- Yes, go to the next step.
- No, go to step 6.
- The cables are properly plugged and are not damaged.
- Return to the screen that sent you here.
- To the question, "What was the result of using the service procedure from Infocenter?" click Problem not fixed and then click Next.
- To the question, "Did you exchange any parts?" click No and then click Next.
- To the question, "Did you isolate the problem?" click No and then click Next.
-
The next FRU in the list is displayed. Continue the repair by replacing the remaining FRUs
until the problem is fixed.
Exit this MAP.
-
At least one cable is incorrectly plugged or damaged. Did a failed IO enclosure
installation lead you to this MAP?
- Yes
- Exit this repair.
- Retry the original MES installation with cables properly connected.
- No, the incorrect plugging of the cable or damage to the cable occurred during a repair.
- Return to the screen that sent you here.
- To the question, "What was the result of using the service procedure from Infocenter?" click Problem not fixed and then click Next.
- To the question, "Did you exchange any parts?" click No and then click Next.
- To the question, "Did you isolate the problem?" click No and then click Next.
- The next FRU in the list is displayed. Continue the repair on this FRU, but when instructed to replace the FRU, do not replace that FRU, but instead replace the damaged cables connected to the I/O enclosure.
- If the repair completes successfully, exit this MAP. Otherwise, contact your next level of support.
- Yes