This procedure describes how to replace a failing HBA (Fibre Channel adapter) that is used in combination with SDD storage:
- Determine which adapter is failing (0, 1, 2, etcetera): # datapath query adapter
- Check if there are dead paths for any vpaths: # datapath query device
- Try to bring a "degraded" adapter back online by setting it offline and then online again (adapter "1" is the example here; replace it with the number of the failing adapter): # datapath set adapter 1 offline
 # datapath set adapter 1 online
- If the adapter is still in a "degraded" status, open a call with IBM. They will most likely require you to take a snap of the system and send the snap file to IBM for analysis. IBM will then conclude whether the adapter needs to be replaced.
- Involve the SAN storage team if the adapter needs to be replaced. They will have to update the WWN of the failing adapter, because the replacement adapter comes with a new WWN.
- If the adapter needs to be replaced, wait for the IBM CE to be onsite with the new HBA. Note the new WWN and supply it to the SAN storage team.
- Remove the adapter: # datapath remove adapter 1 (replace the "1" with the number of the failing adapter).
- Check if the vpaths now all have one path fewer: # datapath query device | more
- De-configure the adapter (this also de-configures all the child devices, so you won't have to do that manually): Run diag and choose Task Selection, Hot Plug Task, PCI Hot Plug Manager, Unconfigure a Device. Select the correct adapter, e.g. fcs1, set "Unconfigure any Child Devices" to "yes", and "KEEP definition in database" to "no". Hit ENTER.
- Replace the adapter: Run diag and choose Task Selection, Hot Plug Task, PCI Hot Plug Manager, Replace/Remove a PCI Hot Plug Adapter. Choose the correct device (be careful: you won't see the adapter name here, only "Unknown", because the device was unconfigured).
- Have the IBM CE replace the adapter.
- Close any events on the failing adapter on the HMC.
- Validate that the notification LED on the system is now off. If it is not, go back into diag, choose Task Selection, Hot Plug Task, PCI Hot Plug Manager, and disable the attention LED.
- Check the adapter firmware level using: # lscfg -vl fcs1 (replace this with the actual adapter name).
 If required, update the adapter microcode. Validate that the adapter is functioning correctly by running: # errpt
 # lsdev -Cc adapter
- Have the SAN admin update the WWN.
- Run: # cfgmgr -S 
- Check the adapter and the child devices (replace fcs1 and fscsi1 with the correct device names): # lsdev -Cc adapter
 # lsdev -p fcs1
 # lsdev -p fscsi1
- Add the paths to the device: # addpaths
- Check if the vpaths have all paths again: # datapath query device | more 
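
The checks in the steps above (which adapter is not healthy, and how many paths each vpath has) can be scripted instead of read off the screen. The following is a minimal sketch, not an official SDD tool. The function names list_unhealthy_adapters and summarize_vpath_paths are made up for this example, and the parsing assumes the usual column layout of the datapath output: adapter rows start with the adapter number, name and state, vpath blocks start with a "DEV#:" line, and path rows carry an "adapter/hdisk" field followed by the path state. Verify this against the output on your own system before relying on it.

```shell
#!/bin/sh
# Both functions read datapath output on standard input, so they can be
# tried out on saved output without touching a live system.

# Print "number name state" for every adapter whose state is not NORMAL.
# ASSUMPTION: data rows of "datapath query adapter" look like
#   <Adpt#> <Name> <State> <Mode> ...
list_unhealthy_adapters() {
    awk '$1 ~ /^[0-9]+$/ && $3 != "NORMAL" { print $1, $2, $3 }'
}

# Print "vpath total_paths dead_paths" for every vpath.
# ASSUMPTION: each device block of "datapath query device" starts with
#   DEV#: <n> DEVICE NAME: <vpath> ...
# and each path row looks like
#   <Path#> <adapter>/<hdisk> <State> <Mode> ...
summarize_vpath_paths() {
    awk '
        function report() { if (dev != "") print dev, total, dead }
        $1 == "DEV#:" { report(); dev = $5; total = 0; dead = 0; next }
        $1 ~ /^[0-9]+$/ && index($2, "/") > 0 {
            total++
            if ($3 == "DEAD") dead++
        }
        END { report() }
    '
}

# Usage on a live system (shown as comments, not executed here):
#   datapath query adapter | list_unhealthy_adapters
#   datapath query device  | summarize_vpath_paths
```

Running summarize_vpath_paths before removing the adapter, after removing it, and again after addpaths gives three short lists that are easy to compare: the total should drop by one per vpath after the removal and return to the original value at the end, with zero dead paths.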




