Product alert for 2.5-inch 300 GB, 600 GB, 900 GB 10000 rpm hard drive - IBM System x, BladeCenter, Flex Systems
RETAIN tip: H205434
The system's Serial Attached SCSI (SAS)/Serial Advanced Technology Attachment (SATA) controller logs indicate multiple 03/11 read errors.
The operating system hardware event logs indicate multiple 03/11 read errors.
A complete list of serial numbers for the affected 300 Gigabyte (GB), 600 GB, and 900 GB Hard Disk Drives is included in this Tip.
The system is configured with one or more of the following IBM option part numbers:
- 300 GB 10000 rpm SAS 2.5-inch SFF Hot-Swap hard drive, option part number 42D0612, replacement part number 42D0613
- 300 GB 10000 rpm SAS 2.5-inch SFF Simple-Swap hard drive, option part number 49Y1991, replacement part number 49Y1992
- 300 GB 10000 rpm SAS 2.5-inch SFF Slim Hot-Swap hard drive, option part number 42D0637, replacement part number 42D0638
- 300 GB 10000 rpm SAS 2.5-inch SFF Slim Hot-Swap hard drive, option part number 44W2193, replacement part number 42D0638
- 300 GB 10000 rpm SAS 2.5-inch SFF Slim Non Hot-Swap hard drive, option part number 42D0627, replacement part number 42D0628
- 600 GB 10000 rpm SAS 2.5-inch SFF Simple-Swap hard drive, option part number 49Y2027, replacement part number 49Y2028
- 600 GB 10000 rpm SAS 2.5-inch SFF Slim hard drive, option part number 49Y2022, replacement part number 49Y2023
- 600 GB 10000 rpm SAS 2.5-inch SFF Hot-Swap hard drive, option part number 90Y8751, replacement part number 90Y8782
- 600 GB 10000 rpm SAS 2.5-inch SFF Slim Hot-Swap hard drive, option part number 49Y2003, replacement part number 49Y2004
- 900 GB 10000 rpm SAS 2.5-inch SFF, G2, Hot-Swap Express, hard drive, option part number 81Y9927, any replacement part number
- 900 GB 10000 rpm SAS 2.5-inch SFF, G2, Hot-Swap hard drive, option 81Y9650, any replacement part number
- 900 GB 10000 rpm SAS 2.5-inch SFF, G2, Simple-Swap hard drive, option 81Y9654, any model
This tip is not system specific.
This tip is not software specific.
The system has the symptom described above.
In order to determine which hard drives have to be replaced, check the following information:
- The had drive Vital Product Data (VPD) helps to isolate affected units and is available readily though the hardware summary of the operating system. VPD of the affected hard drive product is:
- For the 300 GB... Inquiry Data: IBM-ESXSST9300xxxxx
- For the 600 GB... Inquiry Data: IBM-ESXSST9600xxxxx
- For the 900 GB... Inquiry Data: IBM-ESXSST9900xxxxx
Examples of valid VPDs are:
- IBM-ESXSST9300603SS, IBM-ESXSST9300605SS etc.
- Check the hard drive serial number listing even if the media errors (03/11) are not shown in log files: http://download.boulder.ibm.com/ibmdl/pub/systems/support/system_x/migr-5089882_eca078_hdd_sn_v2.txt
To aid in the process of checking the hard drive VPD and serials numbers to validate that they qualify for replacement, a Dynamic System Analysis (DSA) log can be pulled on every suspected system and sent to IBM. Users then can contact the IBM System x help desk for analysis of those logs and, if they qualify, start the ECA078 process.
A list of phone numbers is available at: http://www.ibm.com/planetwide/region.html
For further assistance, users in the United States (US) and Canada can call the 800-IBM-SERV number.
For users affected with more than three hard drives, the System x Project Office must be engaged. For users with three or fewer affected hard drives, Customer Replaceable Unit (CRU) parts will be dispatched for Next Business Day (NBD) arrival (assuming that inventory is available), and the IBM support agent should mention ECA078 in the CRU order record.
For additional information, refer to the Workaround section of this Tip.
Any failed hard drives can be replaced by Business As Usual (BAU) processes, which aim to include Same-Day CRU processes if the machine is entitled to that level of support.
All users who require IBM to facilitate installation must engage the System x Project Office. CRU-only replacement is the prescribed method approved for ECA078.
For additional information, see ECA078 per RETAIN tip H205539 in support document MIGR-5090022.
Note: ECA078 is scheduled for withdrawal on October 10, 2014.
Before replacing any suspected hard drive, check the labels of each hard drive for the following information:
- 11S serial number, which must start with:
- 11S49Y7424 (300 GB disk drive)
- 11S49Y7426 (600 GB disk drive)
- 11S49Y7427 (900 GB disk drive)
- Manufacturing date: This must be between 2012-01-13 and 2012-04-12.
- The drive model, which must be:
- IBM-ESXSST9300xxxxx (300 GB disk drive)
- IBM-ESXSST9600xxxxx (600 GB disk drive)
- IBM-ESXSST9900xxxxx (900 GB disk drive)
- The drive serial number per the following serial number list:
- The manufacturing location must be China.
This is what is to be checked on a suspected drive before replacing it:
IMPORTANT: The disk drive is to be replaced only if all five criteria are met. This means that under certain circumstances the disk drive is not to be replaced, such as when the production date is outside the time frame 2012-01-13 and 2012-04-12, and so forth.
When the replacement parts arrive, take the following steps to migrate the disks into operation:
- For RAID-0 and Just a Bunch of Disks (JBOD)-configured hard drives, it is necessary to have a current valid backup available to restore the data to the newly integrated disk.
- For RAID-1, RAID-5, or RAID-6 volumes, which contain more than two physical disks, if possible add a new replacement disk to an open slot and, using IBM MegaRAID Storage Manager (MSM), configure the newly added disk as a hot spare.
- Use MegaRAID Storage Manager (MSM) to select one of the disks that needs to be replaced and mark it offline. The array degrades in performance until the hot spare begins the rebuild.
- Prepare the offline disk for removal and remove it from the server. Again, the hot spare automatically starts the rebuild operation as soon as the disk that was selected to be removed is marked offline.
- When the hot spare has completed the rebuild operation and the array reverts back to optimal, add another of the new replacement disks to the emptied slot, and configure it as a hot spare.
- Repeat the previous steps until all of the units that are listed on the serial number list have been replaced.
For RAID-1, RAID-5, or RAID-6 configurations where no hot spare slot is available, use MSM to mark a single disk offline, then prepare it for removal and remove it.
Insert a replacement disk into the same slot that was marked offline, and the rebuild will initiate. If the rebuild does not begin, use MSM or WebBIOS to configure the new disk as a hot spare, and the rebuild will then begin.
Repeat the outlined procedure until all disks have been replaced.
The IBM 2.5-inch 300 GB, 600 GB, and 900 GB 10000 rpm hard disk drives issue comes from a manufacturing process defect. The defect leads to drive contamination, and affects the drive's service life duration.
Replacement of all affected hard disk devices listed on the serial number listing is highly recommended.
Applicable countries and regions
Back to top
Document id: MIGR-5089882
Last modified: 2013-11-04
Copyright © 2014 IBM Corporation