HP P840 HDD RAID 5 many strange drive failuresHP SmartArray P400: How to repair failed logical drive?Reusing Raid 5 Drive?reliably and automatically determine connection path of physical position of HDD from /dev/sdX device fileHow to replace failed drive in RAID 5 array in HP DL380 G4 serverQuestions on increasing RAID 5 arrayRaid 10, Logical device are missingHP Code 341 “Physical Drive State: Predictive failure. This physical drive is predicted to fail soon.”HPE 1.92TB SATA 6G Mixed Use SFF SSD very slow compared to SAS HDD HP disksHP drive array “ready for rebuild” (RAID5)Hard Disc Failure or RAID Glitch

Limit max CPU usage SQL SERVER with WSRM

Can I say "fingers" when referring to toes?

Extracting patterns from a text

Would this string work as string?

Review your own paper in Mathematics

If the only attacker is removed from combat, is a creature still counted as having attacked this turn?

Can I run 125khz RF circuit on a breadboard?

What is it called to attack a person then say something uplifting?

When and why was runway 07/25 at Kai Tak removed?

Do I have to take mana from my deck or hand when tapping a dual land?

Storage of electrolytic capacitors - how long?

How to make a list of partial sums using forEach

How to Disable and Drop all Temporal Tables from a database

What properties make a magic weapon befit a Rogue more than a DEX-based Fighter?

What (if any) is the reason to buy in small local stores?

Has the laser at Magurele, Romania reached a tenth of the Sun's power?

Possible Eco thriller, man invents a device to remove rain from glass

How to make money from a browser who sees 5 seconds into the future of any web page?

Why is participating in the European Parliamentary elections used as a threat?

Is stochastic gradient descent pseudo-stochastic?

The Digit Triangles

Should a narrator ever describe things based on a character's view instead of facts?

Echo with obfuscation

Given this phrasing in the lease, when should I pay my rent?



HP P840 HDD RAID 5 many strange drive failures


HP SmartArray P400: How to repair failed logical drive?Reusing Raid 5 Drive?reliably and automatically determine connection path of physical position of HDD from /dev/sdX device fileHow to replace failed drive in RAID 5 array in HP DL380 G4 serverQuestions on increasing RAID 5 arrayRaid 10, Logical device are missingHP Code 341 “Physical Drive State: Predictive failure. This physical drive is predicted to fail soon.”HPE 1.92TB SATA 6G Mixed Use SFF SSD very slow compared to SAS HDD HP disksHP drive array “ready for rebuild” (RAID5)Hard Disc Failure or RAID Glitch













6















I've been using a RAID5 HDD storage (8x6TB) at my HP P840 for like 2 years now and it has always had unusually many drive failures. Everything was good for half a year, but now drives are failing in a strange way.
For example 2 new drives failed a few days after they have been added to the RAID.
I have also already replaced the RAID controller and am using up-to-date firmware on Mainboard and RAID controller.



Also I have tried to use different drives.
Initially there were HGST DeskStar 6TB drives used in that RAID, now I have been replacing them with HGST UltraStar 6TB when replacing failed drives. But the behaviour is the same.



Also it seems (most of) the drives are not really failed, because once I replaced the RAID controller, one failed drive was recognized as OK again and the rebuild started.



My hoster's support says the problem is that I'm actually using RAID5 and I should switch to RAID10 instead. It's hard for me to believe as I've been using RAID5 without problems at other systems (no drive failures in years).



Can anyone give me a hint, what else the culprit could be? Something wrong with the way the RAID controller is configured?



Thank you!



EDIT:

The server is a HP DL180 G9

Reason for drive failure is always "Write retries failed"



 Smart Array P840 in Slot 1 (sn: PDNNF0ARH321GD)


Port Name: 1I

Port Name: 2I

Internal Drive Cage at Port 1I, Box 2, OK

Internal Drive Cage at Port 1I, Box 2, OK

Internal Drive Cage at Port 2I, Box 1, OK
array A (Solid State SATA, Unused Space: 0 MB)


logicaldrive 1 (447.1 GB, RAID 1+0, OK)

physicaldrive 2I:1:1 (port 2I:box 1:bay 1, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, Solid State SATA, 240.0 GB, OK)

array B (SATA, Unused Space: 0 MB)


logicaldrive 2 (38.2 TB, RAID 5, Interim Recovery Mode)

physicaldrive 1I:2:1 (port 1I:box 2:bay 1, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:2 (port 1I:box 2:bay 2, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:3 (port 1I:box 2:bay 3, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:4 (port 1I:box 2:bay 4, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:5 (port 1I:box 2:bay 5, SATA, 6001.1 GB, Failed)
physicaldrive 1I:2:6 (port 1I:box 2:bay 6, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:7 (port 1I:box 2:bay 7, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:8 (port 1I:box 2:bay 8, SATA, 6001.1 GB, OK)


Detailed Info:



 Smart Array P840 in Slot 1
Bus Interface: PCI
Slot: 1
Serial Number: PDNNF0ARH321GD
Cache Serial Number: PEYFP0BRH323YZ
RAID 6 (ADG) Status: Enabled
Controller Status: OK
Hardware Revision: B
Firmware Version: 6.60
Rebuild Priority: High
Expand Priority: Medium
Surface Scan Delay: 3 secs
Surface Scan Mode: Idle
Parallel Surface Scan Supported: Yes
Current Parallel Surface Scan Count: 1
Max Parallel Surface Scan Count: 16
Queue Depth: Automatic
Monitor and Performance Delay: 60 min
Elevator Sort: Enabled
Degraded Performance Optimization: Disabled
Inconsistency Repair Policy: Disabled
Wait for Cache Room: Disabled
Surface Analysis Inconsistency Notification: Disabled
Post Prompt Timeout: 15 secs
Cache Board Present: True
Cache Status: OK
Cache Ratio: 10% Read / 90% Write
Drive Write Cache: Enabled
Total Cache Size: 4.0 GB
Total Cache Memory Available: 3.2 GB
No-Battery Write Cache: Enabled
SSD Caching RAID5 WriteBack Enabled: True
SSD Caching Version: 2
Cache Backup Power Source: Batteries
Battery/Capacitor Count: 1
Battery/Capacitor Status: OK
SATA NCQ Supported: True
Spare Activation Mode: Activate on physical drive failure (default)
Controller Temperature (C): 51
Cache Module Temperature (C): 38
Number of Ports: 2 Internal only
Encryption: Disabled
Express Local Encryption: False
Driver Name: hpsa
Driver Version: 3.4.16
Driver Supports HP SSD Smart Path: True
PCI Address (Domain:Bus:Device.Function): 0000:06:00.0
Negotiated PCIe Data Rate: PCIe 3.0 x8 (7880 MB/s)
Controller Mode: RAID
Controller Mode Reboot: Not Required
Latency Scheduler Setting: Disabled
Current Power Mode: MaxPerformance
Host Serial Number: CZ270500GM
Sanitize Erase Supported: False
Primary Boot Volume: logicaldrive 1 (600508B1001CE0F9FACF3A1358647115)
Secondary Boot Volume: logicaldrive 1 (600508B1001CE0F9FACF3A1358647115)


Port Name: 1I
Port ID: 0
Port Connection Number: 0
SAS Address: 5001438038AD05A0
Port Location: Internal
Managed Cable Connected: False

Port Name: 2I
Port ID: 1
Port Connection Number: 1
SAS Address: 5001438038AD05A8
Port Location: Internal
Managed Cable Connected: False

Internal Drive Cage at Port 1I, Box 2, OK
Power Supply Status: Not Redundant
Drive Bays: 4
Port: 1I
Box: 2
Location: Internal

Physical Drives
physicaldrive 1I:2:1 (port 1I:box 2:bay 1, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:2 (port 1I:box 2:bay 2, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:3 (port 1I:box 2:bay 3, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:4 (port 1I:box 2:bay 4, SATA, 6001.1 GB, OK)
None attached


Internal Drive Cage at Port 1I, Box 2, OK
Power Supply Status: Not Redundant
Drive Bays: 4
Port: 1I
Box: 2
Location: Internal

Physical Drives
physicaldrive 1I:2:1 (port 1I:box 2:bay 1, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:2 (port 1I:box 2:bay 2, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:3 (port 1I:box 2:bay 3, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:4 (port 1I:box 2:bay 4, SATA, 6001.1 GB, OK)
None attached


Internal Drive Cage at Port 2I, Box 1, OK
Power Supply Status: Not Redundant
Drive Bays: 4
Port: 2I
Box: 1
Location: Internal

Physical Drives
physicaldrive 2I:1:1 (port 2I:box 1:bay 1, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, Solid State SATA, 240.0 GB, OK)
None attached

Array: A
Interface Type: Solid State SATA
Unused Space: 0 MB (0.0%)
Used Space: 894.2 GB (100.0%)
Status: OK
MultiDomain Status: OK
Array Type: Data
HP SSD Smart Path: disable



Logical Drive: 1
Size: 447.1 GB
Fault Tolerance: 1+0
Heads: 255
Sectors Per Track: 32
Cylinders: 65535
Strip Size: 256 KB
Full Stripe Size: 512 KB
Status: OK
MultiDomain Status: OK
Caching: Enabled
Unique Identifier: 600508B1001CE0F9FACF3A1358647115
Disk Name: /dev/sda
Mount Points: / 18.6 GB Partition Number 2
OS Status: LOCKED
Logical Drive Label: 0216D6F9PDNNF0ARH502MC7DFA
Mirror Group 1:
physicaldrive 2I:1:1 (port 2I:box 1:bay 1, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, Solid State SATA, 240.0 GB, OK)
Mirror Group 2:
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, Solid State SATA, 240.0 GB, OK)
Drive Type: Data
LD Acceleration Method: Controller Cache

physicaldrive 2I:1:1
Port: 2I
Box: 1
Bay: 1
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV712004AG240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 31
Maximum Temperature (C): 39
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 2I:1:2
Port: 2I
Box: 1
Bay: 2
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV706303CH240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 29
Maximum Temperature (C): 36
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 2I:1:3
Port: 2I
Box: 1
Bay: 3
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV712003V8240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 29
Maximum Temperature (C): 35
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 2I:1:4
Port: 2I
Box: 1
Bay: 4
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV712004GA240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 31
Maximum Temperature (C): 37
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False


Array: B
Interface Type: SATA
Unused Space: 0 MB (0.0%)
Used Space: 43.7 TB (100.0%)
Status: Failed Physical Drive
MultiDomain Status: OK
Array Type: Data
HP SSD Smart Path: disable

Warning: One of the drives on this array have failed or has been removed.




Logical Drive: 2
Size: 38.2 TB
Fault Tolerance: 5
Heads: 255
Sectors Per Track: 32
Cylinders: 65535
Strip Size: 256 KB
Full Stripe Size: 1792 KB
Status: Interim Recovery Mode
MultiDomain Status: OK
Caching: Enabled
Parity Initialization Status: Initialization Failed
Unique Identifier: 600508B1001CF94F84873C91FD89B549
Disk Name: /dev/sdb
Mount Points: None
Logical Drive Label: 04DA1DD6PDNNF0ARH502MC546F
Drive Type: Data
LD Acceleration Method: Controller Cache

physicaldrive 1I:2:1
Port: 1I
Box: 2
Bay: 1
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNW7JH
Serial Number: NAHN3UZY
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 37
Maximum Temperature (C): 43
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:2
Port: 1I
Box: 2
Bay: 2
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNT517
Serial Number: NAHLKP0X
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 37
Maximum Temperature (C): 56
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:3
Port: 1I
Box: 2
Bay: 3
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: T7MH
Serial Number: NCH8E81Z
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 33
Maximum Temperature (C): 41
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:4
Port: 1I
Box: 2
Bay: 4
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNW7JH
Serial Number: NAHYMAUY
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 34
Maximum Temperature (C): 41
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:5
Port: 1I
Box: 2
Bay: 5
Status: Failed
Last Failure Reason: Write retries failed
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: T7MH
Serial Number: K1H942MD
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Maximum Temperature (C): 43
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Applicable
Sanitize Erase Supported: False

physicaldrive 1I:2:6
Port: 1I
Box: 2
Bay: 6
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: TDR2
Serial Number: K8JM5TKN
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 33
Maximum Temperature (C): 38
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:7
Port: 1I
Box: 2
Bay: 7
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNW7JH
Serial Number: K8H9BW2N
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 34
Maximum Temperature (C): 39
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:8
Port: 1I
Box: 2
Bay: 8
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: T7MH
Serial Number: K1H623JD
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 35
Maximum Temperature (C): 40
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False









share|improve this question



















  • 1





    One of the drives reports 56C maybe that’s a thermal problem? All your drives seem 4K, so I guess the stripes would be correctly aligned.

    – eckes
    Mar 17 at 16:16















6















I've been using a RAID5 HDD storage (8x6TB) at my HP P840 for like 2 years now and it has always had unusually many drive failures. Everything was good for half a year, but now drives are failing in a strange way.
For example 2 new drives failed a few days after they have been added to the RAID.
I have also already replaced the RAID controller and am using up-to-date firmware on Mainboard and RAID controller.



Also I have tried to use different drives.
Initially there were HGST DeskStar 6TB drives used in that RAID, now I have been replacing them with HGST UltraStar 6TB when replacing failed drives. But the behaviour is the same.



Also it seems (most of) the drives are not really failed, because once I replaced the RAID controller, one failed drive was recognized as OK again and the rebuild started.



My hoster's support says the problem is that I'm actually using RAID5 and I should switch to RAID10 instead. It's hard for me to believe as I've been using RAID5 without problems at other systems (no drive failures in years).



Can anyone give me a hint, what else the culprit could be? Something wrong with the way the RAID controller is configured?



Thank you!



EDIT:

The server is a HP DL180 G9

Reason for drive failure is always "Write retries failed"



 Smart Array P840 in Slot 1 (sn: PDNNF0ARH321GD)


Port Name: 1I

Port Name: 2I

Internal Drive Cage at Port 1I, Box 2, OK

Internal Drive Cage at Port 1I, Box 2, OK

Internal Drive Cage at Port 2I, Box 1, OK
array A (Solid State SATA, Unused Space: 0 MB)


logicaldrive 1 (447.1 GB, RAID 1+0, OK)

physicaldrive 2I:1:1 (port 2I:box 1:bay 1, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, Solid State SATA, 240.0 GB, OK)

array B (SATA, Unused Space: 0 MB)


logicaldrive 2 (38.2 TB, RAID 5, Interim Recovery Mode)

physicaldrive 1I:2:1 (port 1I:box 2:bay 1, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:2 (port 1I:box 2:bay 2, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:3 (port 1I:box 2:bay 3, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:4 (port 1I:box 2:bay 4, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:5 (port 1I:box 2:bay 5, SATA, 6001.1 GB, Failed)
physicaldrive 1I:2:6 (port 1I:box 2:bay 6, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:7 (port 1I:box 2:bay 7, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:8 (port 1I:box 2:bay 8, SATA, 6001.1 GB, OK)


Detailed Info:



 Smart Array P840 in Slot 1
Bus Interface: PCI
Slot: 1
Serial Number: PDNNF0ARH321GD
Cache Serial Number: PEYFP0BRH323YZ
RAID 6 (ADG) Status: Enabled
Controller Status: OK
Hardware Revision: B
Firmware Version: 6.60
Rebuild Priority: High
Expand Priority: Medium
Surface Scan Delay: 3 secs
Surface Scan Mode: Idle
Parallel Surface Scan Supported: Yes
Current Parallel Surface Scan Count: 1
Max Parallel Surface Scan Count: 16
Queue Depth: Automatic
Monitor and Performance Delay: 60 min
Elevator Sort: Enabled
Degraded Performance Optimization: Disabled
Inconsistency Repair Policy: Disabled
Wait for Cache Room: Disabled
Surface Analysis Inconsistency Notification: Disabled
Post Prompt Timeout: 15 secs
Cache Board Present: True
Cache Status: OK
Cache Ratio: 10% Read / 90% Write
Drive Write Cache: Enabled
Total Cache Size: 4.0 GB
Total Cache Memory Available: 3.2 GB
No-Battery Write Cache: Enabled
SSD Caching RAID5 WriteBack Enabled: True
SSD Caching Version: 2
Cache Backup Power Source: Batteries
Battery/Capacitor Count: 1
Battery/Capacitor Status: OK
SATA NCQ Supported: True
Spare Activation Mode: Activate on physical drive failure (default)
Controller Temperature (C): 51
Cache Module Temperature (C): 38
Number of Ports: 2 Internal only
Encryption: Disabled
Express Local Encryption: False
Driver Name: hpsa
Driver Version: 3.4.16
Driver Supports HP SSD Smart Path: True
PCI Address (Domain:Bus:Device.Function): 0000:06:00.0
Negotiated PCIe Data Rate: PCIe 3.0 x8 (7880 MB/s)
Controller Mode: RAID
Controller Mode Reboot: Not Required
Latency Scheduler Setting: Disabled
Current Power Mode: MaxPerformance
Host Serial Number: CZ270500GM
Sanitize Erase Supported: False
Primary Boot Volume: logicaldrive 1 (600508B1001CE0F9FACF3A1358647115)
Secondary Boot Volume: logicaldrive 1 (600508B1001CE0F9FACF3A1358647115)


Port Name: 1I
Port ID: 0
Port Connection Number: 0
SAS Address: 5001438038AD05A0
Port Location: Internal
Managed Cable Connected: False

Port Name: 2I
Port ID: 1
Port Connection Number: 1
SAS Address: 5001438038AD05A8
Port Location: Internal
Managed Cable Connected: False

Internal Drive Cage at Port 1I, Box 2, OK
Power Supply Status: Not Redundant
Drive Bays: 4
Port: 1I
Box: 2
Location: Internal

Physical Drives
physicaldrive 1I:2:1 (port 1I:box 2:bay 1, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:2 (port 1I:box 2:bay 2, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:3 (port 1I:box 2:bay 3, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:4 (port 1I:box 2:bay 4, SATA, 6001.1 GB, OK)
None attached


Internal Drive Cage at Port 1I, Box 2, OK
Power Supply Status: Not Redundant
Drive Bays: 4
Port: 1I
Box: 2
Location: Internal

Physical Drives
physicaldrive 1I:2:1 (port 1I:box 2:bay 1, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:2 (port 1I:box 2:bay 2, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:3 (port 1I:box 2:bay 3, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:4 (port 1I:box 2:bay 4, SATA, 6001.1 GB, OK)
None attached


Internal Drive Cage at Port 2I, Box 1, OK
Power Supply Status: Not Redundant
Drive Bays: 4
Port: 2I
Box: 1
Location: Internal

Physical Drives
physicaldrive 2I:1:1 (port 2I:box 1:bay 1, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, Solid State SATA, 240.0 GB, OK)
None attached

Array: A
Interface Type: Solid State SATA
Unused Space: 0 MB (0.0%)
Used Space: 894.2 GB (100.0%)
Status: OK
MultiDomain Status: OK
Array Type: Data
HP SSD Smart Path: disable



Logical Drive: 1
Size: 447.1 GB
Fault Tolerance: 1+0
Heads: 255
Sectors Per Track: 32
Cylinders: 65535
Strip Size: 256 KB
Full Stripe Size: 512 KB
Status: OK
MultiDomain Status: OK
Caching: Enabled
Unique Identifier: 600508B1001CE0F9FACF3A1358647115
Disk Name: /dev/sda
Mount Points: / 18.6 GB Partition Number 2
OS Status: LOCKED
Logical Drive Label: 0216D6F9PDNNF0ARH502MC7DFA
Mirror Group 1:
physicaldrive 2I:1:1 (port 2I:box 1:bay 1, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, Solid State SATA, 240.0 GB, OK)
Mirror Group 2:
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, Solid State SATA, 240.0 GB, OK)
Drive Type: Data
LD Acceleration Method: Controller Cache

physicaldrive 2I:1:1
Port: 2I
Box: 1
Bay: 1
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV712004AG240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 31
Maximum Temperature (C): 39
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 2I:1:2
Port: 2I
Box: 1
Bay: 2
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV706303CH240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 29
Maximum Temperature (C): 36
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 2I:1:3
Port: 2I
Box: 1
Bay: 3
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV712003V8240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 29
Maximum Temperature (C): 35
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 2I:1:4
Port: 2I
Box: 1
Bay: 4
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV712004GA240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 31
Maximum Temperature (C): 37
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False


Array: B
Interface Type: SATA
Unused Space: 0 MB (0.0%)
Used Space: 43.7 TB (100.0%)
Status: Failed Physical Drive
MultiDomain Status: OK
Array Type: Data
HP SSD Smart Path: disable

Warning: One of the drives on this array have failed or has been removed.




Logical Drive: 2
Size: 38.2 TB
Fault Tolerance: 5
Heads: 255
Sectors Per Track: 32
Cylinders: 65535
Strip Size: 256 KB
Full Stripe Size: 1792 KB
Status: Interim Recovery Mode
MultiDomain Status: OK
Caching: Enabled
Parity Initialization Status: Initialization Failed
Unique Identifier: 600508B1001CF94F84873C91FD89B549
Disk Name: /dev/sdb
Mount Points: None
Logical Drive Label: 04DA1DD6PDNNF0ARH502MC546F
Drive Type: Data
LD Acceleration Method: Controller Cache

physicaldrive 1I:2:1
Port: 1I
Box: 2
Bay: 1
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNW7JH
Serial Number: NAHN3UZY
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 37
Maximum Temperature (C): 43
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:2
Port: 1I
Box: 2
Bay: 2
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNT517
Serial Number: NAHLKP0X
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 37
Maximum Temperature (C): 56
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:3
Port: 1I
Box: 2
Bay: 3
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: T7MH
Serial Number: NCH8E81Z
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 33
Maximum Temperature (C): 41
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:4
Port: 1I
Box: 2
Bay: 4
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNW7JH
Serial Number: NAHYMAUY
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 34
Maximum Temperature (C): 41
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:5
Port: 1I
Box: 2
Bay: 5
Status: Failed
Last Failure Reason: Write retries failed
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: T7MH
Serial Number: K1H942MD
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Maximum Temperature (C): 43
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Applicable
Sanitize Erase Supported: False

physicaldrive 1I:2:6
Port: 1I
Box: 2
Bay: 6
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: TDR2
Serial Number: K8JM5TKN
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 33
Maximum Temperature (C): 38
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:7
Port: 1I
Box: 2
Bay: 7
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNW7JH
Serial Number: K8H9BW2N
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 34
Maximum Temperature (C): 39
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:8
Port: 1I
Box: 2
Bay: 8
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: T7MH
Serial Number: K1H623JD
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 35
Maximum Temperature (C): 40
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False









share|improve this question



















  • 1





    One of the drives reports 56C maybe that’s a thermal problem? All your drives seem 4K, so I guess the stripes would be correctly aligned.

    – eckes
    Mar 17 at 16:16













6












6








6








I've been using a RAID5 HDD storage (8x6TB) at my HP P840 for like 2 years now and it has always had unusually many drive failures. Everything was good for half a year, but now drives are failing in a strange way.
For example 2 new drives failed a few days after they have been added to the RAID.
I have also already replaced the RAID controller and am using up-to-date firmware on Mainboard and RAID controller.



Also I have tried to use different drives.
Initially there were HGST DeskStar 6TB drives used in that RAID, now I have been replacing them with HGST UltraStar 6TB when replacing failed drives. But the behaviour is the same.



Also it seems (most of) the drives are not really failed, because once I replaced the RAID controller, one failed drive was recognized as OK again and the rebuild started.



My hoster's support says the problem is that I'm actually using RAID5 and I should switch to RAID10 instead. It's hard for me to believe as I've been using RAID5 without problems at other systems (no drive failures in years).



Can anyone give me a hint, what else the culprit could be? Something wrong with the way the RAID controller is configured?



Thank you!



EDIT:

The server is a HP DL180 G9

Reason for drive failure is always "Write retries failed"



 Smart Array P840 in Slot 1 (sn: PDNNF0ARH321GD)


Port Name: 1I

Port Name: 2I

Internal Drive Cage at Port 1I, Box 2, OK

Internal Drive Cage at Port 1I, Box 2, OK

Internal Drive Cage at Port 2I, Box 1, OK
array A (Solid State SATA, Unused Space: 0 MB)


logicaldrive 1 (447.1 GB, RAID 1+0, OK)

physicaldrive 2I:1:1 (port 2I:box 1:bay 1, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, Solid State SATA, 240.0 GB, OK)

array B (SATA, Unused Space: 0 MB)


logicaldrive 2 (38.2 TB, RAID 5, Interim Recovery Mode)

physicaldrive 1I:2:1 (port 1I:box 2:bay 1, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:2 (port 1I:box 2:bay 2, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:3 (port 1I:box 2:bay 3, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:4 (port 1I:box 2:bay 4, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:5 (port 1I:box 2:bay 5, SATA, 6001.1 GB, Failed)
physicaldrive 1I:2:6 (port 1I:box 2:bay 6, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:7 (port 1I:box 2:bay 7, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:8 (port 1I:box 2:bay 8, SATA, 6001.1 GB, OK)


Detailed Info:



 Smart Array P840 in Slot 1
Bus Interface: PCI
Slot: 1
Serial Number: PDNNF0ARH321GD
Cache Serial Number: PEYFP0BRH323YZ
RAID 6 (ADG) Status: Enabled
Controller Status: OK
Hardware Revision: B
Firmware Version: 6.60
Rebuild Priority: High
Expand Priority: Medium
Surface Scan Delay: 3 secs
Surface Scan Mode: Idle
Parallel Surface Scan Supported: Yes
Current Parallel Surface Scan Count: 1
Max Parallel Surface Scan Count: 16
Queue Depth: Automatic
Monitor and Performance Delay: 60 min
Elevator Sort: Enabled
Degraded Performance Optimization: Disabled
Inconsistency Repair Policy: Disabled
Wait for Cache Room: Disabled
Surface Analysis Inconsistency Notification: Disabled
Post Prompt Timeout: 15 secs
Cache Board Present: True
Cache Status: OK
Cache Ratio: 10% Read / 90% Write
Drive Write Cache: Enabled
Total Cache Size: 4.0 GB
Total Cache Memory Available: 3.2 GB
No-Battery Write Cache: Enabled
SSD Caching RAID5 WriteBack Enabled: True
SSD Caching Version: 2
Cache Backup Power Source: Batteries
Battery/Capacitor Count: 1
Battery/Capacitor Status: OK
SATA NCQ Supported: True
Spare Activation Mode: Activate on physical drive failure (default)
Controller Temperature (C): 51
Cache Module Temperature (C): 38
Number of Ports: 2 Internal only
Encryption: Disabled
Express Local Encryption: False
Driver Name: hpsa
Driver Version: 3.4.16
Driver Supports HP SSD Smart Path: True
PCI Address (Domain:Bus:Device.Function): 0000:06:00.0
Negotiated PCIe Data Rate: PCIe 3.0 x8 (7880 MB/s)
Controller Mode: RAID
Controller Mode Reboot: Not Required
Latency Scheduler Setting: Disabled
Current Power Mode: MaxPerformance
Host Serial Number: CZ270500GM
Sanitize Erase Supported: False
Primary Boot Volume: logicaldrive 1 (600508B1001CE0F9FACF3A1358647115)
Secondary Boot Volume: logicaldrive 1 (600508B1001CE0F9FACF3A1358647115)


Port Name: 1I
Port ID: 0
Port Connection Number: 0
SAS Address: 5001438038AD05A0
Port Location: Internal
Managed Cable Connected: False

Port Name: 2I
Port ID: 1
Port Connection Number: 1
SAS Address: 5001438038AD05A8
Port Location: Internal
Managed Cable Connected: False

Internal Drive Cage at Port 1I, Box 2, OK
Power Supply Status: Not Redundant
Drive Bays: 4
Port: 1I
Box: 2
Location: Internal

Physical Drives
physicaldrive 1I:2:1 (port 1I:box 2:bay 1, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:2 (port 1I:box 2:bay 2, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:3 (port 1I:box 2:bay 3, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:4 (port 1I:box 2:bay 4, SATA, 6001.1 GB, OK)
None attached


Internal Drive Cage at Port 1I, Box 2, OK
Power Supply Status: Not Redundant
Drive Bays: 4
Port: 1I
Box: 2
Location: Internal

Physical Drives
physicaldrive 1I:2:1 (port 1I:box 2:bay 1, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:2 (port 1I:box 2:bay 2, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:3 (port 1I:box 2:bay 3, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:4 (port 1I:box 2:bay 4, SATA, 6001.1 GB, OK)
None attached


Internal Drive Cage at Port 2I, Box 1, OK
Power Supply Status: Not Redundant
Drive Bays: 4
Port: 2I
Box: 1
Location: Internal

Physical Drives
physicaldrive 2I:1:1 (port 2I:box 1:bay 1, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, Solid State SATA, 240.0 GB, OK)
None attached

Array: A
Interface Type: Solid State SATA
Unused Space: 0 MB (0.0%)
Used Space: 894.2 GB (100.0%)
Status: OK
MultiDomain Status: OK
Array Type: Data
HP SSD Smart Path: disable



Logical Drive: 1
Size: 447.1 GB
Fault Tolerance: 1+0
Heads: 255
Sectors Per Track: 32
Cylinders: 65535
Strip Size: 256 KB
Full Stripe Size: 512 KB
Status: OK
MultiDomain Status: OK
Caching: Enabled
Unique Identifier: 600508B1001CE0F9FACF3A1358647115
Disk Name: /dev/sda
Mount Points: / 18.6 GB Partition Number 2
OS Status: LOCKED
Logical Drive Label: 0216D6F9PDNNF0ARH502MC7DFA
Mirror Group 1:
physicaldrive 2I:1:1 (port 2I:box 1:bay 1, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, Solid State SATA, 240.0 GB, OK)
Mirror Group 2:
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, Solid State SATA, 240.0 GB, OK)
Drive Type: Data
LD Acceleration Method: Controller Cache

physicaldrive 2I:1:1
Port: 2I
Box: 1
Bay: 1
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV712004AG240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 31
Maximum Temperature (C): 39
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 2I:1:2
Port: 2I
Box: 1
Bay: 2
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV706303CH240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 29
Maximum Temperature (C): 36
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 2I:1:3
Port: 2I
Box: 1
Bay: 3
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV712003V8240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 29
Maximum Temperature (C): 35
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 2I:1:4
Port: 2I
Box: 1
Bay: 4
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV712004GA240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 31
Maximum Temperature (C): 37
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False


Array: B
Interface Type: SATA
Unused Space: 0 MB (0.0%)
Used Space: 43.7 TB (100.0%)
Status: Failed Physical Drive
MultiDomain Status: OK
Array Type: Data
HP SSD Smart Path: disable

Warning: One of the drives on this array have failed or has been removed.




Logical Drive: 2
Size: 38.2 TB
Fault Tolerance: 5
Heads: 255
Sectors Per Track: 32
Cylinders: 65535
Strip Size: 256 KB
Full Stripe Size: 1792 KB
Status: Interim Recovery Mode
MultiDomain Status: OK
Caching: Enabled
Parity Initialization Status: Initialization Failed
Unique Identifier: 600508B1001CF94F84873C91FD89B549
Disk Name: /dev/sdb
Mount Points: None
Logical Drive Label: 04DA1DD6PDNNF0ARH502MC546F
Drive Type: Data
LD Acceleration Method: Controller Cache

physicaldrive 1I:2:1
Port: 1I
Box: 2
Bay: 1
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNW7JH
Serial Number: NAHN3UZY
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 37
Maximum Temperature (C): 43
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:2
Port: 1I
Box: 2
Bay: 2
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNT517
Serial Number: NAHLKP0X
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 37
Maximum Temperature (C): 56
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:3
Port: 1I
Box: 2
Bay: 3
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: T7MH
Serial Number: NCH8E81Z
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 33
Maximum Temperature (C): 41
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:4
Port: 1I
Box: 2
Bay: 4
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNW7JH
Serial Number: NAHYMAUY
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 34
Maximum Temperature (C): 41
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:5
Port: 1I
Box: 2
Bay: 5
Status: Failed
Last Failure Reason: Write retries failed
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: T7MH
Serial Number: K1H942MD
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Maximum Temperature (C): 43
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Applicable
Sanitize Erase Supported: False

physicaldrive 1I:2:6
Port: 1I
Box: 2
Bay: 6
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: TDR2
Serial Number: K8JM5TKN
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 33
Maximum Temperature (C): 38
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:7
Port: 1I
Box: 2
Bay: 7
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNW7JH
Serial Number: K8H9BW2N
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 34
Maximum Temperature (C): 39
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:8
Port: 1I
Box: 2
Bay: 8
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: T7MH
Serial Number: K1H623JD
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 35
Maximum Temperature (C): 40
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False









share|improve this question
















I've been using a RAID5 HDD storage (8x6TB) at my HP P840 for like 2 years now and it has always had unusually many drive failures. Everything was good for half a year, but now drives are failing in a strange way.
For example 2 new drives failed a few days after they have been added to the RAID.
I have also already replaced the RAID controller and am using up-to-date firmware on Mainboard and RAID controller.



Also I have tried to use different drives.
Initially there were HGST DeskStar 6TB drives used in that RAID, now I have been replacing them with HGST UltraStar 6TB when replacing failed drives. But the behaviour is the same.



Also it seems (most of) the drives are not really failed, because once I replaced the RAID controller, one failed drive was recognized as OK again and the rebuild started.



My hoster's support says the problem is that I'm actually using RAID5 and I should switch to RAID10 instead. It's hard for me to believe as I've been using RAID5 without problems at other systems (no drive failures in years).



Can anyone give me a hint, what else the culprit could be? Something wrong with the way the RAID controller is configured?



Thank you!



EDIT:

The server is a HP DL180 G9

Reason for drive failure is always "Write retries failed"



 Smart Array P840 in Slot 1 (sn: PDNNF0ARH321GD)


Port Name: 1I

Port Name: 2I

Internal Drive Cage at Port 1I, Box 2, OK

Internal Drive Cage at Port 1I, Box 2, OK

Internal Drive Cage at Port 2I, Box 1, OK
array A (Solid State SATA, Unused Space: 0 MB)


logicaldrive 1 (447.1 GB, RAID 1+0, OK)

physicaldrive 2I:1:1 (port 2I:box 1:bay 1, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, Solid State SATA, 240.0 GB, OK)

array B (SATA, Unused Space: 0 MB)


logicaldrive 2 (38.2 TB, RAID 5, Interim Recovery Mode)

physicaldrive 1I:2:1 (port 1I:box 2:bay 1, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:2 (port 1I:box 2:bay 2, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:3 (port 1I:box 2:bay 3, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:4 (port 1I:box 2:bay 4, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:5 (port 1I:box 2:bay 5, SATA, 6001.1 GB, Failed)
physicaldrive 1I:2:6 (port 1I:box 2:bay 6, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:7 (port 1I:box 2:bay 7, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:8 (port 1I:box 2:bay 8, SATA, 6001.1 GB, OK)


Detailed Info:



 Smart Array P840 in Slot 1
Bus Interface: PCI
Slot: 1
Serial Number: PDNNF0ARH321GD
Cache Serial Number: PEYFP0BRH323YZ
RAID 6 (ADG) Status: Enabled
Controller Status: OK
Hardware Revision: B
Firmware Version: 6.60
Rebuild Priority: High
Expand Priority: Medium
Surface Scan Delay: 3 secs
Surface Scan Mode: Idle
Parallel Surface Scan Supported: Yes
Current Parallel Surface Scan Count: 1
Max Parallel Surface Scan Count: 16
Queue Depth: Automatic
Monitor and Performance Delay: 60 min
Elevator Sort: Enabled
Degraded Performance Optimization: Disabled
Inconsistency Repair Policy: Disabled
Wait for Cache Room: Disabled
Surface Analysis Inconsistency Notification: Disabled
Post Prompt Timeout: 15 secs
Cache Board Present: True
Cache Status: OK
Cache Ratio: 10% Read / 90% Write
Drive Write Cache: Enabled
Total Cache Size: 4.0 GB
Total Cache Memory Available: 3.2 GB
No-Battery Write Cache: Enabled
SSD Caching RAID5 WriteBack Enabled: True
SSD Caching Version: 2
Cache Backup Power Source: Batteries
Battery/Capacitor Count: 1
Battery/Capacitor Status: OK
SATA NCQ Supported: True
Spare Activation Mode: Activate on physical drive failure (default)
Controller Temperature (C): 51
Cache Module Temperature (C): 38
Number of Ports: 2 Internal only
Encryption: Disabled
Express Local Encryption: False
Driver Name: hpsa
Driver Version: 3.4.16
Driver Supports HP SSD Smart Path: True
PCI Address (Domain:Bus:Device.Function): 0000:06:00.0
Negotiated PCIe Data Rate: PCIe 3.0 x8 (7880 MB/s)
Controller Mode: RAID
Controller Mode Reboot: Not Required
Latency Scheduler Setting: Disabled
Current Power Mode: MaxPerformance
Host Serial Number: CZ270500GM
Sanitize Erase Supported: False
Primary Boot Volume: logicaldrive 1 (600508B1001CE0F9FACF3A1358647115)
Secondary Boot Volume: logicaldrive 1 (600508B1001CE0F9FACF3A1358647115)


Port Name: 1I
Port ID: 0
Port Connection Number: 0
SAS Address: 5001438038AD05A0
Port Location: Internal
Managed Cable Connected: False

Port Name: 2I
Port ID: 1
Port Connection Number: 1
SAS Address: 5001438038AD05A8
Port Location: Internal
Managed Cable Connected: False

Internal Drive Cage at Port 1I, Box 2, OK
Power Supply Status: Not Redundant
Drive Bays: 4
Port: 1I
Box: 2
Location: Internal

Physical Drives
physicaldrive 1I:2:1 (port 1I:box 2:bay 1, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:2 (port 1I:box 2:bay 2, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:3 (port 1I:box 2:bay 3, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:4 (port 1I:box 2:bay 4, SATA, 6001.1 GB, OK)
None attached


Internal Drive Cage at Port 1I, Box 2, OK
Power Supply Status: Not Redundant
Drive Bays: 4
Port: 1I
Box: 2
Location: Internal

Physical Drives
physicaldrive 1I:2:1 (port 1I:box 2:bay 1, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:2 (port 1I:box 2:bay 2, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:3 (port 1I:box 2:bay 3, SATA, 6001.1 GB, OK)
physicaldrive 1I:2:4 (port 1I:box 2:bay 4, SATA, 6001.1 GB, OK)
None attached


Internal Drive Cage at Port 2I, Box 1, OK
Power Supply Status: Not Redundant
Drive Bays: 4
Port: 2I
Box: 1
Location: Internal

Physical Drives
physicaldrive 2I:1:1 (port 2I:box 1:bay 1, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, Solid State SATA, 240.0 GB, OK)
None attached

Array: A
Interface Type: Solid State SATA
Unused Space: 0 MB (0.0%)
Used Space: 894.2 GB (100.0%)
Status: OK
MultiDomain Status: OK
Array Type: Data
HP SSD Smart Path: disable



Logical Drive: 1
Size: 447.1 GB
Fault Tolerance: 1+0
Heads: 255
Sectors Per Track: 32
Cylinders: 65535
Strip Size: 256 KB
Full Stripe Size: 512 KB
Status: OK
MultiDomain Status: OK
Caching: Enabled
Unique Identifier: 600508B1001CE0F9FACF3A1358647115
Disk Name: /dev/sda
Mount Points: / 18.6 GB Partition Number 2
OS Status: LOCKED
Logical Drive Label: 0216D6F9PDNNF0ARH502MC7DFA
Mirror Group 1:
physicaldrive 2I:1:1 (port 2I:box 1:bay 1, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, Solid State SATA, 240.0 GB, OK)
Mirror Group 2:
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, Solid State SATA, 240.0 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, Solid State SATA, 240.0 GB, OK)
Drive Type: Data
LD Acceleration Method: Controller Cache

physicaldrive 2I:1:1
Port: 2I
Box: 1
Bay: 1
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV712004AG240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 31
Maximum Temperature (C): 39
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 2I:1:2
Port: 2I
Box: 1
Bay: 2
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV706303CH240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 29
Maximum Temperature (C): 36
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 2I:1:3
Port: 2I
Box: 1
Bay: 3
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV712003V8240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 29
Maximum Temperature (C): 35
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 2I:1:4
Port: 2I
Box: 1
Bay: 4
Status: OK
Drive Type: Data Drive
Interface Type: Solid State SATA
Size: 240.0 GB
Drive exposed to OS: False
Native Block Size: 4096
Firmware Revision: N2010101
Serial Number: PHDV712004GA240AGN
Model: ATA INTEL SSDSC2BB24
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 31
Maximum Temperature (C): 37
SSD Smart Trip Wearout: Not Supported
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False


Array: B
Interface Type: SATA
Unused Space: 0 MB (0.0%)
Used Space: 43.7 TB (100.0%)
Status: Failed Physical Drive
MultiDomain Status: OK
Array Type: Data
HP SSD Smart Path: disable

Warning: One of the drives on this array have failed or has been removed.




Logical Drive: 2
Size: 38.2 TB
Fault Tolerance: 5
Heads: 255
Sectors Per Track: 32
Cylinders: 65535
Strip Size: 256 KB
Full Stripe Size: 1792 KB
Status: Interim Recovery Mode
MultiDomain Status: OK
Caching: Enabled
Parity Initialization Status: Initialization Failed
Unique Identifier: 600508B1001CF94F84873C91FD89B549
Disk Name: /dev/sdb
Mount Points: None
Logical Drive Label: 04DA1DD6PDNNF0ARH502MC546F
Drive Type: Data
LD Acceleration Method: Controller Cache

physicaldrive 1I:2:1
Port: 1I
Box: 2
Bay: 1
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNW7JH
Serial Number: NAHN3UZY
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 37
Maximum Temperature (C): 43
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:2
Port: 1I
Box: 2
Bay: 2
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNT517
Serial Number: NAHLKP0X
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 37
Maximum Temperature (C): 56
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:3
Port: 1I
Box: 2
Bay: 3
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: T7MH
Serial Number: NCH8E81Z
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 33
Maximum Temperature (C): 41
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:4
Port: 1I
Box: 2
Bay: 4
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNW7JH
Serial Number: NAHYMAUY
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 34
Maximum Temperature (C): 41
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:5
Port: 1I
Box: 2
Bay: 5
Status: Failed
Last Failure Reason: Write retries failed
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: T7MH
Serial Number: K1H942MD
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Maximum Temperature (C): 43
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Applicable
Sanitize Erase Supported: False

physicaldrive 1I:2:6
Port: 1I
Box: 2
Bay: 6
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: TDR2
Serial Number: K8JM5TKN
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 33
Maximum Temperature (C): 38
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:7
Port: 1I
Box: 2
Bay: 7
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: APGNW7JH
Serial Number: K8H9BW2N
Model: ATA HGST HDN726060AL
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 34
Maximum Temperature (C): 39
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False

physicaldrive 1I:2:8
Port: 1I
Box: 2
Bay: 8
Status: OK
Drive Type: Data Drive
Interface Type: SATA
Size: 6001.1 GB
Drive exposed to OS: False
Native Block Size: 4096
Rotational Speed: 7200
Firmware Revision: T7MH
Serial Number: K1H623JD
Model: ATA HUS726060ALE610
SATA NCQ Capable: True
SATA NCQ Enabled: True
Current Temperature (C): 35
Maximum Temperature (C): 40
PHY Count: 1
PHY Transfer Rate: 6.0Gbps
Drive Authentication Status: Not Authenticated. Smart Array will not control drive LEDs.
Sanitize Erase Supported: False






raid raid5 hp-smart-array drive-failure raid-controller






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Mar 17 at 14:36







Laord

















asked Mar 17 at 9:03









LaordLaord

384




384







  • 1





    One of the drives reports 56C maybe that’s a thermal problem? All your drives seem 4K, so I guess the stripes would be correctly aligned.

    – eckes
    Mar 17 at 16:16












  • 1





    One of the drives reports 56C maybe that’s a thermal problem? All your drives seem 4K, so I guess the stripes would be correctly aligned.

    – eckes
    Mar 17 at 16:16







1




1





One of the drives reports 56C maybe that’s a thermal problem? All your drives seem 4K, so I guess the stripes would be correctly aligned.

– eckes
Mar 17 at 16:16





One of the drives reports 56C maybe that’s a thermal problem? All your drives seem 4K, so I guess the stripes would be correctly aligned.

– eckes
Mar 17 at 16:16










2 Answers
2






active

oldest

votes


















8














You probably have an heavily punctured array, which cause an early "planned death" of the replaced due to failed stripe reconstruction. You can read more information here and here



The solution is to backup, destroy the array, recreate it and restore from backup.



Next time avoid using a RAID5 array with such big drives. I strongly suggest using RAID6 or, even better, RAID10.






share|improve this answer






























    4














    You should be using RAID6 with the size and types of disks in the system. However, some details about the server may be helpful.



    Is this an HPE server, or are you just using an HPE controller?



    These don't appear to be HPE drives or HPE drive carriers. That's a bad sign.



    The hpssacli output you've provided would also show the reason for the disk failure. If you're not on an HPE server and there's a backplane issue or SATA timeouts (noticed you're on SATA disks), there's chance that you're getting false positives.



    Example: (see the Last Failure Reason line):



     physicaldrive 2I:2:8
    Port: 2I
    Box: 2
    Bay: 8
    Status: Failed
    Last Failure Reason: Aborted Command
    Drive Type: Data Drive





    share|improve this answer























    • Thanks for your answer. The server is a HP DL180 G9; Reason for drive failure is always "Write retries failed". I have added this info to my question.

      – Laord
      Mar 17 at 14:37











    • Why aren't you using HP disks on this system? Are these genuine HP drive carriers?

      – ewwhite
      Mar 17 at 14:47











    • This is rented hardware. I'm using these drives because they are the ones the hoster is offering. Actually i cannot find any info on whether these drives are somewhat certified for HP.

      – Laord
      Mar 17 at 14:59






    • 1





      They are not certified. Your hosting provider is using cheap consumer disks.

      – ewwhite
      Mar 17 at 15:03










    Your Answer








    StackExchange.ready(function()
    var channelOptions =
    tags: "".split(" "),
    id: "2"
    ;
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function()
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled)
    StackExchange.using("snippets", function()
    createEditor();
    );

    else
    createEditor();

    );

    function createEditor()
    StackExchange.prepareEditor(
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader:
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    ,
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    );



    );













    draft saved

    draft discarded


















    StackExchange.ready(
    function ()
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fserverfault.com%2fquestions%2f958650%2fhp-p840-hdd-raid-5-many-strange-drive-failures%23new-answer', 'question_page');

    );

    Post as a guest















    Required, but never shown

























    2 Answers
    2






    active

    oldest

    votes








    2 Answers
    2






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    8














    You probably have an heavily punctured array, which cause an early "planned death" of the replaced due to failed stripe reconstruction. You can read more information here and here



    The solution is to backup, destroy the array, recreate it and restore from backup.



    Next time avoid using a RAID5 array with such big drives. I strongly suggest using RAID6 or, even better, RAID10.






    share|improve this answer



























      8














      You probably have an heavily punctured array, which cause an early "planned death" of the replaced due to failed stripe reconstruction. You can read more information here and here



      The solution is to backup, destroy the array, recreate it and restore from backup.



      Next time avoid using a RAID5 array with such big drives. I strongly suggest using RAID6 or, even better, RAID10.






      share|improve this answer

























        8












        8








        8







        You probably have an heavily punctured array, which cause an early "planned death" of the replaced due to failed stripe reconstruction. You can read more information here and here



        The solution is to backup, destroy the array, recreate it and restore from backup.



        Next time avoid using a RAID5 array with such big drives. I strongly suggest using RAID6 or, even better, RAID10.






        share|improve this answer













        You probably have an heavily punctured array, which cause an early "planned death" of the replaced due to failed stripe reconstruction. You can read more information here and here



        The solution is to backup, destroy the array, recreate it and restore from backup.



        Next time avoid using a RAID5 array with such big drives. I strongly suggest using RAID6 or, even better, RAID10.







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Mar 17 at 9:55









        shodanshokshodanshok

        26.5k34787




        26.5k34787























            4














            You should be using RAID6 with the size and types of disks in the system. However, some details about the server may be helpful.



            Is this an HPE server, or are you just using an HPE controller?



            These don't appear to be HPE drives or HPE drive carriers. That's a bad sign.



            The hpssacli output you've provided would also show the reason for the disk failure. If you're not on an HPE server and there's a backplane issue or SATA timeouts (noticed you're on SATA disks), there's chance that you're getting false positives.



            Example: (see the Last Failure Reason line):



             physicaldrive 2I:2:8
            Port: 2I
            Box: 2
            Bay: 8
            Status: Failed
            Last Failure Reason: Aborted Command
            Drive Type: Data Drive





            share|improve this answer























            • Thanks for your answer. The server is a HP DL180 G9; Reason for drive failure is always "Write retries failed". I have added this info to my question.

              – Laord
              Mar 17 at 14:37











            • Why aren't you using HP disks on this system? Are these genuine HP drive carriers?

              – ewwhite
              Mar 17 at 14:47











            • This is rented hardware. I'm using these drives because they are the ones the hoster is offering. Actually i cannot find any info on whether these drives are somewhat certified for HP.

              – Laord
              Mar 17 at 14:59






            • 1





              They are not certified. Your hosting provider is using cheap consumer disks.

              – ewwhite
              Mar 17 at 15:03















            4














            You should be using RAID6 with the size and types of disks in the system. However, some details about the server may be helpful.



            Is this an HPE server, or are you just using an HPE controller?



            These don't appear to be HPE drives or HPE drive carriers. That's a bad sign.



            The hpssacli output you've provided would also show the reason for the disk failure. If you're not on an HPE server and there's a backplane issue or SATA timeouts (noticed you're on SATA disks), there's chance that you're getting false positives.



            Example: (see the Last Failure Reason line):



             physicaldrive 2I:2:8
            Port: 2I
            Box: 2
            Bay: 8
            Status: Failed
            Last Failure Reason: Aborted Command
            Drive Type: Data Drive





            share|improve this answer























            • Thanks for your answer. The server is a HP DL180 G9; Reason for drive failure is always "Write retries failed". I have added this info to my question.

              – Laord
              Mar 17 at 14:37











            • Why aren't you using HP disks on this system? Are these genuine HP drive carriers?

              – ewwhite
              Mar 17 at 14:47











            • This is rented hardware. I'm using these drives because they are the ones the hoster is offering. Actually i cannot find any info on whether these drives are somewhat certified for HP.

              – Laord
              Mar 17 at 14:59






            • 1





              They are not certified. Your hosting provider is using cheap consumer disks.

              – ewwhite
              Mar 17 at 15:03













            4












            4








            4







            You should be using RAID6 with the size and types of disks in the system. However, some details about the server may be helpful.



            Is this an HPE server, or are you just using an HPE controller?



            These don't appear to be HPE drives or HPE drive carriers. That's a bad sign.



            The hpssacli output you've provided would also show the reason for the disk failure. If you're not on an HPE server and there's a backplane issue or SATA timeouts (noticed you're on SATA disks), there's chance that you're getting false positives.



            Example: (see the Last Failure Reason line):



             physicaldrive 2I:2:8
            Port: 2I
            Box: 2
            Bay: 8
            Status: Failed
            Last Failure Reason: Aborted Command
            Drive Type: Data Drive





            share|improve this answer













            You should be using RAID6 with the size and types of disks in the system. However, some details about the server may be helpful.



            Is this an HPE server, or are you just using an HPE controller?



            These don't appear to be HPE drives or HPE drive carriers. That's a bad sign.



            The hpssacli output you've provided would also show the reason for the disk failure. If you're not on an HPE server and there's a backplane issue or SATA timeouts (noticed you're on SATA disks), there's chance that you're getting false positives.



            Example: (see the Last Failure Reason line):



             physicaldrive 2I:2:8
            Port: 2I
            Box: 2
            Bay: 8
            Status: Failed
            Last Failure Reason: Aborted Command
            Drive Type: Data Drive






            share|improve this answer












            share|improve this answer



            share|improve this answer










            answered Mar 17 at 13:11









            ewwhiteewwhite

            174k75368720




            174k75368720












            • Thanks for your answer. The server is a HP DL180 G9; Reason for drive failure is always "Write retries failed". I have added this info to my question.

              – Laord
              Mar 17 at 14:37











            • Why aren't you using HP disks on this system? Are these genuine HP drive carriers?

              – ewwhite
              Mar 17 at 14:47











            • This is rented hardware. I'm using these drives because they are the ones the hoster is offering. Actually i cannot find any info on whether these drives are somewhat certified for HP.

              – Laord
              Mar 17 at 14:59






            • 1





              They are not certified. Your hosting provider is using cheap consumer disks.

              – ewwhite
              Mar 17 at 15:03

















            • Thanks for your answer. The server is a HP DL180 G9; Reason for drive failure is always "Write retries failed". I have added this info to my question.

              – Laord
              Mar 17 at 14:37











            • Why aren't you using HP disks on this system? Are these genuine HP drive carriers?

              – ewwhite
              Mar 17 at 14:47











            • This is rented hardware. I'm using these drives because they are the ones the hoster is offering. Actually i cannot find any info on whether these drives are somewhat certified for HP.

              – Laord
              Mar 17 at 14:59






            • 1





              They are not certified. Your hosting provider is using cheap consumer disks.

              – ewwhite
              Mar 17 at 15:03
















            Thanks for your answer. The server is a HP DL180 G9; Reason for drive failure is always "Write retries failed". I have added this info to my question.

            – Laord
            Mar 17 at 14:37





            Thanks for your answer. The server is a HP DL180 G9; Reason for drive failure is always "Write retries failed". I have added this info to my question.

            – Laord
            Mar 17 at 14:37













            Why aren't you using HP disks on this system? Are these genuine HP drive carriers?

            – ewwhite
            Mar 17 at 14:47





            Why aren't you using HP disks on this system? Are these genuine HP drive carriers?

            – ewwhite
            Mar 17 at 14:47













            This is rented hardware. I'm using these drives because they are the ones the hoster is offering. Actually i cannot find any info on whether these drives are somewhat certified for HP.

            – Laord
            Mar 17 at 14:59





            This is rented hardware. I'm using these drives because they are the ones the hoster is offering. Actually i cannot find any info on whether these drives are somewhat certified for HP.

            – Laord
            Mar 17 at 14:59




            1




            1





            They are not certified. Your hosting provider is using cheap consumer disks.

            – ewwhite
            Mar 17 at 15:03





            They are not certified. Your hosting provider is using cheap consumer disks.

            – ewwhite
            Mar 17 at 15:03

















            draft saved

            draft discarded
















































            Thanks for contributing an answer to Server Fault!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid


            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.

            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fserverfault.com%2fquestions%2f958650%2fhp-p840-hdd-raid-5-many-strange-drive-failures%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Adding axes to figuresAdding axes labels to LaTeX figuresLaTeX equivalent of ConTeXt buffersRotate a node but not its content: the case of the ellipse decorationHow to define the default vertical distance between nodes?TikZ scaling graphic and adjust node position and keep font sizeNumerical conditional within tikz keys?adding axes to shapesAlign axes across subfiguresAdding figures with a certain orderLine up nested tikz enviroments or how to get rid of themAdding axes labels to LaTeX figures

            Tähtien Talli Jäsenet | Lähteet | NavigointivalikkoSuomen Hippos – Tähtien Talli

            Do these cracks on my tires look bad? The Next CEO of Stack OverflowDry rot tire should I replace?Having to replace tiresFishtailed so easily? Bad tires? ABS?Filling the tires with something other than air, to avoid puncture hassles?Used Michelin tires safe to install?Do these tyre cracks necessitate replacement?Rumbling noise: tires or mechanicalIs it possible to fix noisy feathered tires?Are bad winter tires still better than summer tires in winter?Torque converter failure - Related to replacing only 2 tires?Why use snow tires on all 4 wheels on 2-wheel-drive cars?