Page 2 of 3

Re: Problème de disque

Posted: 27 Jul 2015 19:33
by lulu80
bon voici se que j'ai obtenu après avoir suivi la marche a suivre de sleid cela est-il correct ?
"zpool replace pool1 11425908422794113397 ada4"
changement disqueada4.PNG

Re: Problème de disque

Posted: 27 Jul 2015 19:47
by sleid
Parfait, il faut attendre la fin du resilvering.

Re: Problème de disque

Posted: 27 Jul 2015 19:56
by lulu80
environ 0h40 a cette heure d'après le suivi dans "Disques|ZFS|Pools|Informations"

Re: Problème de disque

Posted: 27 Jul 2015 20:10
by sleid
Au passage une précision car au début je croyais que c'était du mirroring, mais non c'est du raidZ1 donc un seul disque peut tomber en panne à la fois, ce qui m'avait égaré c'est le nombre de disques :4 or pour un fonctionnement optimal de raidZ1 c'est 2n + 1 donc 3 ou 5 mais pas 4 qui est optimal pour du raidZ2.
Donc pendant la reconstruction croisez les doigts pour qu'aucun autre disque ne lache (je pense à ada3).

Re: Problème de disque

Posted: 27 Jul 2015 20:31
by lulu80
voilà ou je suis arriver

Code: Select all

Informations et état du pool
  pool: pool1
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
	continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Mon Jul 27 19:26:43 2015
        523G scanned out of 576G at 140M/s, 0h6m to go
        125G resilvered, 90.80% done
config:

	NAME                        STATE     READ WRITE CKSUM
	pool1                       DEGRADED     3     0     0
	  raidz1-0                  DEGRADED     3     0     0
	    ada1                    ONLINE       0     0     0  (resilvering)
	    ada2                    ONLINE       0     0     0
	    ada3                    ONLINE       3     0     0  (resilvering)
	    replacing-3             OFFLINE      0     0     0
	      11425908422794113397  OFFLINE      0     0     0  was /dev/ada4/old
	      ada4                  ONLINE       0     0     0  (resilvering)

errors: Permanent errors have been detected in the following files:

        /mnt/pool1/films/American Heist (2014)/American Heist.avi
        /mnt/pool1/films/Marie Heurtin (2014)/Marie Heurtin.avi
        /mnt/pool1/DebianWheezyTemplate.ova

Re: Problème de disque

Posted: 27 Jul 2015 20:42
by lulu80
voici le dernier relevé je pense que le resilvering et fini

pool: pool1
state: DEGRADED
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: http://illumos.org/msg/ZFS-8000-8A
scan: resilvered 137G in 1h10m with 3 errors on Mon Jul 27 20:37:28 2015
config:

NAME STATE READ WRITE CKSUM
pool1 DEGRADED 3 0 0
raidz1-0 DEGRADED 3 0 0
ada1 ONLINE 0 0 0
ada2 ONLINE 0 0 0
ada3 ONLINE 3 0 0
replacing-3 OFFLINE 0 0 0
11425908422794113397 OFFLINE 0 0 0 was /dev/ada4/old
ada4 ONLINE 0 0 0

errors: Permanent errors have been detected in the following files:

/mnt/pool1/films/American Heist (2014)/American Heist.avi
/mnt/pool1/films/Marie Heurtin (2014)/Marie Heurtin.avi
/mnt/pool1/DebianWheezyTemplate.ova

Re: Problème de disque

Posted: 27 Jul 2015 20:57
by lulu80
j'ai maintenant les même erreurs que j'avais avec ada4 "error 5,retries exhausted
je pense que je doit maintenant changer le ada3 non ?

Re: Problème de disque

Posted: 27 Jul 2015 21:35
by lulu80
je n'arrive pas a mettre ada3 en offline ? que ce passe t'il
offline ada3.PNG

Re: Problème de disque

Posted: 27 Jul 2015 23:41
by mtiburs
@sleid

Perso, quand, j'ai des erreurs sur ZFS, je coupe de suite ce qui n'est pas bon.
Si cela était possible, je ferais un rm /mnt/pool1/films/American Heist (2014)/American Heist.avi puis des 2 autres fichiers

qu'est-ce que t'en penses ?
est-ce que tu penses que les 3 fichiers sont "cuits"

@ernie
tes 3 fichiers sont très important ? t'as des sauvegardes ?

Sinon, pour moi ZFS dit ce qu'il y a et ce qu'il faut faire
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.

Le point positif est que les applications peuvent être affectées (pas les données)
Donc, je dirais, qu'il ne faut plus toucher un seul fichier sur ce pool tant que la situation n'est pas revenue à la normale (changement de disque)

Re: Problème de disque

Posted: 28 Jul 2015 06:20
by sleid
Le problème vient du fait d'avoir 2 disques boiteux sur un raidZ1.
Un zpool clear serait le bienvenu, car à la fin du resilvering il ne devrait pas conserver
replacing-3 OFFLINE 0 0 0
11425908422794113397 OFFLINE 0 0 0 was /dev/ada4/old.

Re: Problème de disque

Posted: 28 Jul 2015 08:02
by lulu80
bonjour,
hier soir dans le doute j'ai viré c'est trois fichiers douteux qui ne sont pas important du tous ( que des films et en plus déjà vue :cry: )

toutes suite après le resilvering c'est remis en marche (pour environ 3h00)mais sur ada3 ? étant complètement néophyte sur se point et vue l'heure :roll: j'ai laisser tourner pour cette nuit .
Voici le zpool status de se matin .
changement disqueada4(bis).PNG

Re: Problème de disque

Posted: 28 Jul 2015 09:07
by mtiburs
c'est joli !

Re: Problème de disque

Posted: 28 Jul 2015 09:11
by lulu80
ZFS et simplement génial :)

Re: Problème de disque

Posted: 28 Jul 2015 09:19
by lulu80
Un zpool clear serait le bienvenu
cela sert a quoi ? nettoyer les traces du resilvering ?

Re: Problème de disque

Posted: 28 Jul 2015 09:19
by velivole18
Bonjour,

Il est donc peut-être temps de changer les disques avec l'esprit plus tranquille maintenant que l'environnement est plus stable ... :roll:

Cordialement.

Re: Problème de disque

Posted: 28 Jul 2015 09:21
by sleid
Non c'était suite au message précédent qui laissait la trace du remplacement, cela sert juste a effacer les erreurs

Un nouvel état smart pour TOUS les disques serait le bienvenu

Re: Problème de disque

Posted: 28 Jul 2015 09:34
by lulu80
je regarde et les post

Re: Problème de disque

Posted: 28 Jul 2015 10:11
by lulu80

Code: Select all

Périphérique /dev/ada1 - Western Digital Caviar Blue (SATA 6Gb/s)
=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Caviar Blue (SATA 6Gb/s)
Device Model:     WDC WD10EZEX-08M2NA0
Serial Number:    WD-WCC3F5330305
LU WWN Device Id: 5 0014ee 2b4eaaece
Firmware Version: 01.01A01
User Capacity:    1,000,204,886,016 bytes [1.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    7200 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2, ACS-3 T13/2161-D revision 3b
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue Jul 28 09:50:09 2015 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82)   Offline data collection activity
               was completed without error.
               Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)   The previous self-test routine completed
               without error or no self-test has ever
               been run.
Total time to complete Offline
data collection:       (11280) seconds.
Offline data collection
capabilities:           (0x7b) SMART execute Offline immediate.
               Auto Offline data collection on/off support.
               Suspend Offline collection upon new
               command.
               Offline surface scan supported.
               Self-test supported.
               Conveyance Self-test supported.
               Selective Self-test supported.
SMART capabilities:            (0x0003)   Saves SMART data before entering
               power-saving mode.
               Supports SMART auto save timer.
Error logging capability:        (0x01)   Error logging supported.
               General Purpose Logging supported.
Short self-test routine
recommended polling time:     (   2) minutes.
Extended self-test routine
recommended polling time:     ( 117) minutes.
Conveyance self-test routine
recommended polling time:     (   5) minutes.
SCT capabilities:           (0x3035)   SCT Status supported.
               SCT Feature Control supported.
               SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   175   174   021    Pre-fail  Always       -       2208
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       420
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   092   092   000    Old_age   Always       -       6315
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       395
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       53
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       439
194 Temperature_Celsius     0x0022   115   103   000    Old_age   Always       -       28
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Code: Select all

Périphérique /dev/ada2 - Western Digital Caviar Blue (SATA 6Gb/s)
=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Caviar Blue (SATA 6Gb/s)
Device Model:     WDC WD10EZEX-08M2NA0
Serial Number:    WD-WCC3F5283191
LU WWN Device Id: 5 0014ee 25f94f575
Firmware Version: 01.01A01
User Capacity:    1,000,204,886,016 bytes [1.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    7200 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2, ACS-3 T13/2161-D revision 3b
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue Jul 28 09:50:09 2015 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82)   Offline data collection activity
               was completed without error.
               Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)   The previous self-test routine completed
               without error or no self-test has ever
               been run.
Total time to complete Offline
data collection:       (11580) seconds.
Offline data collection
capabilities:           (0x7b) SMART execute Offline immediate.
               Auto Offline data collection on/off support.
               Suspend Offline collection upon new
               command.
               Offline surface scan supported.
               Self-test supported.
               Conveyance Self-test supported.
               Selective Self-test supported.
SMART capabilities:            (0x0003)   Saves SMART data before entering
               power-saving mode.
               Supports SMART auto save timer.
Error logging capability:        (0x01)   Error logging supported.
               General Purpose Logging supported.
Short self-test routine
recommended polling time:     (   2) minutes.
Extended self-test routine
recommended polling time:     ( 120) minutes.
Conveyance self-test routine
recommended polling time:     (   5) minutes.
SCT capabilities:           (0x3035)   SCT Status supported.
               SCT Feature Control supported.
               SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   173   169   021    Pre-fail  Always       -       2333
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       345
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   092   092   000    Old_age   Always       -       6253
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       342
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       31
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       387
194 Temperature_Celsius     0x0022   113   103   000    Old_age   Always       -       30
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Code: Select all

Périphérique /dev/ada3 - Seagate Barracuda 7200.14 (AF)
=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda 7200.14 (AF)
Device Model:     ST1000DM003-1CH162
Serial Number:    Z1D5HXQV
LU WWN Device Id: 5 000c50 05d084d2a
Firmware Version: CC49
User Capacity:    1,000,204,886,016 bytes [1.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    7200 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2, ACS-3 T13/2161-D revision 3b
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue Jul 28 09:50:09 2015 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)   Offline data collection activity
               was never started.
               Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)   The previous self-test routine completed
               without error or no self-test has ever
               been run.
Total time to complete Offline
data collection:       (  575) seconds.
Offline data collection
capabilities:           (0x73) SMART execute Offline immediate.
               Auto Offline data collection on/off support.
               Suspend Offline collection upon new
               command.
               No Offline surface scan supported.
               Self-test supported.
               Conveyance Self-test supported.
               Selective Self-test supported.
SMART capabilities:            (0x0003)   Saves SMART data before entering
               power-saving mode.
               Supports SMART auto save timer.
Error logging capability:        (0x01)   Error logging supported.
               General Purpose Logging supported.
Short self-test routine
recommended polling time:     (   1) minutes.
Extended self-test routine
recommended polling time:     ( 113) minutes.
Conveyance self-test routine
recommended polling time:     (   2) minutes.
SCT capabilities:           (0x3085)   SCT Status supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   117   099   006    Pre-fail  Always       -       118537248
  3 Spin_Up_Time            0x0003   097   097   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       254
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       72
  7 Seek_Error_Rate         0x000f   080   060   030    Pre-fail  Always       -       101182453
  9 Power_On_Hours          0x0032   094   094   000    Old_age   Always       -       5766
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       249
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       141
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0 0 0
189 High_Fly_Writes         0x003a   099   099   000    Old_age   Always       -       1
190 Airflow_Temperature_Cel 0x0022   067   059   045    Old_age   Always       -       33 (Min/Max 22/36)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       41
193 Load_Cycle_Count        0x0032   097   097   000    Old_age   Always       -       6813
194 Temperature_Celsius     0x0022   033   041   000    Old_age   Always       -       33 (0 13 0 0 0)
197 Current_Pending_Sector  0x0012   100   099   000    Old_age   Always       -       56
198 Offline_Uncorrectable   0x0010   100   099   000    Old_age   Offline      -       56
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       5591h+23m+51.966s
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       4079832742
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       12363763010

SMART Error Log Version: 1
ATA Error Count: 141 (device log contains only the most recent five errors)
   CR = Command Register [HEX]
   FR = Features Register [HEX]
   SC = Sector Count Register [HEX]
   SN = Sector Number Register [HEX]
   CL = Cylinder Low Register [HEX]
   CH = Cylinder High Register [HEX]
   DH = Device/Head Register [HEX]
   DC = Device Command Register [HEX]
   ER = Error register [HEX]
   ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 141 occurred at disk power-on lifetime: 5755 hours (239 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 18 ff ff ff 4f 00      03:24:44.283  READ FPDMA QUEUED
  60 00 58 ff ff ff 4f 00      03:24:44.283  READ FPDMA QUEUED
  60 00 08 ff ff ff 4f 00      03:24:44.283  READ FPDMA QUEUED
  60 00 b0 ff ff ff 4f 00      03:24:44.283  READ FPDMA QUEUED
  60 00 58 ff ff ff 4f 00      03:24:44.282  READ FPDMA QUEUED

Error 140 occurred at disk power-on lifetime: 5755 hours (239 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 18 ff ff ff 4f 00      03:24:40.651  READ FPDMA QUEUED
  60 00 58 ff ff ff 4f 00      03:24:40.651  READ FPDMA QUEUED
  60 00 08 ff ff ff 4f 00      03:24:40.651  READ FPDMA QUEUED
  60 00 b0 ff ff ff 4f 00      03:24:40.650  READ FPDMA QUEUED
  60 00 58 ff ff ff 4f 00      03:24:40.650  READ FPDMA QUEUED

Error 139 occurred at disk power-on lifetime: 5755 hours (239 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 18 ff ff ff 4f 00      03:24:36.978  READ FPDMA QUEUED
  60 00 58 ff ff ff 4f 00      03:24:36.978  READ FPDMA QUEUED
  60 00 08 ff ff ff 4f 00      03:24:36.978  READ FPDMA QUEUED
  60 00 b0 ff ff ff 4f 00      03:24:36.978  READ FPDMA QUEUED
  60 00 58 ff ff ff 4f 00      03:24:36.978  READ FPDMA QUEUED

Error 138 occurred at disk power-on lifetime: 5755 hours (239 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 58 ff ff ff 4f 00      03:24:32.923  READ FPDMA QUEUED
  60 00 08 ff ff ff 4f 00      03:24:32.923  READ FPDMA QUEUED
  60 00 b0 ff ff ff 4f 00      03:24:32.923  READ FPDMA QUEUED
  60 00 58 ff ff ff 4f 00      03:24:32.923  READ FPDMA QUEUED
  2f 00 01 10 00 00 00 00      03:24:32.819  READ LOG EXT

Error 137 occurred at disk power-on lifetime: 5755 hours (239 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f  Error: UNC at LBA = 0x0fffffff = 268435455

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 b0 ff ff ff 4f 00      03:24:29.215  READ FPDMA QUEUED
  60 00 58 ff ff ff 4f 00      03:24:29.214  READ FPDMA QUEUED
  61 00 58 ff ff ff 4f 00      03:24:29.204  WRITE FPDMA QUEUED
  61 00 10 ff ff ff 4f 00      03:24:29.188  WRITE FPDMA QUEUED
  61 00 10 ff ff ff 4f 00      03:24:29.185  WRITE FPDMA QUEUED

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Code: Select all

Périphérique /dev/ada4 - Seagate Barracuda 7200.14 (AF)
=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda 7200.14 (AF)
Device Model:     ST2000DM001-1ER164
Serial Number:    Z4Z101P5
LU WWN Device Id: 5 000c50 0797d3aa5
Firmware Version: CC25
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    7200 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2, ACS-3 T13/2161-D revision 3b
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue Jul 28 09:50:09 2015 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)   Offline data collection activity
               was never started.
               Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)   The previous self-test routine completed
               without error or no self-test has ever
               been run.
Total time to complete Offline
data collection:       (   97) seconds.
Offline data collection
capabilities:           (0x73) SMART execute Offline immediate.
               Auto Offline data collection on/off support.
               Suspend Offline collection upon new
               command.
               No Offline surface scan supported.
               Self-test supported.
               Conveyance Self-test supported.
               Selective Self-test supported.
SMART capabilities:            (0x0003)   Saves SMART data before entering
               power-saving mode.
               Supports SMART auto save timer.
Error logging capability:        (0x01)   Error logging supported.
               General Purpose Logging supported.
Short self-test routine
recommended polling time:     (   1) minutes.
Extended self-test routine
recommended polling time:     ( 218) minutes.
Conveyance self-test routine
recommended polling time:     (   2) minutes.
SCT capabilities:           (0x1085)   SCT Status supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   114   099   006    Pre-fail  Always       -       64540128
  3 Spin_Up_Time            0x0003   096   096   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       490
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   063   060   030    Pre-fail  Always       -       2223749
  9 Power_On_Hours          0x0032   095   095   000    Old_age   Always       -       5037
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       124
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0 0 0
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   067   062   045    Old_age   Always       -       33 (Min/Max 22/34)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       4
193 Load_Cycle_Count        0x0032   096   096   000    Old_age   Always       -       9601
194 Temperature_Celsius     0x0022   033   040   000    Old_age   Always       -       33 (0 13 0 0 0)
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       142h+05m+37.784s
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       2235152306
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       36290475

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Re: Problème de disque

Posted: 28 Jul 2015 11:10
by sleid
Il reste donc ada3 qui a réalloué des secteurs donc à surveiller régulièrement ce nombre de 72
5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always - 72
et l'évolution de ceux-ci:
187 Reported_Uncorrect 0x0032 001 001 000 Old_age Always - 141
197 Current_Pending_Sector 0x0012 100 099 000 Old_age Always - 56
198 Offline_Uncorrectable 0x0010 100 099 000 Old_age Offline - 56

Re: Problème de disque

Posted: 28 Jul 2015 12:11
by lulu80
bien merci sleid
dés réception des disques neuf je le remplace en premier puis le ada4 par un de 1 To
je vais vérifier l'état du ada4 comme le préconise le site WD mais j'ai du mal a mis retrouver sur leurs site

Re: Problème de disque

Posted: 28 Jul 2015 12:21
by lulu80
velivole18 wrote:Bonjour,

Il est donc peut-être temps de changer les disques avec l'esprit plus tranquille maintenant que l'environnement est plus stable ... :roll:

Cordialement.
oui merci ... les disques neuf sont sur la route :) et le ada4 et déjà remplacer par un 2 To de dépannage donc dés leurs arriver
il prendront leurs place dans le pool

Re: Problème de disque

Posted: 28 Jul 2015 15:52
by lulu80
bonjour,
bon le diag avec "Data Lifeguard Diagnostics" le premier des tests "quick test" me dit FAIL!
d'après leurs notice ils disent de changer le disque "Important : En cas d'échec d'un test, y compris du test rapide, il faut remplacer le disque qui est testé."
mais je préfère demander , avant de faire un retour
merci de vos lumière encore une fois
changement disqueada4(bis1).PNG

Re: Problème de disque

Posted: 28 Jul 2015 16:00
by lulu80
cela fait parti du logiciel Data Lifeguard Diagnostics pour Windows
changement disqueada4(bis2).PNG
changement disqueada4(bis3).PNG

Re: Problème de disque

Posted: 28 Jul 2015 16:51
by sleid
En principe il faut faire le test avancé et envoyer avec le disque le résultat

Re: Problème de disque

Posted: 28 Jul 2015 16:57
by lulu80
ok donc je vais le faire ,moi j'avais suivi le conseil du site

combien de temps dur le test ? c'est simplement pour savoir si je doit le mettre en route le soir

Re: Problème de disque

Posted: 28 Jul 2015 21:21
by sleid
Tout dépend de l'état du disque mais c'est de l'ordre de l'heure en général

Re: Problème de disque

Posted: 28 Jul 2015 21:31
by lulu80
je vais le lancer pour se soir merci

edit: ada4 ne travail pas comme les autres ? c'est normal peut être vue sa grandeur
ada4.PNG

Re: Problème de disque

Posted: 29 Jul 2015 06:26
by sleid
Ce sont les reliquats de statistiques du resilvering

Re: Problème de disque

Posted: 29 Jul 2015 11:11
by lulu80
bonjour,
bon j'ai essayer de faire le test avancé (Data Lifeguard Diagnostics) sur le ada4 sur deux PCs différent cela se met en erreur "FAIL!"
je pense a un problème mécanique qui empêcherai le bon déroulement des tests ? je sais pas :?

Re: Problème de disque

Posted: 29 Jul 2015 22:32
by sleid
il est mort ce disque