lvm question

Gene Heskett gene.heskett at verizon.net
Thu Apr 12 17:28:45 UTC 2007


Greetings;

I've been through my own personal hell the last 18 hours.  First, I noted through a smartctl report that my hda was approaching the end of its life, the seek errors were up into the 300 million range, the the ECC errors wasn't far behind it.

So I started a dd if=/dev/hda of=/dev/hdd which cloned that 160GB drive to a 200GB drive.

While that was going on, verizon decided it was time I changed my password and disconnected me.

After the clone job, and re-arranging drives, removeing the old hda, moving the clone hdd to
its position both in the machine and on the cable, then putting a new 320GB in as hdd.  It 
wouldn't boot, mounting /boot and then FAIL without a reason.

So I got out the zod livedvd and booted it, doing the setup on hdd, but apparently I'd forgotten 
the order, so that was a three times around the loop deal.  Then, since I now had no partition 
for the last 40GB on hda, I fired up fdisk and added that.  The LVM didn't like that, and 
steadfastly refused to let me modify /dev/VolumeGroupyaddayadda in any way to fix it.

Several hours and many reboot attempts later it finally did let be run an e2fsck on it, which 
promptly (several hours at that) 'fixed' a bunch of stuff, and when it was done, about half 
of my kmail setup and mailboxes were toast.  I finally got all that sorted I believe but 2 
questions remain.

1.  How can I add this extra 40GB to that VolGroup now mounted on /

2.  This brand new 320GB maxtor is acting a lot like the old 160GB seagate was.

from a smartctl -a /dev/hda, the old 200GB maxtor:
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0007   125   124   021    Pre-fail  Always       -       4250
  4 Start_Stop_Count        0x0032   100   100   040    Old_age   Always       -       91
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000b   100   253   051    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   072   072   000    Old_age   Always       -       20862
 10 Spin_Retry_Count        0x0013   100   253   051    Pre-fail  Always       -       0
 11 Calibration_Retry_Count 0x0013   100   253   051    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       88
194 Temperature_Celsius     0x0022   109   253   000    Old_age   Always       -       41
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0012   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0012   200   200   000    Old_age   Always       -       0
199 UDMA_CRC_Error_Count    0x000a   200   253   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0009   200   155   051    Pre-fail  Offline      -       0

Note then poweron hours, over 20K

Now, from this new 320GB drive:
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   117   100   006    Pre-fail  Always       -       142787210
  3 Spin_Up_Time            0x0003   097   097   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       3
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   100   253   030    Pre-fail  Always       -       373173
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       9
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       5
187 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       0
189 Unknown_Attribute       0x003a   100   100   000    Old_age   Always       -       0
190 Unknown_Attribute       0x0022   059   059   045    Old_age   Always       -       690290729
194 Temperature_Celsius     0x0022   041   041   000    Old_age   Always       -       41 (Lifetime Min/Max 0/31)
195 Hardware_ECC_Recovered  0x001a   062   062   000    Old_age   Always       -       184671713
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0000   100   253   000    Old_age   Offline      -       0
202 TA_Increase_Count       0x0032   100   253   000    Old_age   Always       -       0

Note Poweron hours, 9

And both the #7 and #195 errors are incrementing at several per second rates.

I can't believe this drive won't die shortly, or am I barking up the wrong tree
on both the old hda, and this new hdd?

-- 
Cheers, Gene
"There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
Here I am at the flea market but nobody is buying my urine sample bottles ...




More information about the fedora-list mailing list