<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40"><head><meta http-equiv=Content-Type content="text/html; charset=utf-8"><meta name=Generator content="Microsoft Word 15 (filtered medium)"><style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
.MsoChpDefault
{mso-style-type:export-only;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:70.85pt 70.85pt 70.85pt 70.85pt;}
div.WordSection1
{page:WordSection1;}
--></style></head><body lang=NL link=blue vlink="#954F72" style='word-wrap:break-word'><div class=WordSection1><p class=MsoNormal>Michael,</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>Fdisk is not necessary; I know the drive config. The disks are on a MegaRAID controller. SDA is the system/boot disk, a MegaRAID RAID 1 made up of two disks. SDB and SDC are the two disks that make up MD0.</p><p class=MsoNormal>I checked them all with MRM, and they all report as optimal. I also checked the RAID status in Webmin: the array is clean. </p><p class=MsoNormal>I looked up the DIDs and checked each disk with smartctl -a -d megaraid,DID /dev/sdX</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>SDC:</p><p class=MsoNormal>Vendor Specific SMART Attributes with Thresholds:</p><p class=MsoNormal>ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE</p><p class=MsoNormal> 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 41</p><p class=MsoNormal> 3 Spin_Up_Time 0x0027 173 169 021 Pre-fail Always - 4325</p><p class=MsoNormal> 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 62</p><p class=MsoNormal> 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0</p><p class=MsoNormal> 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0</p><p class=MsoNormal> 9 Power_On_Hours 0x0032 001 001 000 Old_age Always - 75968</p><p class=MsoNormal> 10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0</p><p class=MsoNormal> 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0</p><p class=MsoNormal> 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 58</p><p class=MsoNormal>192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 51</p><p class=MsoNormal>193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 10</p><p class=MsoNormal>194 Temperature_Celsius 0x0022 120 107 000 Old_age Always - 27</p><p class=MsoNormal>196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0</p><p 
class=MsoNormal>197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0</p><p class=MsoNormal>198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0</p><p class=MsoNormal>199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>SDB:</p><p class=MsoNormal>ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE</p><p class=MsoNormal> 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 16</p><p class=MsoNormal> 3 Spin_Up_Time 0x0027 173 170 021 Pre-fail Always - 2341</p><p class=MsoNormal> 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 39</p><p class=MsoNormal> 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0</p><p class=MsoNormal> 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0</p><p class=MsoNormal> 9 Power_On_Hours 0x0032 022 022 000 Old_age Always - 57667</p><p class=MsoNormal> 10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0</p><p class=MsoNormal> 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0</p><p class=MsoNormal> 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 39</p><p class=MsoNormal>192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 34</p><p class=MsoNormal>193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 1872</p><p class=MsoNormal>194 Temperature_Celsius 0x0022 120 114 000 Old_age Always - 23</p><p class=MsoNormal>196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0</p><p class=MsoNormal>197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0</p><p class=MsoNormal>198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0</p><p class=MsoNormal>199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0</p><p class=MsoNormal>200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>Some raw read errors were found, but the disks are ancient. 
They came with the server when I bought it. I soon had a disk failure, so I replaced the two system/boot disks.</p><p class=MsoNormal>I guess they are EOL.</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>No problem though, the MD0 array is just there for overnight backups. I will be replacing the drives someday soon. Thanks for your help!</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal><o:p> </o:p></p><div style='mso-element:para-border-div;border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm'><p class=MsoNormal style='border:none;padding:0cm'><b>From: </b><a href="mailto:mstauber@blueonyx.it">Michael Stauber</a><br><b>Sent: </b>Friday, 11 June 2021 19:32<br><b>To: </b><a href="mailto:blueonyx@mail.blueonyx.it">blueonyx@mail.blueonyx.it</a><br><b>Subject: </b>[BlueOnyx:24969] Re: EXT4-FS error.</p></div><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>Hi Arie,</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>> Secure log:</p><p class=MsoNormal>> Jun 9 23:53:01 nuserver kernel: EXT4-fs error (device dm-0):</p><p class=MsoNormal>> ext4_ext_remove_space:2976: inode #1836090: comm rm: pblk 7374840 bad</p><p class=MsoNormal>> header/extent: invalid magic - magic 2, entries 0, max 0(0), depth 0(0)</p><p class=MsoNormal>> Jun 9 23:53:01 nuserver kernel: EXT4-fs error (device dm-0) in</p><p class=MsoNormal>> ext4_ext_truncate:4688: IO failure</p><p class=MsoNormal>> Jun 10 23:54:41 nuserver kernel: EXT4-fs (dm-0): error count since last</p><p class=MsoNormal>> fsck: 2</p><p class=MsoNormal>> Jun 10 23:54:41 nuserver kernel: EXT4-fs (dm-0): initial error at time</p><p class=MsoNormal>> 1623275581: ext4_ext_remove_space:2976: inode 1836090</p><p class=MsoNormal>> Jun 10 23:54:41 nuserver kernel: EXT4-fs (dm-0): last error at time</p><p class=MsoNormal>> 1623275581: ext4_ext_truncate:4688: inode 1836090</p><p class=MsoNormal><o:p> </o:p></p><p 
class=MsoNormal>Plugging the relevant part of the error message ("bad header/extent: invalid magic") into a search engine like this ...</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>https://duckduckgo.com/?t=ffab&q=bad+header%2Fextent%3A+invalid+magic</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>... yields some answers.</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>The error messages listed above mean that the file system has been corrupted. Running fsck, or more appropriately e2fsck, to repair it might help, but the errors could also be an indicator that the hard disk(s) in question have hardware issues.</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>When I see stuff like that I usually take a look at what the disk health monitor says.</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>Check with "fdisk -l" to see what your disks are named and how many there are. Usually it starts with something like /dev/sda. 
Then use</p><p class=MsoNormal>"smartctl" to poll the health state of each:</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>smartctl -a /dev/sda</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>There is a section in the output of that which looks similar to the text</p><p class=MsoNormal>block below, although for the purpose of this message I cut out a few</p><p class=MsoNormal>irrelevant columns to prevent word wrap:</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>SMART Attributes Data Structure revision number: 16</p><p class=MsoNormal>Vendor Specific SMART Attributes with Thresholds:</p><p class=MsoNormal>ID# ATTRIBUTE_NAME UPDATED WHEN_FAILED RAW_VALUE</p><p class=MsoNormal> 1 Raw_Read_Error_Rate Always - 0</p><p class=MsoNormal> 2 Throughput_Performance Offline - 70</p><p class=MsoNormal> 3 Spin_Up_Time Always - 271 (Average 302)</p><p class=MsoNormal> 4 Start_Stop_Count Always - 204</p><p class=MsoNormal> 5 Reallocated_Sector_Ct Always - 0</p><p class=MsoNormal> 7 Seek_Error_Rate Always - 0</p><p class=MsoNormal> 8 Seek_Time_Performance Offline - 33</p><p class=MsoNormal> 9 Power_On_Hours Always - 44589</p><p class=MsoNormal> 10 Spin_Retry_Count Always - 0</p><p class=MsoNormal> 12 Power_Cycle_Count Always - 204</p><p class=MsoNormal>192 Power-Off_Retract_Count Always - 823</p><p class=MsoNormal>193 Load_Cycle_Count Always - 823</p><p class=MsoNormal>194 Temperature_Celsius Always - 40 (Min/Max 23/61)</p><p class=MsoNormal>196 Reallocated_Event_Count Always - 0</p><p class=MsoNormal>197 Current_Pending_Sector Always - 0</p><p class=MsoNormal>198 Offline_Uncorrectable Offline - 0</p><p class=MsoNormal>199 UDMA_CRC_Error_Count Always - 0</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>The interesting parts as far as errors go are:</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>Raw_Read_Error_Rate</p><p class=MsoNormal>Reallocated_Sector_Ct</p><p class=MsoNormal>Seek_Error_Rate</p><p 
class=MsoNormal>Reallocated_Event_Count</p><p class=MsoNormal>Current_Pending_Sector</p><p class=MsoNormal>Offline_Uncorrectable</p><p class=MsoNormal>UDMA_CRC_Error_Count</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>All in all the disk whose output I showed above is surprisingly free of errors, although "Power_On_Hours" shows it has been running for 44589 hours, which is slightly more than five years. I prefer to swap out disks after 4-5 years of usage, so this one will be replaced soonish, even though it has behaved very well so far.</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>If a drive has critical (recent) errors, "smartctl" might also report this in other parts of its lengthy output, so it's worth studying all of it and giving it some thought.</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>-- </p><p class=MsoNormal>With best regards</p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>Michael Stauber</p><p class=MsoNormal>_______________________________________________</p><p class=MsoNormal>Blueonyx mailing list</p><p class=MsoNormal>Blueonyx@mail.blueonyx.it</p><p class=MsoNormal>http://mail.blueonyx.it/mailman/listinfo/blueonyx</p><p class=MsoNormal><o:p> </o:p></p></div></body></html>