Re: monitoring drives

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 17.10.2022 12:52, Ernesto Puerta wrote:
   - Ceph already exposes SMART-based health-checks, metrics and alerts
   from the devicehealth/diskprediction modules

<https://docs.ceph.com/en/latest/rados/operations/devices/#enabling-monitoring>.
I find this kind of high-level monitoring more digestible to operators than
   low-level SMART metrics.

Marc that started this thread was asking about SAS disk.
smartctl doesn't show much SMART Attributes on SAS disk, but some drive only have error log like this

Error counter log:
Errors Corrected by Total Correction Gigabytes Total ECC rereads/ errors algorithm processed uncorrected fast | delayed rewrites corrected invocations [10^9 bytes] errors read: 0 0 0 0 376907 93335.728 0 write: 0 2 0 2 2113307 17978.600 0 verify: 0 0 0 0 848 0.002 0


But for the drive I have is look like they all have SMART Health Status.

    "SMART Health Status: OK"


Ceph doesn't support SMART or any status on SAS disk today, I only get the message "No SMART data available".


I have gathered "smartctl -x --json=vo" log for the 6 types of SAS this I have in my possession.
You can find them here if interested [1]


[1] https://gitlab.com/-/snippets/2431089

--
Kai Stian Olstad
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx



[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux