On 17.10.2022 12:52, Ernesto Puerta wrote:
- Ceph already exposes SMART-based health-checks, metrics and alerts
from the devicehealth/diskprediction modules
<https://docs.ceph.com/en/latest/rados/operations/devices/#enabling-monitoring>.
I find this kind of high-level monitoring more digestible to
operators than
low-level SMART metrics.
Marc that started this thread was asking about SAS disk.
smartctl doesn't show much SMART Attributes on SAS disk, but some drive
only have error log like this
Error counter log:
Errors Corrected by Total Correction
Gigabytes Total
ECC rereads/ errors algorithm
processed uncorrected
fast | delayed rewrites corrected invocations [10^9
bytes] errors
read: 0 0 0 0 376907 93335.728
0
write: 0 2 0 2 2113307 17978.600
0
verify: 0 0 0 0 848 0.002
0
But for the drive I have is look like they all have SMART Health Status.
"SMART Health Status: OK"
Ceph doesn't support SMART or any status on SAS disk today, I only get
the message "No SMART data available".
I have gathered "smartctl -x --json=vo" log for the 6 types of SAS this
I have in my possession.
You can find them here if interested [1]
[1] https://gitlab.com/-/snippets/2431089
--
Kai Stian Olstad
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx