Re: SMART disk monitoring

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



I seem to recall a batch of hardware in one of the early clusters that
had issues with repeated polling of smart data, IO would pause for a
few seconds. This might explain why they are writing out a daily cron
script and polling a results file instead of repeatedly polling the
actual devices.

On Wed, Jan 3, 2018 at 8:37 AM, Sage Weil <sage@xxxxxxxxxxxx> wrote:
> On Sun, 12 Nov 2017, Lars Marowsky-Bree wrote:
>> On 2017-11-10T22:36:46, Yaarit Hatuka <yaarit@xxxxxxxxx> wrote:
>>
>> > Many thanks! I'm very excited to join Ceph's outstanding community!
>> > I'm looking forward to working on this challenging project, and I'm
>> > very grateful for the opportunity to be guided by Sage.
>>
>> That's all excellent news!
>>
>> Can we discuss though if/how this belongs into ceph-osd? Given that this
>> can (and is) already collected via smartmon, either via prometheus or, I
>> assume, collectd as well? Does this really need to be added to the OSD
>> code?
>
> Hi Lars,
>
> Yaarit is taking a look at this now and the smartmon.sh collector for
> prometheus looks a bit janky:
>
> 1) It seems like you have to set up a cron job to write the current smart
> output to a text file in a directory somewhere, and then prometheus will
> scrape it when polled.[1]
>
> 2) smartmon.sh[2] is a shortish pile of bash that collects only a handful
> of fields by parsing smartctl output.
>
> The second piece will hopefully improve once the JSON output mode for
> smartctl is completed (that is in progress upstream in smartmontools).
> But the first part seems awkward, and doesn't look like it would work out
> of the box.  Are you guys currently collecting SMART data?  If so, how did
> you automate/simplify the setup?
>
> Thanks!
> sage
>
>
> [1] https://github.com/prometheus/node_exporter#textfile-collector
> [2] https://github.com/prometheus/node_exporter/edit/master/text_collector_examples/smartmon.sh
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux