Look's good, what is your hardware? Server model & NVM'es? k > On 19 Feb 2021, at 13:22, zxcs <zhuxiongcs@xxxxxxx> wrote: > > BTW, actually i have two nodes has same issues, and another error node's nvme output as below > > Smart Log for NVME device:nvme0n1 namespace-id:ffffffff > critical_warning : 0 > temperature : 29 C > available_spare : 100% > available_spare_threshold : 10% > percentage_used : 1% > data_units_read : 592,340,175 > data_units_written : 26,443,352 > host_read_commands : 5,341,278,662 > host_write_commands : 515,730,885 > controller_busy_time : 14,052 > power_cycles : 8 > power_on_hours : 4,294 > unsafe_shutdowns : 6 > media_errors : 0 > num_err_log_entries : 0 > Warning Temperature Time : 0 > Critical Composite Temperature Time : 0 > Temperature Sensor 1 : 29 C > Temperature Sensor 2 : 46 C > Temperature Sensor 3 : 0 C > Temperature Sensor 4 : 0 C > Temperature Sensor 5 : 0 C > Temperature Sensor 6 : 0 C > Temperature Sensor 7 : 0 C > Temperature Sensor 8 : 0 C > > > For compare, i get one healthy node’s nvme output as below: > > mart Log for NVME device:nvme0n1 namespace-id:ffffffff > critical_warning : 0 > temperature : 27 C > available_spare : 100% > available_spare_threshold : 10% > percentage_used : 1% > data_units_read : 579,829,652 > data_units_written : 28,271,336 > host_read_commands : 5,237,750,233 > host_write_commands : 518,979,861 > controller_busy_time : 14,166 > power_cycles : 3 > power_on_hours : 4,252 > unsafe_shutdowns : 1 > media_errors : 0 > num_err_log_entries : 0 > Warning Temperature Time : 0 > Critical Composite Temperature Time : 0 > Temperature Sensor 1 : 27 C > Temperature Sensor 2 : 39 C > Temperature Sensor 3 : 0 C > Temperature Sensor 4 : 0 C > Temperature Sensor 5 : 0 C > Temperature Sensor 6 : 0 C > Temperature Sensor 7 : 0 C > Temperature Sensor 8 : 0 C _______________________________________________ ceph-users mailing list -- ceph-users@xxxxxxx To unsubscribe send an email to ceph-users-leave@xxxxxxx