Re: Ceph nvme timeout and then aborting

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Look's good, what is your hardware? Server model & NVM'es?



k

> On 19 Feb 2021, at 13:22, zxcs <zhuxiongcs@xxxxxxx> wrote:
> 
> BTW, actually i have two nodes has same issues, and another error node's nvme output as below 
> 
> Smart Log for NVME device:nvme0n1 namespace-id:ffffffff
> critical_warning                    : 0
> temperature                         : 29 C
> available_spare                     : 100%
> available_spare_threshold           : 10%
> percentage_used                     : 1%
> data_units_read                     : 592,340,175
> data_units_written                  : 26,443,352
> host_read_commands                  : 5,341,278,662
> host_write_commands                 : 515,730,885
> controller_busy_time                : 14,052
> power_cycles                        : 8
> power_on_hours                      : 4,294
> unsafe_shutdowns                    : 6
> media_errors                        : 0
> num_err_log_entries                 : 0
> Warning Temperature Time            : 0
> Critical Composite Temperature Time : 0
> Temperature Sensor 1                : 29 C
> Temperature Sensor 2                : 46 C
> Temperature Sensor 3                : 0 C
> Temperature Sensor 4                : 0 C
> Temperature Sensor 5                : 0 C
> Temperature Sensor 6                : 0 C
> Temperature Sensor 7                : 0 C
> Temperature Sensor 8                : 0 C
> 
> 
> For compare, i get one healthy node’s nvme output as below:
> 
> mart Log for NVME device:nvme0n1 namespace-id:ffffffff
> critical_warning                    : 0
> temperature                         : 27 C
> available_spare                     : 100%
> available_spare_threshold           : 10%
> percentage_used                     : 1%
> data_units_read                     : 579,829,652
> data_units_written                  : 28,271,336
> host_read_commands                  : 5,237,750,233
> host_write_commands                 : 518,979,861
> controller_busy_time                : 14,166
> power_cycles                        : 3
> power_on_hours                      : 4,252
> unsafe_shutdowns                    : 1
> media_errors                        : 0
> num_err_log_entries                 : 0
> Warning Temperature Time            : 0
> Critical Composite Temperature Time : 0
> Temperature Sensor 1                : 27 C
> Temperature Sensor 2                : 39 C
> Temperature Sensor 3                : 0 C
> Temperature Sensor 4                : 0 C
> Temperature Sensor 5                : 0 C
> Temperature Sensor 6                : 0 C
> Temperature Sensor 7                : 0 C
> Temperature Sensor 8                : 0 C

_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx




[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux