Re: NVMe Health check?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, 14 Oct 2021 at 22:29, ToddAndMargo via users <users@xxxxxxxxxxxxxxxxxxxxxxx> wrote:
On 10/13/21 06:38, George N. White III wrote:
> Also available in fedora): nvmi-cli (upstream
> https://github.com/linux-nvme/nvme-cli)
> <https://github.com/linux-nvme/nvme-cli>
> and nvmetcli  (Configure NVMe-over-Fabrics Target).

see anything wrong?

# nvme smart-log /dev/nvme0n1
Smart Log for NVME device:nvme0n1 namespace-id:ffffffff
critical_warning                        : 0
temperature                             : 31 C
available_spare                         : 100%
available_spare_threshold               : 10%
percentage_used                         : 1%

When working properly, the drive dies when it uses up all the
"available_spare" capacity.  This report is similar to what I
see, but I only have 3 nvme drives.

https://nvmexpress.org/resources/nvm-express-technology-features/nvme-features-for-error-reporting-smart-log-pages-failures-and-management-capabilities-in-nvme-architectures/

There are failure modes other than something that
degrades over time and can be monitored.

endurance group critical warning summary: 0
data_units_read                         : 773,858,127
data_units_written                      : 38,056,140
host_read_commands                      : 2,212,795,501
host_write_commands                     : 822,976,235
controller_busy_time                    : 5,347
power_cycles                            : 1,718
power_on_hours                          : 3,405
unsafe_shutdowns                        : 63
media_errors                            : 0
num_err_log_entries                     : 1,828

I have 1,812 for num_err_log_entries on a new drive
(291 power_on_hours), so I suspect many of
these log entries come when a drive is first
installed.
 
Warning Temperature Time                : 0
Critical Composite Temperature Time     : 0
Temperature Sensor 1           : 31 C
Temperature Sensor 2           : 34 C
Thermal Management T1 Trans Count       : 0
Thermal Management T2 Trans Count       : 0
Thermal Management T1 Total Time        : 0
Thermal Management T2 Total Time        : 0


TLDNR:
https://nvmexpress.org/wp-content/uploads/NVMe-NVM-Express-2.0a-2021.07.26-Ratified.pdf

--
George N. White III

_______________________________________________
users mailing list -- users@xxxxxxxxxxxxxxxxxxxxxxx
To unsubscribe send an email to users-leave@xxxxxxxxxxxxxxxxxxxxxxx
Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: https://lists.fedoraproject.org/archives/list/users@xxxxxxxxxxxxxxxxxxxxxxx
Do not reply to spam on the list, report it: https://pagure.io/fedora-infrastructure
[Index of Archives]     [Older Fedora Users]     [Fedora Announce]     [Fedora Package Announce]     [EPEL Announce]     [EPEL Devel]     [Fedora Magazine]     [Fedora Summer Coding]     [Fedora Laptop]     [Fedora Cloud]     [Fedora Advisory Board]     [Fedora Education]     [Fedora Security]     [Fedora Scitech]     [Fedora Robotics]     [Fedora Infrastructure]     [Fedora Websites]     [Anaconda Devel]     [Fedora Devel Java]     [Fedora Desktop]     [Fedora Fonts]     [Fedora Marketing]     [Fedora Management Tools]     [Fedora Mentors]     [Fedora Package Review]     [Fedora R Devel]     [Fedora PHP Devel]     [Kickstart]     [Fedora Music]     [Fedora Packaging]     [Fedora SELinux]     [Fedora Legal]     [Fedora Kernel]     [Fedora OCaml]     [Coolkey]     [Virtualization Tools]     [ET Management Tools]     [Yum Users]     [Yosemite News]     [Gnome Users]     [KDE Users]     [Fedora Art]     [Fedora Docs]     [Fedora Sparc]     [Libvirt Users]     [Fedora ARM]

  Powered by Linux