On 7/7/2024 2:25 AM, David Rientjes wrote:
Hi all,
I'm trying to crowdsource information on open source tools that can be
used directly by customers to explain memory mappings, usage, pressure,
etc.
We encounter both internal and external users that are looking for this
insight and it often requires significant engineering time to collect data
to make any conclusions.
Hello David,
Link: https://github.com/AMDESE/workload-insight-tool
sorry for replying late (as we took few months to make the
tool "opensource").
Not sure whether it exactly fits the requirement but the tools has been
very helpful for us (developers) to "visualize" the system behavior
(that is exported via procfs /sysfs interface) and initial analysis.
Deploying is easy since it is provided as a python package.
Typical usage:
1. collect the behavior of workload using
$syswit collect
e.g.,
$syswit collect -c "<PWD>/syswit/collector_configs/numa_cxl.yaml" -C -T
-m 1 -w "<workload>"
collected information is stored in Json format and it can be compared/
visualized further with
$syswit analyze (behavior of single workload)
OR
$syswit compare (comparing the result with multiple workload run
(e.g., before / after the patch).
Still a Long way to go.. But hope it is useful.
[Idea about the tool was seeded by Bharata, Bharata/Me helped in some of
the design and optimization and Ayush is the sole developer and many
inside AMD helped in reviewing/improving (Shivank,....)].
USP:
1. we can tune the information to collect.
2. It can run for longer time and store the data that can be analyzed/
visualized offline for anomaly.
(Thinking whether I should also post as a main thread in linux-mm
for greater awareness)
Thanks and Regards
- Raghu