Ciao Stefano, Sorry for the very long delay of my reply! My review is below. On 1.05.21 г. 9:07, Stefano De Venuto wrote:
Add a tool in examples/ that scans a merged host + guest trace and search for host events that are inside a vmentry/vmexit block (and vice-versa for guest events ouside the block) and report the found ones. It can be useful as a starting point for identifying issues of for
Do you mean "identifying issues or"?
checking the effectiveness of host/guest traces synchronization, or even for improving the placing of the tracepoints in the kernel. v2 changes: - Addressed Yordan's comments. - Added a way to not mark as invalid the initial guests event. - Used kshark_read_event_field_int() for the vCPU field recovering. Signed-off-by: Stefano De Venuto <stefano.devenuto99@xxxxxxxxx> ---
The description of the changes in v2 must go here. This way it is visible in the patch, but it is not going to be part of the commit message once the final version of the patch is applied to the repo.
examples/CMakeLists.txt | 4 + examples/checker.c | 224 ++++++++++++++++++++++++++++++++++++++++ 2 files changed, 228 insertions(+) create mode 100644 examples/checker.c diff --git a/examples/CMakeLists.txt b/examples/CMakeLists.txt index c2f4c01..91badb8 100644 --- a/examples/CMakeLists.txt +++ b/examples/CMakeLists.txt @@ -12,6 +12,10 @@ message(STATUS "multibufferload") add_executable(mbload multibufferload.c) target_link_libraries(mbload kshark)+message(STATUS "checker")+add_executable(checker checker.c)
The name "checker" has too broad meaning. Try to replace it with something that it not very long but in the same time says something about the usage of the tool.
+target_link_libraries(checker kshark) + message(STATUS "datahisto") add_executable(dhisto datahisto.c) target_link_libraries(dhisto kshark) diff --git a/examples/checker.c b/examples/checker.c new file mode 100644 index 0000000..425de58 --- /dev/null +++ b/examples/checker.c @@ -0,0 +1,224 @@
The file must start with a SPDX license identifier. All other examples are under GPL-2.0, but Public domain will work as well. Next in the file you must have a copyright statement with your name and your email address. I guess the copyright will go to your university or Suse. Discuss this with Dario.
+#include <stdio.h> +#include <stdlib.h> +#include <string.h> + +#include "libkshark.h" +#include "libkshark-tepdata.h" + +#define KVM_ENTRY "kvm/kvm_entry" +#define KVM_EXIT "kvm/kvm_exit" + +struct custom_stream +{ + struct kshark_data_stream* original_stream; + int* cpus; +}; + +/**
When a comment starts with "/**" Doxygen will take the text of this comment and will try to use it for generating documentation. Since you don't need to generate documentation for this example, just use normal comments (/* .... */.)
+ * Checks if a stream_id represents a guest. + * If so, @host contains the corresponding host stream_id + */ +int is_guest(int stream_id, + struct kshark_host_guest_map* mapping, + int n_mapping, int* host) +{ + for (int i = 0; i < n_mapping; i++) { + if (mapping[i].guest_id == stream_id) { + *host = mapping[i].host_id; + return 1; + } + } + + return 0; +} + +/** + * Recover guest stream_id from host VMentry/VMexit event. + * In case of success, @guest_id will contain the guest stream_id. + */ +int guest_id_from_host_entry_exit(struct kshark_host_guest_map* mapping, + int n_mapping, int* guest_id, + struct kshark_entry* entry) +{ + struct kshark_data_stream* stream; + int pid; + + stream = kshark_get_stream_from_entry(entry); + pid = kshark_get_pid(entry); + + for (int i = 0; i < n_mapping; i++) + if (mapping[i].host_id == stream->stream_id) + for (int j = 0; j < mapping[i].vcpu_count; j++) + if (mapping[i].cpu_pid[j] == pid) { + *guest_id = mapping[i].guest_id; + return 1; + } + + return 0; +} + +void print_entry(struct kshark_entry* entry) +{ + struct kshark_data_stream* stream; + char* event_name; + + stream = kshark_get_stream_from_entry(entry); + event_name = kshark_get_event_name(entry); + + printf(" %d: %s-%d, %" PRId64 " [%03d]:%s\t%s\n", + stream->stream_id, + kshark_get_task(entry), + kshark_get_pid(entry), + entry->ts, + entry->cpu, + event_name, + kshark_get_info(entry)); +}
This function is similar to what kshark_dump_entry() is doing. Is there a reason for not using kshark_dump_entry()?
+ +void free_data(struct kshark_context *kshark_ctx, + struct custom_stream** custom_streams, + struct kshark_entry** entries, int n_entries, + struct kshark_host_guest_map* host_guest_mapping, + int n_guest) +{ + struct custom_stream* custom_stream; + + for (int i = 0; i < kshark_ctx->n_streams; i++) { + custom_stream = custom_streams[i]; + + free(custom_stream->cpus); + free(custom_stream); + } + free(custom_streams); + + for (int i = 0; i < n_entries; i++) + free(entries[i]); + free(entries); + + kshark_tracecmd_free_hostguest_map(host_guest_mapping, n_guest); +} + +int main(int argc, char **argv) +{ + struct kshark_host_guest_map* host_guest_mapping; + struct custom_stream** custom_streams; + struct custom_stream* custom_stream; + struct custom_stream* guest_stream; + struct custom_stream* host_stream; + struct kshark_data_stream* stream; + struct kshark_context* kshark_ctx; + struct kshark_entry** entries; + struct kshark_entry* current; + ssize_t n_entries; + char* event_name; + int64_t vcpu; + int guest_id; + int n_guest; + int host; + int v_i; + int sd; + + kshark_ctx = NULL; + if (!kshark_instance(&kshark_ctx)) + return 1; + + custom_streams = malloc(sizeof(*custom_streams) * (argc-1)); + + for (int i = 1; i < argc; i++) { + sd = kshark_open(kshark_ctx, argv[i]); + if (sd < 0) { + kshark_free(kshark_ctx); + return 1; + } + + kshark_tep_init_all_buffers(kshark_ctx, sd); + + /** + * Creating custom streams in order to keep track if a + * pCPU is executing code of a vCPU and, if so, which vCPU. + */ + custom_stream = malloc(sizeof(*custom_stream)); + custom_stream->original_stream = kshark_get_data_stream(kshark_ctx, sd); + custom_stream->cpus = malloc(custom_stream->original_stream->n_cpus * sizeof(int)); + memset(custom_stream->cpus, -1, custom_stream->original_stream->n_cpus * sizeof(int)); + + custom_streams[i-1] = custom_stream; + } + + host_guest_mapping = NULL; + n_guest = kshark_tracecmd_get_hostguest_mapping(&host_guest_mapping); + if (n_guest < 0) { + printf("Failed mapping: %d\n", n_guest); + return 1; + } + + entries = NULL; + n_entries = kshark_load_all_entries(kshark_ctx, &entries); + + for (int i = 0; i < n_entries; ++i) { + current = entries[i];
Rename this to "current_entry"
+ + stream = kshark_get_stream_from_entry(current);
+ event_name = kshark_get_event_name(current); + + custom_stream = custom_streams[stream->stream_id];
and this can be "current_stream"
+ + if (!strcmp(event_name, KVM_ENTRY) || !strcmp(event_name, KVM_EXIT)) { + if (kshark_read_event_field_int(current, "vcpu_id", &vcpu)) { + printf("Error on recovering the vCPU field\n"); + return 1; + } + + if (!guest_id_from_host_entry_exit(host_guest_mapping, n_guest, &guest_id, current)) { + printf("Error on recovering guest stream ID\n"); + return 1; + } + + /** + * Mark the vCPU upcoming events as checkable. + * Workaround implemented in order to not mark as invalid initial guests events. + * Done in this way since we can't know if we'll find a kvm_exit (consistent state) + * or a kvm_entry. + */ + guest_stream = custom_streams[guest_id]; + guest_stream->cpus[vcpu] = 1;
I don't understand the reason for this. Is it some kind of initialization problem. If it is an initialization, the right place for doing it is before you start looping over the data.
+ + if (!strcmp(event_name, KVM_ENTRY)) + custom_stream->cpus[current->cpu] = vcpu; + else + custom_stream->cpus[current->cpu] = -1;
You can have above #define INSIDE_KVM_WINDOW -1 // Change the name if you don't like it and here you can have custom_stream->cpus[current->cpu] = INSIDE_KVM_WINDOW;
+ + } else { + + /** + * If the event comes from a guest, recover the pCPU where the event was executed + * and check if it's NOT OUTSIDE a kvm_entry/kvm_exit block. + */ + if (is_guest(stream->stream_id, host_guest_mapping, n_guest, &host)) { + host_stream = custom_streams[host]; + + for (v_i = 0; v_i < host_stream->original_stream->n_cpus; v_i++) { + if (current->cpu == host_stream->cpus[v_i]) + break; + } + + if (v_i == host_stream->original_stream->n_cpus && custom_stream->cpus[current->cpu] != -1)
Avoid having lines longer than 100 characters. Split the ''if" in two lines if (v_i == host_stream->original_stream->n_cpus && custom_stream->cpus[current->cpu] !=-1)
+ printf("%d G out:\t", i);
This printout is really cryptic. Make it more informative.
+ + /** + * If the event comes from a host, recover the CPU that executed the event + * and check if it's NOT INSIDE a kvm_entry/kvm_exit block. + */ + } else {
It seems to me that the right place for the comment above is here (inside the "else").
+ if (custom_stream->cpus[current->cpu] != -1) + printf("%d H in:\t", i);
This one is cryptic as well.
+ } + } + + print_entry(entries[i]); + } + + free_data(kshark_ctx, custom_streams, entries, n_entries, host_guest_mapping, n_guest); + + kshark_free(kshark_ctx); +}
Thanks! Yordan