Hi, On 2/27/2025 10:26 PM, Jiayuan Chen wrote: > BPF CI has failed 3 times in the last 24 hours. Add retry for ENOMEM. > It's similar to the optimization plan: > commit 2f553b032cad ("selftsets/bpf: Retry map update for non-preallocated per-cpu map") > > Failed CI: > https://github.com/kernel-patches/bpf/actions/runs/13549227497/job/37868926343 > https://github.com/kernel-patches/bpf/actions/runs/13548089029/job/37865812030 > https://github.com/kernel-patches/bpf/actions/runs/13553536268/job/37883329296 Er, it is suspicious that the allocation of htab_elem failed, because the size of key and value is small in these cases. It usually means that there is something wrong with bpf mem allocator or the slub, therefore, I think it is a bad idea to retry for ENOMEM in this case. Will try to reproduce the problem first when get some time. > > selftests/bpf: Fixes for test_maps test > Fork 100 tasks to 'test_update_delete' > Fork 100 tasks to 'test_update_delete' > Fork 100 tasks to 'test_update_delete' > Fork 100 tasks to 'test_update_delete' > ...... > test_task_storage_map_stress_lookup:PASS > test_maps: OK, 0 SKIPPED > > Signed-off-by: Jiayuan Chen <jiayuan.chen@xxxxxxxxx> > --- > tools/testing/selftests/bpf/test_maps.c | 9 +++++---- > 1 file changed, 5 insertions(+), 4 deletions(-) > > diff --git a/tools/testing/selftests/bpf/test_maps.c b/tools/testing/selftests/bpf/test_maps.c > index 8b40e9496af1..986ce32b113a 100644 > --- a/tools/testing/selftests/bpf/test_maps.c > +++ b/tools/testing/selftests/bpf/test_maps.c > @@ -1396,9 +1396,10 @@ static void test_map_stress(void) > #define MAX_DELAY_US 50000 > #define MIN_DELAY_RANGE_US 5000 > > -static bool retry_for_again_or_busy(int err) > +static bool can_retry(int err) > { > - return (err == EAGAIN || err == EBUSY); > + return (err == EAGAIN || err == EBUSY || > + (err == ENOMEM && map_opts.map_flags == BPF_F_NO_PREALLOC)); > } > > int map_update_retriable(int map_fd, const void *key, const void *value, int flags, int attempts, > @@ -1451,12 +1452,12 @@ static void test_update_delete(unsigned int fn, void *data) > > if (do_update) { > err = map_update_retriable(fd, &key, &value, BPF_NOEXIST, MAP_RETRIES, > - retry_for_again_or_busy); > + can_retry); > if (err) > printf("error %d %d\n", err, errno); > assert(err == 0); > err = map_update_retriable(fd, &key, &value, BPF_EXIST, MAP_RETRIES, > - retry_for_again_or_busy); > + can_retry); > if (err) > printf("error %d %d\n", err, errno); > assert(err == 0);