On 6/4/2021 12:37 AM, Zhu Yanjun wrote:
> After I added an rxe device on the netdev, I ran the rdma-core test tools.
> Then I removed the rxe device and, finally, unloaded the rdma_rxe kernel module.
> I found the following logs:
> "
> [ 1249.651921] rdma_rxe: rxe-pd pool destroyed with unfree'd elem
> [ 1249.651927] rdma_rxe: rxe-qp pool destroyed with unfree'd elem
> [ 1249.651929] rdma_rxe: rxe-cq pool destroyed with unfree'd elem
> "
> It seems that some resources are leaking.
> I will investigate further.
> Zhu Yanjun
Zhu,
I suspect this is an older error. I traced all the add-ref and drop-ref
calls for PDs and counted the adds and drops in two runs: the full suite
of Python tests, and test_mr (which includes the memory window tests)
by itself. For test_mr alone I get 85 adds and 85 drops, but when I
run the whole suite I get 384 adds and 380 drops. Since the memory
window code is only exercised in test_mr, I think it is OK. The missing
drops are somewhere else. I will try to isolate them.
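
For anyone who wants to repeat the count, here is a minimal sketch of how the adds and drops could be tallied from a captured log. It assumes trace lines of the hypothetical form "rdma_rxe: pd add_ref" / "rdma_rxe: pd drop_ref"; that format and the names tally_refs and outstanding are made up for illustration and are not what the driver prints.

```python
import re
from collections import Counter

# Hypothetical trace format (an assumption, not actual rdma_rxe output):
#   "[ 1249.651921] rdma_rxe: pd add_ref"
#   "[ 1249.651927] rdma_rxe: pd drop_ref"
TRACE_RE = re.compile(r"\b(?P<pool>pd|qp|cq)\s+(?P<op>add_ref|drop_ref)\b")


def tally_refs(lines):
    """Count add_ref/drop_ref trace lines per pool; other lines are ignored."""
    counts = Counter()
    for line in lines:
        match = TRACE_RE.search(line)
        if match:
            counts[(match.group("pool"), match.group("op"))] += 1
    return counts


def outstanding(counts, pool):
    """Adds minus drops for one pool; nonzero means unbalanced references."""
    return counts[(pool, "add_ref")] - counts[(pool, "drop_ref")]
```

Feeding it the lines of a saved log (for example tally_refs(open(path)), with path pointing at wherever the trace was captured) gives the per-pool balance. With the full-suite PD numbers above, 384 adds and 380 drops, outstanding would report 4.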
Bob