From: Chris Horn <chris.horn@xxxxxxx> lnet_peer_merge_data() and lnet_peer_set_primary_data() are responsible for dropping the reference on the ping buffer that is taken by lnet_peer_push_event() and lnet_discovery_event_reply(). However, there are some error paths in lnet_peer_data_present() where we do not call either lnet_peer_merge_data() or lnet_peer_set_primary_data(). In these cases, we need to drop the reference on the ping buffer otherwise it will leak. HPE-bug-id: LUS-10715 WC-bug-id: https://jira.whamcloud.com/browse/LU-15509 Lustre-commit: 4de9793654ec1b2f0 ("LU-15509 lnet: Ping buffer ref leak in lnet_peer_data_present") Signed-off-by: Chris Horn <chris.horn@xxxxxxx> Reviewed-on: https://review.whamcloud.com/46431 Reviewed-by: Andriy Skulysh <andriy.skulysh@xxxxxxx> Reviewed-by: James Simmons <jsimmons@xxxxxxxxxxxxx> Reviewed-by: Serguei Smirnov <ssmirnov@xxxxxxxxxxxxx> Reviewed-by: Cyril Bordage <cbordage@xxxxxxxxxxxxx> Reviewed-by: Oleg Drokin <green@xxxxxxxxxxxxx> Signed-off-by: James Simmons <jsimmons@xxxxxxxxxxxxx> --- net/lnet/lnet/peer.c | 9 +++++++-- 1 file changed, 7 insertions(+), 2 deletions(-) diff --git a/net/lnet/lnet/peer.c b/net/lnet/lnet/peer.c index 57d137c..3909c5d 100644 --- a/net/lnet/lnet/peer.c +++ b/net/lnet/lnet/peer.c @@ -3317,8 +3317,10 @@ static int lnet_peer_data_present(struct lnet_peer *lp) * down, and our reference count may be all that is keeping it * alive. Don't do any work on it. */ - if (list_empty(&lp->lp_peer_list)) + if (list_empty(&lp->lp_peer_list)) { + lnet_ping_buffer_decref(pbuf); goto out; + } flags = LNET_PEER_DISCOVERED; if (pbuf->pb_info.pi_features & LNET_PING_FEAT_MULTI_RAIL) @@ -3345,7 +3347,9 @@ static int lnet_peer_data_present(struct lnet_peer *lp) nid = pbuf->pb_info.pi_ni[1].ns_nid; if (nid_is_lo0(&lp->lp_primary_nid)) { rc = lnet_peer_set_primary_nid(lp, nid, flags); - if (!rc) + if (rc) + lnet_ping_buffer_decref(pbuf); + else rc = lnet_peer_merge_data(lp, pbuf); /* if the primary nid of the peer is present in the ping info returned * from the peer, but it's not the local primary peer we have @@ -3367,6 +3371,7 @@ static int lnet_peer_data_present(struct lnet_peer *lp) CERROR("Primary NID error %s versus %s: %d\n", libcfs_nidstr(&lp->lp_primary_nid), libcfs_nid2str(nid), rc); + lnet_ping_buffer_decref(pbuf); } else { rc = lnet_peer_merge_data(lp, pbuf); } -- 1.8.3.1