On Wed, 2017-01-18 at 18:11 +0300, Konstantin Khlebnikov wrote: > On 18.01.2017 17:23, Eric Dumazet wrote: > > > > Take a look at sk_filter_trim_cap(), where the RX packets received on a > > socket which does not have SOCK_MEMALLOC is dropped. > > > > /* > > * If the skb was allocated from pfmemalloc reserves, only > > * allow SOCK_MEMALLOC sockets to use it as this socket is > > * helping free memory > > */ > > if (skb_pfmemalloc(skb) && !sock_flag(sk, SOCK_MEMALLOC)) > > return -ENOMEM; > > I suppose this happens in BH context right after receiving packet? > > Potentially any ACK could free memory in TCP send queue, > so using reserves here makes sense. Yes, but only sockets with SOCK_MEMALLOC have this contract with the mm layer. For 'other' sockets, one possible trick would be that if only the page fragment attached to skb had the pfmemalloc bit, and not the sk_buff itself, we could attempt a skb_condense() operation [1], but it is not really easy to properly recompute skb->pfmemalloc. Pure TCP ACK packets can usually be trimmed by skb_condense(). Since they have no payload, we have a guarantee they wont sit in a queue and hold memory. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>