On Tue, Aug 25, 2020 at 01:06:23AM +0530, Selvin Xavier wrote:
> On Tue, Aug 25, 2020 at 12:31 AM Leon Romanovsky <leon@xxxxxxxxxx> wrote:
> >
> > On Mon, Aug 24, 2020 at 11:14:31AM -0700, Selvin Xavier wrote:
> > > Driver crashes when destroy_qp is re-tried because of an
> > > error returned. This is because the qp entry was removed
> > > from the qp list during the first call.
> >
> > How is it possible for destroy_qp to fail?
> >
> One possibility is when the FW is in a crash state. Driver commands
> to FW fail and it reports an error status for the destroy_qp verb.
> Even though the chances of this failure are low, I wanted to avoid the
> host crash seen in this scenario.

Drivers are not allowed to fail destroy - the only exception is if a
future destroy would succeed for some reason.

This patch should ignore the return code from FW and clean up all the
host memory.

If the FW is not responding then the device should be killed and the
DMA allowed bit turned off in the PCI config space.

Jason
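
For illustration only, the "ignore the FW return code and clean up
host memory" rule could be sketched in plain userspace C as below.
This is not the bnxt_re code: every name here (demo_dev, demo_qp,
demo_fw_destroy_qp, demo_destroy_qp) is hypothetical, and the firmware
is simulated by a flag. The point is only that destroy logs the FW
error, still unlinks and frees the QP, and returns success, so a retry
can never find a half-removed entry.

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical stand-ins; none of these names come from the real driver. */
struct demo_qp {
    int id;
    struct demo_qp *next;
};

struct demo_dev {
    struct demo_qp *qp_list; /* singly linked list of live QPs */
    int fw_dead;             /* nonzero simulates crashed firmware */
};

/* Simulated firmware command: fails when the firmware is unresponsive. */
static int demo_fw_destroy_qp(struct demo_dev *dev, int qp_id)
{
    (void)qp_id;
    return dev->fw_dead ? -1 : 0;
}

/* Allocate a QP and push it onto the device's list. */
static struct demo_qp *demo_create_qp(struct demo_dev *dev, int id)
{
    struct demo_qp *qp = malloc(sizeof(*qp));

    if (!qp)
        return NULL;
    qp->id = id;
    qp->next = dev->qp_list;
    dev->qp_list = qp;
    return qp;
}

/* Remove qp from the device's list if it is present. */
static void demo_unlink_qp(struct demo_dev *dev, struct demo_qp *qp)
{
    struct demo_qp **pp = &dev->qp_list;

    while (*pp && *pp != qp)
        pp = &(*pp)->next;
    if (*pp)
        *pp = qp->next;
}

/*
 * Destroy never fails: the firmware status is logged and otherwise
 * ignored, and all host-side state is released unconditionally.
 */
static int demo_destroy_qp(struct demo_dev *dev, struct demo_qp *qp)
{
    int rc;

    assert(dev && qp);
    rc = demo_fw_destroy_qp(dev, qp->id);
    if (rc)
        fprintf(stderr, "qp %d: FW destroy failed (%d), cleaning up anyway\n",
                qp->id, rc);

    demo_unlink_qp(dev, qp);
    free(qp);
    return 0;
}
```

With fw_dead set, demo_destroy_qp() still returns 0 and leaves the list
empty. Killing the device and clearing the DMA enable bit is a separate
recovery step that this sketch does not attempt to model.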