On Fri, 2017-08-11 at 00:33 +0300, Yuval Shaia wrote: > On Thu, Aug 10, 2017 at 09:26:06PM +0000, Adit Ranadive wrote: > > On 8/10/17, 2:01 PM, "Yuval Shaia" <yuval.shaia@xxxxxxxxxx> wrote: > > > Great, i understand that. > > > So, at least can you consider moving this dev_err into > > > pvrdma_idx_ring_has_data so callers do not need to handle errors? > > > > Thanks for the suggestion. It's something we will have to discuss > > internally, > > though I'm not sure if BUG_ON is the right way to go. That just > > seems a really > > No, i agree, BUG_ON is a mistake as ring corruption can be caused by > HW, BUG_ON is generally a mistake regardless. Even if it was an issue in the driver, another option would be to forcibly remove the ib_device and shut it all down. That would prevent a driver bug from taking the machine down, and therefore is preferable to a BUG_ON. If I see a BUG_ON, I'm immediately going to be asking why that particular condition is so bad that you think someone's Oracle server should die entirely versus limping along with some disabled device or something (because I know that's exactly what Linus will ask when he sees it). > my > suggestion is just to place the check and error print inside the > function > pvrdma_idx_ring_has_data so callers will not need to handle such > errors. > > Anyways, with or without taking the suggestion - fix lgtm. > > Reviewed-by: Yuval Shaia <yuval.shaia@xxxxxxxxxx> Pulled in for -rc. Thanks. > > big hammer to me. > > As it stands the patch should be added since it does fix a > > potential race > > condition. > > -- Doug Ledford <dledford@xxxxxxxxxx> GPG KeyID: B826A3330E572FDD Key fingerprint = AE6B 1BDA 122B 23B4 265B 1274 B826 A333 0E57 2FDD -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html