On Tue, Nov 24, 2015 at 09:21:06PM +0100, Ilya Dryomov wrote: > >> Cleanup here is (and should be) done in reverse order. > > Yes. This is true. > > I have got an other impression about the appropriate order for the corresponding > > clean-up function calls. > > > > > >> We allocate parent rbd_device and then link it with what we already have, > > > > I guess that we have got a different understanding about the relevant "linking". > > Well, there isn't any _literal_ linking (e.g. adding to a link list, > etc) in this case. We just bump some refs and do probe to fill in the > newly allocated parent. If probe fails, we put refs and free parent, > reversing the "alloc parent, bump refs" order. > > The actual linking (rbd_dev->parent = parent) is done right before > returning so we never have to undo it in rbd_dev_probe_parent() and > that's the only reason your patch probably doesn't break anything. > Think about what happens if, after your patch is applied, someone moves > that assignment up or adds an extra step that can fail after it... > The problem is that the unwind code should be a mirror of the allocate code but rbd_dev_unparent() doesn't mirror anything. Generally, writing future proof stubs like this is a wrong thing because predicting the future is hard and in the mean time we are left stubs which confuse everyone. > If all error paths could be adjusted so that NULL pointers are never > passed in, destroy functions wouldn't need to have a NULL check, would > they? Yep. We agree on the right way to do it. I am probably the number one kernel developer for removing the most sanity checks. :P (As opposed to patch 1/1 where we now rely on the sanity check inside rbd_dev_destroy().) drivers/block/rbd.c 5149 static int rbd_dev_probe_parent(struct rbd_device *rbd_dev, int depth) 5150 { 5151 struct rbd_device *parent = NULL; 5152 int ret; 5153 5154 if (!rbd_dev->parent_spec) 5155 return 0; 5156 5157 if (++depth > RBD_MAX_PARENT_CHAIN_LEN) { 5158 pr_info("parent chain is too long (%d)\n", depth); 5159 ret = -EINVAL; 5160 goto out_err; We haven't allocated anything so this should just be return -EINVAL; In the original code, we decrement the kref count on ->parent_spec on this error path so that is a classic One Err Bug. 5161 } 5162 5163 parent = rbd_dev_create(rbd_dev->rbd_client, rbd_dev->parent_spec, 5164 NULL); 5165 if (!parent) { 5166 ret = -ENOMEM; 5167 goto out_err; Still haven't allocated anything so return -ENOMEM, but if we fail after this point we will need to call rbd_dev_destroy(). 5168 } 5169 5170 /* 5171 * Images related by parent/child relationships always share 5172 * rbd_client and spec/parent_spec, so bump their refcounts. 5173 */ 5174 __rbd_get_client(rbd_dev->rbd_client); 5175 rbd_spec_get(rbd_dev->parent_spec); We will need to put these on any later error paths. 5176 5177 ret = rbd_dev_image_probe(parent, depth); 5178 if (ret < 0) 5179 goto out_err; Ok. We need to put the ->parent_spec, ->rbd_client and free the parent. 5180 5181 rbd_dev->parent = parent; 5182 atomic_set(&rbd_dev->parent_ref, 1); 5183 return 0; 5184 5185 out_err: 5186 rbd_dev_unparent(rbd_dev); This is a complicated way to say rbd_spec_put(rbd_dev->parent_spec); Also, is it really necessary to set ->parent_spec to NULL? If we didn't put the last reference then doesn't setting it to NULL mean we are leaking? Setting it to NULL is confusing and feels like a layering violation. 5187 if (parent) 5188 rbd_dev_destroy(parent); 5189 return ret; 5190 } I feel like we should be calling rbd_put_client() on this error path or else the code is buggy or has layer violations. So I *think* it should look like this: dec_ref_counts: rbd_spec_put(rbd_dev->parent_spec); rbd_put_client(rbd_dev->rbd_client); rbd_dev_destroy(parent); return ret; regards, dan carpenter -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html