Re: [RFC PATCH 1/4] IPMI: Add one interface to get more info of low-level IPMI device

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Mon, 2010-11-15 at 15:52 +0800, ykzhao wrote:
> On Thu, 2010-11-04 at 08:41 +0800, Corey Minyard wrote:
> > On 11/02/2010 12:33 AM, ykzhao wrote:
> > > On Thu, 2010-10-28 at 09:00 +0800, ykzhao wrote:
> > >
> > >> On Thu, 2010-10-28 at 00:13 +0800, Corey Minyard wrote:
> > >>
> > >>> I think you miss the point of a refcount.  The refcount is to keep the
> > >>> data structure around even if the device has gone away, so that any
> > >>> users may not crash.  Just having a counter in the IPMI structure and
> > >>> then decrementing the device refcount is not going to accomplish anything.
> > >>>
> > >>> And you don't really need to refcount the IPMI structure.  (And if you
> > >>> did, you would need to use a refcount structure and you would need to
> > >>> keep from freeing the IPMI structure until the refcount went to zero.)
> > >>> You just need to increment the refcount on the device structure, and
> > >>> whoever gets the device structure needs to be responsible for
> > >>> decrementing it's refcount.
> > >>>
> > >> Yes. After the copy mechanism is used, it is unnecessary to use the
> > >> refcount to protect the corresponding data structure.
> > >>
> > >> In fact the purpose of adding another one refcount is to add/decrease
> > >> the refcount of dev in course of calling the function of
> > >> ipmi_get_smi_info/ipmi_put_smi_info.
> > >>       But after I look at the smi_watcher mechanism, I find that the
> > >> refcount of dev is not released correctly in the following case.
> > >>          Step 1:  Other driver registers one smi_watcher and the
> > >> ipmi_get_smi_info is called in the new_smi callback function. (And the
> > >> driver will use the dev pointer before the ipmi_put_smi_info is
> > >> called).
> > >>          Step 2: for any reason, the IPMI device will be unregistered by calling
> > >> cleanup_one_si. This function will remove it from the IPMI device list
> > >> firstly and then call the smi_gone for every smi_watcher.
> > >>          Step 3: the ipmi_put_smi_info is called in smi_gone callback function.
> > >> In such case the ipmi_put_smi_info can't find the corresponding IPMI
> > >> device based on the given if_num and can't release the pointer of dev.
> > >>
> > >> But after adding another refcount, we can handle the above scenario. At
> > >> the same time even when the ipmi_put_smi_info is not called by the
> > >> caller, it still can work.
> > >> Not sure whether the above thought is redundant?
> > >>
> > > Corey,
> > >      Is the above thought correct? If not, please correct me. In fact the
> > > purpose of adding another refcount is to fix the incorrect inc/dec in
> > > course of calling ipmi_get_smi_info/ipmi_put_smi_info.
> > >
> >
> 
> Hi, Corey
>     Sorry for the late response.
> 
> > No.  The purpose of a refcount is to keep from freeing a data structure
> > while it is in use.
> >
> > Let's say your code gets the device structure for the IPMI device, then
> > a task switch occurs.  During the time it is not running, the IPMI
> > device is hotplugged away and no longer exists.  The device is deleted.
> > When your code starts running again, it still has a pointer to that
> > device structure.  Sure, the removal code may have run, but your code
> > still has a pointer to that device structure.  Which has been freed.  Bad.
> 
> Thanks for the detailed explanation about the refcount.
> 
> > This is what refcounts (well, krefs) do.  In the above scenario, your
> > code will get the device with the kref incremented.  All the above
> > happens, but since you have incremented the refcount on the device
> > structure, it will not be deleted.  It will wait until you put
> > (decrement) the refcount before performing the free.
> 
> The above consideration is based on the assumption that the function of
> ipmi_put_smi_info is called in the function of smi_watcher smi_gone
> callback function. Not sure whether it is correct? Or is there any
> problem about the definition of ipmi_put_smi_info?
> 
> ï void ipmi_put_smi_info(int if_num)
> +{
> +       ipmi_smi_t    intf;
> +       struct ipmi_smi_handlers *handlers;
> +
> +       mutex_lock(&ipmi_interfaces_mutex);
> +       list_for_each_entry_rcu(intf, &ipmi_interfaces, link) {
> +               if (intf->intf_num == if_num)
> +                       goto found;
> +       }
> +       /* Not found, return */
> +       mutex_unlock(&ipmi_interfaces_mutex);
> +       return;
> +
> +found:
> +       handlers = intf->handlers;
> +       handlers->put_smi_info(intf->send_info);
> +       mutex_unlock(&ipmi_interfaces_mutex);
> +}
> 
> 
> 
> The refcount of kref can't be decreased correctly in the following two
> cases:
>         a. When one smi_watcher is unregistered by calling
> ipmi_smi_watcher_unregister, we only delete it from the smi_watcher list
> and won't call the smi_gone callback function.
>         b. The function of cleanup_one_si is to unregister one IPMI interface.
> It will remove the ipmi interface from the corresponding ipmi list and
> then call smi_gone callback function for every smi_watcher. As the
> corresponding IPMI interface is already removed from the IPMI interface
> list, it can't find the corresponding IPMI interface based on the
> interface number. In such case the refcount of dev can't be released
> correctly.

Hi, Corey
	What do you think about the above description for the refcount of dev?
Does it make sense that the following prototype is used for the
ipmi_put_smi_info?
	void ipmi_put_smi_info(ïstruct ipmi_smi_info *data)
	{
		put_device(data->dev);	
	}
	
> >
> > >
> > >>
> > >>
> > >>> I'd also prefer to not have a struct ipmi_smi_info in the IPMI data
> > >>> structure.  Just pull the data from existing sources.
> > >>>
> > >> At most cases the requirement is different for the different addr source
> > >> type. If we don't put the ipmi_smi_info in the IPMI data structure, I am
> > >> afraid that we will have to fill the corresponding info based on the
> > >> addr source type. Maybe it will be more complex.
> > >>
> > > If only pull data from the existing source, it can save some
> > > memory-space. But it seems a bit complex as we have to fill the info
> > > based on the corresponding addr source type. And this should be done
> > > every time the function of ipmi_get_smi_info is called.
> > >
> >
> > I don't think it's that complex, and I doubt that function is called
> > that often.
> 
> Ok. I will refresh the patch set and fill the corresponding data
> structure from the existing source. But as I don't know what info is
> required for the other ipmi type except the SI_ACPI type, I will only
> add the info of SI_ACPI type.

ïFor the device type and dev pointer, I can get it from the current
smi_info directly.  But I meet with the following issues when pulling
data from the existing source to fill the specific data. 
	a. no specific info is contained in smi_info. For example: ACPI handle
is important to the SI_ACPI type. But the corresponding handle is stored
in smi_info. 
	b. Need to add a lot of If/else macro definition to handle the
corresponding IPMI device type.

Any idea about the above issues? 

Thanks.

> 
> >
> > -corey
> > >
> > >> Thanks.
> > >>          Yakui
> > >>
> > >>
> > >>> -corey
> > >>>
> > >>> On 10/26/2010 04:14 AM, yakui.zhao@xxxxxxxxx wrote:
> > >>>
> > >>>> From: Zhao Yakui<yakui.zhao@xxxxxxxxx>
> > >>>>
> > >>>> The IPMI smi_watcher will be used to catch the IPMI interface as they come or go.
> > >>>> In order to communicate with the correct IPMI device, it should be confirmed
> > >>>>    whether it is what we wanted especially on the system with multiple IPMI
> > >>>> devices. But the new_smi callback function of smi_watcher provides very
> > >>>> limited info(only the interface number and dev pointer) and there is no
> > >>>> detailed info about the low level interface. For example: which mechansim
> > >>>> registers the IPMI interface(ACPI, PCI, DMI and so on).
> > >>>>
> > >>>> This is to add one interface that can get more info of low-level IPMI
> > >>>> device. For example: the ACPI device handle will be returned for the pnp_acpi
> > >>>> IPMI device.
> > >>>>
> > >>>> Signed-off-by: Zhao Yakui<yakui.zhao@xxxxxxxxx>
> > >>>> ---
> > >>>>    drivers/char/ipmi/ipmi_msghandler.c |   46 ++++++++++++++++++++++++++++++++
> > >>>>    drivers/char/ipmi/ipmi_si_intf.c    |   49 +++++++++++++++++++++++++++++++---
> > >>>>    include/linux/ipmi.h                |   29 ++++++++++++++++++++
> > >>>>    include/linux/ipmi_smi.h            |    8 +++++
> > >>>>    4 files changed, 127 insertions(+), 5 deletions(-)
> > >>>>
> > >>>> diff --git a/drivers/char/ipmi/ipmi_msghandler.c b/drivers/char/ipmi/ipmi_msghandler.c
> > >>>> index 4f3f8c9..e323edb 100644
> > >>>> --- a/drivers/char/ipmi/ipmi_msghandler.c
> > >>>> +++ b/drivers/char/ipmi/ipmi_msghandler.c
> > >>>> @@ -970,6 +970,52 @@ out_kfree:
> > >>>>    }
> > >>>>    EXPORT_SYMBOL(ipmi_create_user);
> > >>>>
> > >>>> +int ipmi_get_smi_info(int if_num, struct ipmi_smi_info *data)
> > >>>> +{
> > >>>> +   int           rv = 0;
> > >>>> +   ipmi_smi_t    intf;
> > >>>> +   struct ipmi_smi_handlers *handlers;
> > >>>> +
> > >>>> +   mutex_lock(&ipmi_interfaces_mutex);
> > >>>> +   list_for_each_entry_rcu(intf,&ipmi_interfaces, link) {
> > >>>> +           if (intf->intf_num == if_num)
> > >>>> +                   goto found;
> > >>>> +   }
> > >>>> +   /* Not found, return an error */
> > >>>> +   rv = -EINVAL;
> > >>>> +   mutex_unlock(&ipmi_interfaces_mutex);
> > >>>> +   return rv;
> > >>>> +
> > >>>> +found:
> > >>>> +   handlers = intf->handlers;
> > >>>> +   rv = handlers->get_smi_info(intf->send_info, data);
> > >>>> +   mutex_unlock(&ipmi_interfaces_mutex);
> > >>>> +
> > >>>> +   return rv;
> > >>>> +}
> > >>>> +EXPORT_SYMBOL(ipmi_get_smi_info);
> > >>>> +
> > >>>> +void ipmi_put_smi_info(int if_num)
> > >>>> +{
> > >>>> +   ipmi_smi_t    intf;
> > >>>> +   struct ipmi_smi_handlers *handlers;
> > >>>> +
> > >>>> +   mutex_lock(&ipmi_interfaces_mutex);
> > >>>> +   list_for_each_entry_rcu(intf,&ipmi_interfaces, link) {
> > >>>> +           if (intf->intf_num == if_num)
> > >>>> +                   goto found;
> > >>>> +   }
> > >>>> +   /* Not found, return */
> > >>>> +   mutex_unlock(&ipmi_interfaces_mutex);
> > >>>> +   return;
> > >>>> +
> > >>>> +found:
> > >>>> +   handlers = intf->handlers;
> > >>>> +   handlers->put_smi_info(intf->send_info);
> > >>>> +   mutex_unlock(&ipmi_interfaces_mutex);
> > >>>> +}
> > >>>> +EXPORT_SYMBOL(ipmi_put_smi_info);
> > >>>> +
> > >>>>    static void free_user(struct kref *ref)
> > >>>>    {
> > >>>>      ipmi_user_t user = container_of(ref, struct ipmi_user, refcount);
> > >>>> diff --git a/drivers/char/ipmi/ipmi_si_intf.c b/drivers/char/ipmi/ipmi_si_intf.c
> > >>>> index 7bd7c45..d313018 100644
> > >>>> --- a/drivers/char/ipmi/ipmi_si_intf.c
> > >>>> +++ b/drivers/char/ipmi/ipmi_si_intf.c
> > >>>> @@ -57,6 +57,7 @@
> > >>>>    #include<asm/irq.h>
> > >>>>    #include<linux/interrupt.h>
> > >>>>    #include<linux/rcupdate.h>
> > >>>> +#include<linux/ipmi.h>
> > >>>>    #include<linux/ipmi_smi.h>
> > >>>>    #include<asm/io.h>
> > >>>>    #include "ipmi_si_sm.h"
> > >>>> @@ -107,10 +108,6 @@ enum si_type {
> > >>>>    };
> > >>>>    static char *si_to_str[] = { "kcs", "smic", "bt" };
> > >>>>
> > >>>> -enum ipmi_addr_src {
> > >>>> -   SI_INVALID = 0, SI_HOTMOD, SI_HARDCODED, SI_SPMI, SI_ACPI, SI_SMBIOS,
> > >>>> -   SI_PCI, SI_DEVICETREE, SI_DEFAULT
> > >>>> -};
> > >>>>    static char *ipmi_addr_src_to_str[] = { NULL, "hotmod", "hardcoded", "SPMI",
> > >>>>                                      "ACPI", "SMBIOS", "PCI",
> > >>>>                                      "device-tree", "default" };
> > >>>> @@ -291,6 +288,12 @@ struct smi_info {
> > >>>>      struct task_struct *thread;
> > >>>>
> > >>>>      struct list_head link;
> > >>>> +   /*
> > >>>> +    * Count the refcount of smi_data->dev related with the function
> > >>>> +    * of ipmi_get_smi_info/ipmi_put_smi_info.
> > >>>> +    */
> > >>>> +   atomic_t        smi_info_ref;
> > >>>> +   struct ipmi_smi_info smi_data;
> > >>>>    };
> > >>>>
> > >>>>    #define smi_inc_stat(smi, stat) \
> > >>>> @@ -1186,6 +1189,27 @@ static int smi_start_processing(void       *send_info,
> > >>>>      return 0;
> > >>>>    }
> > >>>>
> > >>>> +static int get_smi_info(void *send_info, struct ipmi_smi_info *data)
> > >>>> +{
> > >>>> +   struct smi_info *new_smi = send_info;
> > >>>> +   struct ipmi_smi_info *smi_data =&new_smi->smi_data;
> > >>>> +
> > >>>> +   get_device(new_smi->dev);
> > >>>> +   memcpy(data, smi_data, sizeof(*smi_data));
> > >>>> +   smi_data->addr_src = new_smi->addr_source;
> > >>>> +   atomic_inc(&new_smi->smi_info_ref);
> > >>>> +
> > >>>> +   return 0;
> > >>>> +}
> > >>>> +
> > >>>> +static void put_smi_info(void *send_info)
> > >>>> +{
> > >>>> +   struct smi_info *new_smi = send_info;
> > >>>> +
> > >>>> +   put_device(new_smi->dev);
> > >>>> +   atomic_dec(&new_smi->smi_info_ref);
> > >>>> +}
> > >>>> +
> > >>>>    static void set_maintenance_mode(void *send_info, int enable)
> > >>>>    {
> > >>>>      struct smi_info   *smi_info = send_info;
> > >>>> @@ -1197,6 +1221,8 @@ static void set_maintenance_mode(void *send_info, int enable)
> > >>>>    static struct ipmi_smi_handlers handlers = {
> > >>>>      .owner                  = THIS_MODULE,
> > >>>>      .start_processing       = smi_start_processing,
> > >>>> +   .get_smi_info           = get_smi_info,
> > >>>> +   .put_smi_info           = put_smi_info,
> > >>>>      .sender                 = sender,
> > >>>>      .request_events         = request_events,
> > >>>>      .set_maintenance_mode   = set_maintenance_mode,
> > >>>> @@ -2143,6 +2169,8 @@ static int __devinit ipmi_pnp_probe(struct pnp_dev *dev,
> > >>>>              return -ENOMEM;
> > >>>>
> > >>>>      info->addr_source = SI_ACPI;
> > >>>> +   info->smi_data.addr_info.acpi_info.acpi_handle =
> > >>>> +                           acpi_dev->handle;
> > >>>>      printk(KERN_INFO PFX "probing via ACPI\n");
> > >>>>
> > >>>>      handle = acpi_dev->handle;
> > >>>> @@ -3092,6 +3120,7 @@ static int try_smi_init(struct smi_info *new_smi)
> > >>>>    {
> > >>>>      int rv = 0;
> > >>>>      int i;
> > >>>> +   struct ipmi_smi_info *smi_data;
> > >>>>
> > >>>>      printk(KERN_INFO PFX "Trying %s-specified %s state"
> > >>>>             " machine at %s address 0x%lx, slave address 0x%x,"
> > >>>> @@ -3255,6 +3284,9 @@ static int try_smi_init(struct smi_info *new_smi)
> > >>>>
> > >>>>      dev_info(new_smi->dev, "IPMI %s interface initialized\n",
> > >>>>               si_to_str[new_smi->si_type]);
> > >>>> +   smi_data =&new_smi->smi_data;
> > >>>> +   smi_data->dev = new_smi->dev;
> > >>>> +   atomic_set(&new_smi->smi_info_ref, 0);
> > >>>>
> > >>>>      return 0;
> > >>>>
> > >>>> @@ -3501,7 +3533,14 @@ static void cleanup_one_si(struct smi_info *to_clean)
> > >>>>              printk(KERN_ERR PFX "Unable to unregister device: errno=%d\n",
> > >>>>                     rv);
> > >>>>      }
> > >>>> -
> > >>>> +   /* maybe the caller doesn't release the refcount of dev, which is
> > >>>> +    * increased by calling the function of ipmi_get_smi_info. So try
> > >>>> +    * to help it.
> > >>>> +    */
> > >>>> +   while (atomic_read(&to_clean->smi_info_ref)) {
> > >>>> +           put_device(to_clean->dev);
> > >>>> +           atomic_dec(&to_clean->smi_info_ref);
> > >>>> +   }
> > >>>>      if (to_clean->handlers)
> > >>>>              to_clean->handlers->cleanup(to_clean->si_sm);
> > >>>>
> > >>>> diff --git a/include/linux/ipmi.h b/include/linux/ipmi.h
> > >>>> index 65aae34..9a0c72e 100644
> > >>>> --- a/include/linux/ipmi.h
> > >>>> +++ b/include/linux/ipmi.h
> > >>>> @@ -454,6 +454,35 @@ unsigned int ipmi_addr_length(int addr_type);
> > >>>>    /* Validate that the given IPMI address is valid. */
> > >>>>    int ipmi_validate_addr(struct ipmi_addr *addr, int len);
> > >>>>
> > >>>> +enum ipmi_addr_src {
> > >>>> +   SI_INVALID = 0, SI_HOTMOD, SI_HARDCODED, SI_SPMI, SI_ACPI, SI_SMBIOS,
> > >>>> +   SI_PCI, SI_DEVICETREE, SI_DEFAULT
> > >>>> +};
> > >>>> +struct ipmi_smi_info {
> > >>>> +   enum ipmi_addr_src addr_src;
> > >>>> +   struct device *dev;
> > >>>> +   /*
> > >>>> +    * The addr_info can provide more detailed info of one IPMI device.
> > >>>> +    * Now only SI_ACPI info is provided. And it depends on the SI_ACPI
> > >>>> +    * address type. If the info is required for other address type, please
> > >>>> +    * add it.
> > >>>> +    */
> > >>>> +   union {
> > >>>> +           /* the acpi_info element is defined for the SI_ACPI
> > >>>> +            * address type
> > >>>> +            */
> > >>>> +           struct {
> > >>>> +                   void *acpi_handle;
> > >>>> +           } acpi_info;
> > >>>> +   } addr_info;
> > >>>> +};
> > >>>> +
> > >>>> +/* This is to get the private info of ipmi_smi_t */
> > >>>> +extern int ipmi_get_smi_info(int if_num, struct ipmi_smi_info *data);
> > >>>> +/* This is to decrease refcount of dev added in the function of
> > >>>> + * ipmi_get_smi_info
> > >>>> + */
> > >>>> +extern void ipmi_put_smi_info(int if_num);
> > >>>>    #endif /* __KERNEL__ */
> > >>>>
> > >>>>
> > >>>> diff --git a/include/linux/ipmi_smi.h b/include/linux/ipmi_smi.h
> > >>>> index 4b48318..b99942d 100644
> > >>>> --- a/include/linux/ipmi_smi.h
> > >>>> +++ b/include/linux/ipmi_smi.h
> > >>>> @@ -86,6 +86,14 @@ struct ipmi_smi_handlers {
> > >>>>      int (*start_processing)(void       *send_info,
> > >>>>                              ipmi_smi_t new_intf);
> > >>>>
> > >>>> +   /*
> > >>>> +    * Get the detailed private info of the low level interface and store
> > >>>> +    * it into the structure of ipmi_smi_data. For example: the
> > >>>> +    * ACPI device handle will be returned for the pnp_acpi IPMI device.
> > >>>> +    */
> > >>>> +   int (*get_smi_info)(void *send_info, struct ipmi_smi_info *data);
> > >>>> +
> > >>>> +   void (*put_smi_info)(void *send_info);
> > >>>>      /* Called to enqueue an SMI message to be sent.  This
> > >>>>         operation is not allowed to fail.  If an error occurs, it
> > >>>>         should report back the error in a received message.  It may
> > >>>>
> > >>>>
> > >>>
> > >> --
> > >> To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
> > >> the body of a message to majordomo@xxxxxxxxxxxxxxx
> > >> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> > >>
> > >
> >
> 
> --
> To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

--
To unsubscribe from this list: send the line "unsubscribe linux-acpi" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux IBM ACPI]     [Linux Power Management]     [Linux Kernel]     [Linux Laptop]     [Kernel Newbies]     [Share Photos]     [Security]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Samba]     [Video 4 Linux]     [Device Mapper]     [Linux Resources]

  Powered by Linux