RE: Next steps on Web Analytics Project

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi!

> -----Original Message-----
> From: S Moonesamy [mailto:sm+ietf@xxxxxxxxxxxx]
> Sent: Friday, September 27, 2019 7:29 PM
> To: Stephen Farrell <stephen.farrell@xxxxxxxxx>; ietf@xxxxxxxx
> Cc: Roman Danyliw <rdd@xxxxxxxx>
> Subject: Re: Next steps on Web Analytics Project
> 
> Hi Stephen,
> At 03:11 PM 27-09-2019, Stephen Farrell wrote:
> >Yes, tracking what and when becomes possible.
> >
> >I'm also unhappy with that. Is there no way to ensure that addresses
> >and geolocated regions are sufficiently aggregated so as to not
> >identify individuals?
> >
> >/16's and countries are not sufficient for all IETFers.
> >
> >I'm sure someone who reads this list would have a fair chance at
> >(re-)identifying various individuals based on time, /16 or /48, and
> >URL.
> 
> It is technically possible to identify a person or a small set of persons even if
> the IPv4 addresses are aggregated by /14.  I suggest stepping back a little.
> The technical solution is being used to drive the policy statement.  Would it
> be better to do the reverse to figure out what is feasible [1]?  That would
> entail flushing out the policy statement to get a sense of what information
> IESG members [2][3] would find useful.

I'm not entirely following how the technical solution is driving the policy statement (i.e., motivation for the project).  Section 1.1 identifies what information is useful -- the use cases and questions that would be helpful to answering for improving the web-site.  Section 2.2 describes a candidate solution based on needs dictated by the use cases.  Section 2.3 provides a mapping between the individual data elements that will be collected by the solution and these motivating use cases.  Section 3 and 4 acknowledges that there are security and privacy issues in implementing this policy and provides a series of mitigations.  

You're right, ultimately, the technology solution (Matamo) does drive some of the mitigations as it provides only certain types of anonymization and aggregation primitives.

Regards,
Roman

> The data processor could then use a
> "custom dimension" to decrease the probability of identification of that
> small set of persons.
>
> Regards,
> S. Moonesamy
> 
> 1. Please see the P.S. in your email
> 2. One of the issues is that web analytics usually use the IP addresses to
> aggregate by country.  Does an IESG member need to know whether Country
> X has expressed an interest in, for example, the IESG history of appeals?
> 3. Does an IETF LLC Director need to know who is reading the monthly
> financial statements?





[Index of Archives]     [IETF Annoucements]     [IETF]     [IP Storage]     [Yosemite News]     [Linux SCTP]     [Linux Newbies]     [Mhonarc]     [Fedora Users]

  Powered by Linux