ImageJ 2.0.0-rc-11 released

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
39 messages Options
12
Reply | Threaded
Open this post in threaded view
|

ImageJ 2.0.0-rc-11 released

ctrueden
Hi everyone,

Today the ImageJ team is proud to announce a new public release candidate
for ImageJ2: version 2.0.0-rc-11.

This release contains several critical bug-fixes as well as new features,
including tracking of anonymous usage statistics, as well as Groovy
scripting support.

For details, see the News post on the ImageJ web site at:
    http://imagej.net/2014-08-08_-_ImageJ_2.0.0-rc-11_released

Cheers,
Curtis

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 released

Gabriel Landini
On Friday 08 Aug 2014 16:47:33 you wrote:
> Today the ImageJ team is proud to announce a new public release candidate
> for ImageJ2: version 2.0.0-rc-11.

Great to hear the IJ2 has reached rc11.

Looking at the New Features page, I think the collection of usage data should
be made an OPT IN option and so be set OFF by default.

Users might like to exercise some privacy by not being tracked, specially
about the "etc." mentioned in that page (?!).

Cheers

Gabriel

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 released

Mark Hiner-2
Hi Gabriel,

> specially about the "etc." mentioned in that page (?!).

I wrote this section and left it vague at the time because Curtis Rueden
was in the process of refining the database schema and setting up exactly
what was tracked. I realize that I should certainly not have used vague
language when discussing user privacy though, and apologize for that!

The release wiki page is updated to specify exactly what metadata is
stored: country, java version, language, operating system, time zone,
update site.

>I think the collection of usage data should be made an OPT IN option and
so be set OFF by default.

This was a tough decision for us. We certainly don't want anyone to feel
uncomfortable using ImageJ. But we are committed to keeping these
statistics anonymous, and this is extremely important information for an
open source project. This data will certainly be used to strengthen future
funding requests to continue the support of Fiji/ImageJ and its
collaborations.

So we decided to make it opt out to get as realistic a view of usage as we
can, while still allowing anyone who is uncomfortable to disable this
functionality.

Of course, any further discussion or objections to this feature are
welcome... I, personally, would like to better understand any specific
concerns people may have.

Thanks for the feedback,
Mark

On Sat, Aug 9, 2014 at 12:44 PM, Gabriel Landini <[hidden email]>
wrote:

> On Friday 08 Aug 2014 16:47:33 you wrote:
> > Today the ImageJ team is proud to announce a new public release candidate
> > for ImageJ2: version 2.0.0-rc-11.
>
> Great to hear the IJ2 has reached rc11.
>
> Looking at the New Features page, I think the collection of usage data
> should
> be made an OPT IN option and so be set OFF by default.
>
> Users might like to exercise some privacy by not being tracked, specially
> about the "etc." mentioned in that page (?!).
>
> Cheers
>
> Gabriel
>
> --
> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 updater

Knecht, David
In reply to this post by ctrueden
Hi Curtis- I tried to run the update today from within ImageJ2 and got into a loop in the updater where it seemed to be going around in circles for a long time doing the same files over and over.  I canceled and downloaded from the web site instead.  This was on a Mac running latest Lion.  Old version was 2.0.0-rc-2.  Dave

On Aug 8, 2014, at 5:47 PM, Curtis Rueden <[hidden email]> wrote:

> Hi everyone,
>
> Today the ImageJ team is proud to announce a new public release candidate
> for ImageJ2: version 2.0.0-rc-11.
>
> This release contains several critical bug-fixes as well as new features,
> including tracking of anonymous usage statistics, as well as Groovy
> scripting support.
>
> For details, see the News post on the ImageJ web site at:
>    http://imagej.net/2014-08-08_-_ImageJ_2.0.0-rc-11_released
>
> Cheers,
> Curtis
>
> --
> ImageJ mailing list: http://imagej.nih.gov/ij/list.html

Dr. David Knecht
Professor of Molecular and Cell Biology
Core Microscopy Facility Director
University of Connecticut
Storrs, CT 06269
860-486-2200

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 released

Herbie-4
In reply to this post by Mark Hiner-2
Sorry  Mark,

but

 > This data will certainly be used to strengthen future
 > funding requests to continue the support of Fiji/ImageJ and its
 > collaborations.

this is an unacceptable reason for gathering user data and I fear there
is no honest reason at all to do so.

 > "keeping these statistics anonymous"

Well, I'm convinced that you and your co-workers will but...

Storing user data is a no no!
Some members of the ImageJ-2 Team are criticizing Google an other data
collecting organizations since long and now...

If user data is gathered/transmitted by IJ-2, I shall, if possible,
comment out this portion of code. An OFF-option is insufficient.

Just my opinion because you were asking for further comments

Herbie

::::::::::::::::::::::::::::::::::::
On 11.08.14 16:11, Mark Hiner wrote:

> Hi Gabriel,
>
>> specially about the "etc." mentioned in that page (?!).
>
> I wrote this section and left it vague at the time because Curtis Rueden
> was in the process of refining the database schema and setting up exactly
> what was tracked. I realize that I should certainly not have used vague
> language when discussing user privacy though, and apologize for that!
>
> The release wiki page is updated to specify exactly what metadata is
> stored: country, java version, language, operating system, time zone,
> update site.
>
>> I think the collection of usage data should be made an OPT IN option and
> so be set OFF by default.
>
> This was a tough decision for us. We certainly don't want anyone to feel
> uncomfortable using ImageJ. But we are committed to keeping these
> statistics anonymous, and this is extremely important information for an
> open source project. This data will certainly be used to strengthen future
> funding requests to continue the support of Fiji/ImageJ and its
> collaborations.
>
> So we decided to make it opt out to get as realistic a view of usage as we
> can, while still allowing anyone who is uncomfortable to disable this
> functionality.
>
> Of course, any further discussion or objections to this feature are
> welcome... I, personally, would like to better understand any specific
> concerns people may have.
>
> Thanks for the feedback,
> Mark
>
> On Sat, Aug 9, 2014 at 12:44 PM, Gabriel Landini <[hidden email]>
> wrote:
>
>> On Friday 08 Aug 2014 16:47:33 you wrote:
>>> Today the ImageJ team is proud to announce a new public release candidate
>>> for ImageJ2: version 2.0.0-rc-11.
>>
>> Great to hear the IJ2 has reached rc11.
>>
>> Looking at the New Features page, I think the collection of usage data
>> should
>> be made an OPT IN option and so be set OFF by default.
>>
>> Users might like to exercise some privacy by not being tracked, specially
>> about the "etc." mentioned in that page (?!).
>>
>> Cheers
>>
>> Gabriel
>>
>> --
>> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>>
>
> --
> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 released

Gabriel Landini
In reply to this post by Mark Hiner-2
On Monday 11 Aug 2014 09:11:37 Mark Hiner wrote:
> I wrote:
> >I think the collection of usage data should be made an OPT IN option and
> > so be set OFF by default.
 
> This was a tough decision for us. We certainly don't want anyone to feel
> uncomfortable using ImageJ.

Good, so please use this as an opportunity to change the way the program gets
user permission to do it.
If this happens to be something people want to adhere to, then there is
nothing to worry about as there will be lots of users opting in when given the
chance. But at the moment this is essentially forcing everybody to tell what
they are doing with IJ in their machines without asking.

> But we are committed to keeping these statistics anonymous, and this
> is extremely important information for an open source project.
> This data will certainly be used to strengthen future
> funding requests to continue the support of Fiji/ImageJ and its
> collaborations.

Logging updates and downloads is one thing that could be quoted when applying
for funding (after all you need to contact the server to do that), but logging
plugin usage statistics in individual computers (i.e. what you do with the
software?) without user agreeing seems intrusive and over the top.

> So we decided to make it opt out to get as realistic a view of usage as we
> can, while still allowing anyone who is uncomfortable to disable this
> functionality.

SCIFIO was opt in, but usage tracking is opt out? It does not make sense.
Given privacy (or the lack of...) is such a hot topic at the moment it would
seem appropriate to do things right.

Regards

Gabriel

PS: Incidentally I also found that there seems to be no way of recording in
the macro language the Privacy Opt Out option. Can this be added too so it can
be included in the StartupMacros file? That way I can have the same setup
everywhere by transferring that file. Thanks.

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 released

Mark Hiner-2
In reply to this post by Herbie-4
Hi Herbie,

>If user data is gathered/transmitted by IJ-2, I shall, if possible,
comment out this portion of code. An OFF-option is insufficient.

Can you explain why an OFF option is insufficient..?

Anyway, like everything in the ImageJ-2 framework, usage uploads come from
a Service
<https://github.com/scijava/scijava-common/blob/master/src/main/java/org/scijava/service/Service.java>
plugin providing the indicated functionality - in this case, the
UsageUploadService
<https://github.com/imagej/imagej-usage/blob/master/src/main/java/net/imagej/usage/UsageUploadService.java>
.

All services can be overridden by creating another implementation with a
higher priority - for usage statistics, you can use the default
implementation
<https://github.com/imagej/imagej-usage/blob/master/src/main/java/net/imagej/usage/DefaultUsageUploadService.java>
as a guide. You could even distribute your disabled usage service on an update
site <http://fiji.sc/How_to_set_up_and_populate_an_update_site>, to share
with other users.

> this is an unacceptable reason for gathering user data and I fear there
is no honest reason at all to do so.

I personally do not write grants, but I am funded by them (along with the
rest of my coworkers). It is not enough to say "hey we're going to make the
next generation ImageJ framework... thanks for the money bye!" We need the
ability to measure and report our successes and failures, and use that
information to guide future funding and development priorities.

Please consider that some level of usage statistics for Fiji have been
collected
for nearly 5 years already <http://imagej.net/Fiji_Usage>. These latest
changes are just more granular, allowing us to track individual plugins and
update site use.

>Storing user data is a no no!

Agreed completely. I would not call what we are tracking "user data." We
are tracking "plugin data", and we are really just tracking plugin use
counts for a given environment.

So, I don't understand how honesty comes into the picture... perhaps you
could explain your feelings here? How do you think this data could be
misused?

Thanks for the response,
Mark



On Mon, Aug 11, 2014 at 11:00 AM, Herbie <[hidden email]> wrote:

> Sorry  Mark,
>
> but
>
>
> > This data will certainly be used to strengthen future
> > funding requests to continue the support of Fiji/ImageJ and its
> > collaborations.
>
> this is an unacceptable reason for gathering user data and I fear there is
> no honest reason at all to do so.
>
> > "keeping these statistics anonymous"
>
> Well, I'm convinced that you and your co-workers will but...
>
> Storing user data is a no no!
> Some members of the ImageJ-2 Team are criticizing Google an other data
> collecting organizations since long and now...
>
> If user data is gathered/transmitted by IJ-2, I shall, if possible,
> comment out this portion of code. An OFF-option is insufficient.
>
> Just my opinion because you were asking for further comments
>
> Herbie
>
> ::::::::::::::::::::::::::::::::::::
>
> On 11.08.14 16:11, Mark Hiner wrote:
>
>> Hi Gabriel,
>>
>>  specially about the "etc." mentioned in that page (?!).
>>>
>>
>> I wrote this section and left it vague at the time because Curtis Rueden
>> was in the process of refining the database schema and setting up exactly
>> what was tracked. I realize that I should certainly not have used vague
>> language when discussing user privacy though, and apologize for that!
>>
>> The release wiki page is updated to specify exactly what metadata is
>> stored: country, java version, language, operating system, time zone,
>> update site.
>>
>>  I think the collection of usage data should be made an OPT IN option and
>>>
>> so be set OFF by default.
>>
>> This was a tough decision for us. We certainly don't want anyone to feel
>> uncomfortable using ImageJ. But we are committed to keeping these
>> statistics anonymous, and this is extremely important information for an
>> open source project. This data will certainly be used to strengthen future
>> funding requests to continue the support of Fiji/ImageJ and its
>> collaborations.
>>
>> So we decided to make it opt out to get as realistic a view of usage as we
>> can, while still allowing anyone who is uncomfortable to disable this
>> functionality.
>>
>> Of course, any further discussion or objections to this feature are
>> welcome... I, personally, would like to better understand any specific
>> concerns people may have.
>>
>> Thanks for the feedback,
>> Mark
>>
>> On Sat, Aug 9, 2014 at 12:44 PM, Gabriel Landini <[hidden email]>
>> wrote:
>>
>>  On Friday 08 Aug 2014 16:47:33 you wrote:
>>>
>>>> Today the ImageJ team is proud to announce a new public release
>>>> candidate
>>>> for ImageJ2: version 2.0.0-rc-11.
>>>>
>>>
>>> Great to hear the IJ2 has reached rc11.
>>>
>>> Looking at the New Features page, I think the collection of usage data
>>> should
>>> be made an OPT IN option and so be set OFF by default.
>>>
>>> Users might like to exercise some privacy by not being tracked, specially
>>> about the "etc." mentioned in that page (?!).
>>>
>>> Cheers
>>>
>>> Gabriel
>>>
>>> --
>>> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>>>
>>>
>> --
>> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>>
>>
> --
> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 released

Herbie-4
Good day Mark,

and thanks for your message!

On 11.08.14 18:51, Mark Hiner wrote:
> Hi Herbie,
>
> If user data is gathered/transmitted by IJ-2, I shall, if possible,
> comment out this portion of code. An OFF-option is insufficient.
>
> Can you explain why an OFF option is insufficient..?

It can _easily_ be manipulated.
We are dealing with Open Source and who can prevent someone to compile
and distribute an IJ-2 version where the options are just reversed. This
is one possible scenario.
In fact you can always distribute manipulated IJ-2 versions but the
described method is extremely easy to realize...

> Anyway, like everything in the ImageJ-2 framework, usage uploads come
> from a Service
> <https://github.com/scijava/scijava-common/blob/master/src/main/java/org/scijava/service/Service.java>
> plugin providing the indicated functionality - in this case, the
> UsageUploadService
> <https://github.com/imagej/imagej-usage/blob/master/src/main/java/net/imagej/usage/UsageUploadService.java>.

Are you sure users check the exact sever?
Here in Germany we are presently confronted with sophisticated SPAM
(with attachments) from highjacked official servers (Courts, State
administrations etc.)

> All services can be overridden by creating another implementation with a
> higher priority - for usage statistics, you can use the default
> implementation <
> https://github.com/imagej/imagej-usage/blob/master/src/main/java/net/imagej/usage/DefaultUsageUploadService.java>
> as a guide. You could even distribute your disabled usage service on an
> update site <http://fiji.sc/How_to_set_up_and_populate_an_update_site>,
> to share with other users.

One should do that in general.
An individual code change means the last resort (for me).

> this is an unacceptable reason for gathering user data and I fear
> there is no honest reason at all to do so.
>
> I personally do not write grants, but I am funded by them (along with
> the rest of my coworkers). It is not enough to say "hey we're going to
> make the next generation ImageJ framework... thanks for the money bye!"
> We need the ability to measure and report our successes and failures,
> and use that information to guide future funding and development priorities.

I've been 40 years in science and research and I had never to provide
comparable information and if somebody had asked me to, I had refused
the request or the project.

How comes that Wayne was funded?

Why not use the number of members on the IJ-list?
It gives an estimate about a factor of 2...5 below the real number of
IJ-users.

> Please consider that some level of usage statistics for Fiji have been
> collected for nearly 5 years already <http://imagej.net/Fiji_Usage>.
> These latest changes are just more granular, allowing us to track
> individual plugins and update site use.

This is one of many reasons that I don't use Fiji.

>  >Storing user data is a no no!
>
> Agreed completely. I would not call what we are tracking "user data." We
> are tracking "plugin data", and we are really just tracking plugin use
> counts for a given environment.

Oh well, the kind of computers, software, or whatever I use, is my
private affair.

> So, I don't understand how honesty comes into the picture... perhaps you
> could explain your feelings here? How do you think this data could be
> misused?

I wrote with respect to gathering user data:
"there is no honest reason at all to do so"

The reason for this is that my data (including my computer platform, the
software I use etc.) is private, except I decide to make it available
for someone or a group.

Collecting information from my personal computer isn't honest.
If others do it is no reason to do it too.

> Thanks for the response,
> Mark

Thank you for asking and have a good day

Herbie

::::::::::::::::::::::::::::::::::::::::::::::::::::::::::

> On Mon, Aug 11, 2014 at 11:00 AM, Herbie <[hidden email]
> <mailto:[hidden email]>> wrote:
>
>     Sorry  Mark,
>
>     but
>
>
>      > This data will certainly be used to strengthen future
>      > funding requests to continue the support of Fiji/ImageJ and its
>      > collaborations.
>
>     this is an unacceptable reason for gathering user data and I fear
>     there is no honest reason at all to do so.
>
>      > "keeping these statistics anonymous"
>
>     Well, I'm convinced that you and your co-workers will but...
>
>     Storing user data is a no no!
>     Some members of the ImageJ-2 Team are criticizing Google an other
>     data collecting organizations since long and now...
>
>     If user data is gathered/transmitted by IJ-2, I shall, if possible,
>     comment out this portion of code. An OFF-option is insufficient.
>
>     Just my opinion because you were asking for further comments
>
>     Herbie
>
>     ::::::::::::::::::::::::::::::__::::::
>
>     On 11.08.14 16:11, Mark Hiner wrote:
>
>         Hi Gabriel,
>
>             specially about the "etc." mentioned in that page (?!).
>
>
>         I wrote this section and left it vague at the time because
>         Curtis Rueden
>         was in the process of refining the database schema and setting
>         up exactly
>         what was tracked. I realize that I should certainly not have
>         used vague
>         language when discussing user privacy though, and apologize for
>         that!
>
>         The release wiki page is updated to specify exactly what metadata is
>         stored: country, java version, language, operating system, time
>         zone,
>         update site.
>
>             I think the collection of usage data should be made an OPT
>             IN option and
>
>         so be set OFF by default.
>
>         This was a tough decision for us. We certainly don't want anyone
>         to feel
>         uncomfortable using ImageJ. But we are committed to keeping these
>         statistics anonymous, and this is extremely important
>         information for an
>         open source project. This data will certainly be used to
>         strengthen future
>         funding requests to continue the support of Fiji/ImageJ and its
>         collaborations.
>
>         So we decided to make it opt out to get as realistic a view of
>         usage as we
>         can, while still allowing anyone who is uncomfortable to disable
>         this
>         functionality.
>
>         Of course, any further discussion or objections to this feature are
>         welcome... I, personally, would like to better understand any
>         specific
>         concerns people may have.
>
>         Thanks for the feedback,
>         Mark
>
>         On Sat, Aug 9, 2014 at 12:44 PM, Gabriel Landini
>         <[hidden email] <mailto:[hidden email]>>
>         wrote:
>
>             On Friday 08 Aug 2014 16:47:33 you wrote:
>
>                 Today the ImageJ team is proud to announce a new public
>                 release candidate
>                 for ImageJ2: version 2.0.0-rc-11.
>
>
>             Great to hear the IJ2 has reached rc11.
>
>             Looking at the New Features page, I think the collection of
>             usage data
>             should
>             be made an OPT IN option and so be set OFF by default.
>
>             Users might like to exercise some privacy by not being
>             tracked, specially
>             about the "etc." mentioned in that page (?!).
>
>             Cheers
>
>             Gabriel
>
>             --
>             ImageJ mailing list: http://imagej.nih.gov/ij/list.__html
>             <http://imagej.nih.gov/ij/list.html>
>
>
>         --
>         ImageJ mailing list: http://imagej.nih.gov/ij/list.__html
>         <http://imagej.nih.gov/ij/list.html>
>
>
>     --
>     ImageJ mailing list: http://imagej.nih.gov/ij/list.__html
>     <http://imagej.nih.gov/ij/list.html>
>
>

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 released

Mark Hiner-2
In reply to this post by Gabriel Landini
> SCIFIO was opt in, but usage tracking is opt out? It does not make sense.
>

To be clear, SCIFIO is enabled by default.. you have to uncheck a box to
disable SCIFIO, so it is opt out. We publicized the SCIFIO toggle from
within the ImageJ UI because it can affect the actual operation of ImageJ
by producing different (and/or incorrect) results (and it's a beta product
which is being consumed by a complicated compatibility layer to interact
with ImageJ 1.x classes).

>but logging plugin usage statistics in individual computers (i.e. what you
do with the software?) without user agreeing seems intrusive and over the
top.

I think there is a difference in the questions "what do you do with the
software" and "what do users do with the software". I don't believe we will
ever ask the former question.

With the way our database was designed, literally just storing plugin
execution counts (and not parameters)

we can ask:
 "How many times was Bio-Formats used with Java 7"?

we *can not* ask:
 "how many times did Gabriel Landini run Bio-Formats?"

>Logging updates and downloads is one thing that could be quoted when
applying for funding

This value of this information is questionable. If this was just the core
imagej.jar that's probably sufficient. But we're talking about a
significant number of components that make up Fiji, made potentially
endless via update sites and plugins. Downloads and updates just don't tell
you the whole picture.

Using Bio-Formats as an example:

Bio-Formats 5.0.2 currently ships with Fiji. But I can't say that the
number of downloads of Fiji represent use of Bio-Formats by the scientific
imaging community. I can't say the number of times a Bio-Formats .jar is
updated via the Fiji updater is representative of use either. Those numbers
aren't bad, but they don't really mean anything.

But now, with usage statistics, we can look at the number of times the
Bio-Formats importer was used, or File > Open with SCIFIO and the
Bio-Formats plugin was used. This at least gives us an approximate picture
of how our software is being used.

>If this happens to be something people want to adhere to, then there is
nothing to worry about as there will be lots of users opting in when given
the chance.

I believe this is actually hard to predict. We put in what we thought was
an appropriate system of warnings and explanations for when and how to
disable SCIFIO, for example, and failed miserably - resulting in a
frustration and confusion for users and a wonderful spike in bug reporting.

If usage statistics were presented similarly - with a pop up on launch and
an options menu - my expectations for opt-in numbers would be very low. Not
because people don't want to contribute but because we created a barrier to
the process.

A more successful alternative might be, when statistics are actually being
uploaded, to display a dialog asking to proceed or not - with yes/no/don't
ask me again options. That sounds promising, but also potentially annoying
or confusing to get that pop up, and we can still expect statistics
reporting to drop.

So since we are not sending or storing use-specific data, and provided and
publicized the opt-out mechanism, we decided to go with the option that was
un-disruptive at the workflow level and maximized data collection.
Especially given, as you mentioned, that users ultimately need to agree to
communicate with an external server to download these applications and
updates.

>Good, so please use this as an opportunity to change the way the program
gets user permission to do it.

I hope it's clear that I am not saying we are unwilling to change how
permissions are exposed.. but if we can circumvent that need via discussion
it would certainly be my preference. And if we do end up making any
changes, I would like them to be as minimally damaging to the quality of
the data gathering as possible.

>Given privacy (or the lack of...) is such a hot topic at the moment it
would seem appropriate to do things right.

From what I know of privacy issues, these are primarily concerning the
leaking of things like e-mails, passwords, usernames or credit card
numbers. As we are only storing numeric counts tied to plugin names and the
previously-mentioned information, there is nothing to leak.

To me, there has to be actual user data being exposed to be a matter of
privacy. Can you clarify what you believe to be the concern here?

>Incidentally I also found that there seems to be no way of recording in
the macro language the Privacy Opt Out option. Can this be added too so it
can be included in the StartupMacros file?

Would you mind opening an issue on the imagej-usage github page:
https://github.com/imagej/imagej-usage/issues ?

I hope this helped clarify our thought process and reasoning a bit.

Thanks,
Mark

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 updater

Mark Hiner-2
In reply to this post by Knecht, David
Hi David,

> it seemed to be going around in circles for a long time doing the same
files over and over

 There was a breaking point in the updater some time ago where this was
happening. I think the instructions for fixing it are in comment 10 if this
bug report: http://fiji.sc/bugzilla/show_bug.cgi?id=763#c10

You should be able to resolve this from the command line by reading through
those comments and following the steps. If you're not attached to anything
in your local install, you can also download a fresh version from
http://fiji.sc/Downloads, as the bug was eventually resolved.

Let us know if you have any trouble though!

- Mark


On Mon, Aug 11, 2014 at 10:42 AM, Knecht, David <[hidden email]>
wrote:

> Hi Curtis- I tried to run the update today from within ImageJ2 and got
> into a loop in the updater where it seemed to be going around in circles
> for a long time doing the same files over and over.  I canceled and
> downloaded from the web site instead.  This was on a Mac running latest
> Lion.  Old version was 2.0.0-rc-2.  Dave
>
> On Aug 8, 2014, at 5:47 PM, Curtis Rueden <[hidden email]> wrote:
>
> > Hi everyone,
> >
> > Today the ImageJ team is proud to announce a new public release candidate
> > for ImageJ2: version 2.0.0-rc-11.
> >
> > This release contains several critical bug-fixes as well as new features,
> > including tracking of anonymous usage statistics, as well as Groovy
> > scripting support.
> >
> > For details, see the News post on the ImageJ web site at:
> >    http://imagej.net/2014-08-08_-_ImageJ_2.0.0-rc-11_released
> >
> > Cheers,
> > Curtis
> >
> > --
> > ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>
> Dr. David Knecht
> Professor of Molecular and Cell Biology
> Core Microscopy Facility Director
> University of Connecticut
> Storrs, CT 06269
> 860-486-2200
>
> --
> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 released

Mark Hiner-2
In reply to this post by Herbie-4
>
> It can _easily_ be manipulated.
> We are dealing with Open Source and who can prevent someone to compile and
> distribute an IJ-2 version where the options are just reversed
>

I agree completely but do not see that as an argument against this
functionality. In the scenario you describe, a user needs to choose to
download some malicious code - correct? But there is already the window for
that by nature of having a plugin mechanism in ImageJ.

>Are you sure users check the exact sever?

I'm sorry, I'm not sure what you mean...

>How comes that Wayne was funded?

I know nothing about the details of ImageJ 1.x funding. Info about ImageJ2
funding is on the wiki <http://imagej.net/ImageJ2#Funding>, but I apologize
if I conflated the issue by mentioning our funding. To be clear, this is a
framework that automatically is useful for any plugin developer - not just
the ImageJ2 team. I was using the ImageJ2 grant as an example to illustrate
that funding is a non-trivial area where it is useful to be able to report
usage statistics like those we are gathering... so there was genuine
purpose behind our actions.

>Why not use the number of members on the IJ-list?

Answered this question in my response to Gabriel's last message - numbers
like this aren't really granular enough to be meaningful to plugin
developers. Also, these numbers are important for considering future
development - e.g. if a plugin hasn't been used in over 2 years, should we
really put time and effort into converting it to a new format? IJ
subscriber data just can't inform a decision like that....

>Oh well, the kind of computers, software, or whatever I use, is my private
affair.

If you download an application or plugin from a server, it seems reasonable
to expect that there would be some record of that event: "there exists a
person who downloaded an application from this server." The usage
statistics being uploaded are no more detailed than this, saying "there
exists a person who ran this plugin 10 times using Java 7 on Windows". To
me, that seems less personal than sending an e-mail to this list.

I do believe that the information "Herbie ran plugin X" is your private
affair, as is "A person ran plugin X on dataset_243.tif". But the
information "someone  ran plugin X" does not seem like it's violating
anyone's privacy to me. If you feel differently, please help me to
understand why.

If you will indulge a metaphor, from my point of view, our usage statistics
are like having data that says "A person wore a t-shirt and glasses on
August 11th." That data is meaningless unless you are curious about t-shirt
and glasses annual trends - and even then, you have no way to trace back
the actions of a specific individual.

I believe a violation of privacy would be "Mark Hiner is wearing a t-shirt
and glasses on August 11th" - and we will not transmit or store that data.

Thanks,
Mark


On Mon, Aug 11, 2014 at 1:22 PM, Herbie <[hidden email]> wrote:

> Good day Mark,
>
> and thanks for your message!
>
> On 11.08.14 18:51, Mark Hiner wrote:
>
>> Hi Herbie,
>>
>> If user data is gathered/transmitted by IJ-2, I shall, if possible,
>> comment out this portion of code. An OFF-option is insufficient.
>>
>> Can you explain why an OFF option is insufficient..?
>>
>
> It can _easily_ be manipulated.
> We are dealing with Open Source and who can prevent someone to compile and
> distribute an IJ-2 version where the options are just reversed. This is one
> possible scenario.
> In fact you can always distribute manipulated IJ-2 versions but the
> described method is extremely easy to realize...
>
>  Anyway, like everything in the ImageJ-2 framework, usage uploads come
>> from a Service
>> <https://github.com/scijava/scijava-common/blob/master/
>> src/main/java/org/scijava/service/Service.java>
>>
>> plugin providing the indicated functionality - in this case, the
>> UsageUploadService
>> <https://github.com/imagej/imagej-usage/blob/master/src/
>> main/java/net/imagej/usage/UsageUploadService.java>.
>>
>
> Are you sure users check the exact sever?
> Here in Germany we are presently confronted with sophisticated SPAM (with
> attachments) from highjacked official servers (Courts, State
> administrations etc.)
>
>  All services can be overridden by creating another implementation with a
>> higher priority - for usage statistics, you can use the default
>> implementation <
>> https://github.com/imagej/imagej-usage/blob/master/src/
>> main/java/net/imagej/usage/DefaultUsageUploadService.java>
>>
>> as a guide. You could even distribute your disabled usage service on an
>> update site <http://fiji.sc/How_to_set_up_and_populate_an_update_site>,
>>
>> to share with other users.
>>
>
> One should do that in general.
> An individual code change means the last resort (for me).
>
>  this is an unacceptable reason for gathering user data and I fear
>> there is no honest reason at all to do so.
>>
>> I personally do not write grants, but I am funded by them (along with
>> the rest of my coworkers). It is not enough to say "hey we're going to
>> make the next generation ImageJ framework... thanks for the money bye!"
>> We need the ability to measure and report our successes and failures,
>> and use that information to guide future funding and development
>> priorities.
>>
>
> I've been 40 years in science and research and I had never to provide
> comparable information and if somebody had asked me to, I had refused the
> request or the project.
>
> How comes that Wayne was funded?
>
> Why not use the number of members on the IJ-list?
> It gives an estimate about a factor of 2...5 below the real number of
> IJ-users.
>
>  Please consider that some level of usage statistics for Fiji have been
>> collected for nearly 5 years already <http://imagej.net/Fiji_Usage>.
>>
>> These latest changes are just more granular, allowing us to track
>> individual plugins and update site use.
>>
>
> This is one of many reasons that I don't use Fiji.
>
>   >Storing user data is a no no!
>>
>> Agreed completely. I would not call what we are tracking "user data." We
>> are tracking "plugin data", and we are really just tracking plugin use
>> counts for a given environment.
>>
>
> Oh well, the kind of computers, software, or whatever I use, is my private
> affair.
>
>  So, I don't understand how honesty comes into the picture... perhaps you
>> could explain your feelings here? How do you think this data could be
>> misused?
>>
>
> I wrote with respect to gathering user data:
> "there is no honest reason at all to do so"
>
> The reason for this is that my data (including my computer platform, the
> software I use etc.) is private, except I decide to make it available for
> someone or a group.
>
> Collecting information from my personal computer isn't honest.
> If others do it is no reason to do it too.
>
>  Thanks for the response,
>> Mark
>>
>
> Thank you for asking and have a good day
>
> Herbie
>
> ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
>
>> On Mon, Aug 11, 2014 at 11:00 AM, Herbie <[hidden email]
>> <mailto:[hidden email]>> wrote:
>>
>>     Sorry  Mark,
>>
>>     but
>>
>>
>>      > This data will certainly be used to strengthen future
>>      > funding requests to continue the support of Fiji/ImageJ and its
>>      > collaborations.
>>
>>     this is an unacceptable reason for gathering user data and I fear
>>     there is no honest reason at all to do so.
>>
>>      > "keeping these statistics anonymous"
>>
>>     Well, I'm convinced that you and your co-workers will but...
>>
>>     Storing user data is a no no!
>>     Some members of the ImageJ-2 Team are criticizing Google an other
>>     data collecting organizations since long and now...
>>
>>     If user data is gathered/transmitted by IJ-2, I shall, if possible,
>>     comment out this portion of code. An OFF-option is insufficient.
>>
>>     Just my opinion because you were asking for further comments
>>
>>     Herbie
>>
>>     ::::::::::::::::::::::::::::::__::::::
>>
>>
>>     On 11.08.14 16:11, Mark Hiner wrote:
>>
>>         Hi Gabriel,
>>
>>             specially about the "etc." mentioned in that page (?!).
>>
>>
>>         I wrote this section and left it vague at the time because
>>         Curtis Rueden
>>         was in the process of refining the database schema and setting
>>         up exactly
>>         what was tracked. I realize that I should certainly not have
>>         used vague
>>         language when discussing user privacy though, and apologize for
>>         that!
>>
>>         The release wiki page is updated to specify exactly what metadata
>> is
>>         stored: country, java version, language, operating system, time
>>         zone,
>>         update site.
>>
>>             I think the collection of usage data should be made an OPT
>>             IN option and
>>
>>         so be set OFF by default.
>>
>>         This was a tough decision for us. We certainly don't want anyone
>>         to feel
>>         uncomfortable using ImageJ. But we are committed to keeping these
>>         statistics anonymous, and this is extremely important
>>         information for an
>>         open source project. This data will certainly be used to
>>         strengthen future
>>         funding requests to continue the support of Fiji/ImageJ and its
>>         collaborations.
>>
>>         So we decided to make it opt out to get as realistic a view of
>>         usage as we
>>         can, while still allowing anyone who is uncomfortable to disable
>>         this
>>         functionality.
>>
>>         Of course, any further discussion or objections to this feature
>> are
>>         welcome... I, personally, would like to better understand any
>>         specific
>>         concerns people may have.
>>
>>         Thanks for the feedback,
>>         Mark
>>
>>         On Sat, Aug 9, 2014 at 12:44 PM, Gabriel Landini
>>         <[hidden email] <mailto:[hidden email]>>
>>
>>         wrote:
>>
>>             On Friday 08 Aug 2014 16:47:33 you wrote:
>>
>>                 Today the ImageJ team is proud to announce a new public
>>                 release candidate
>>                 for ImageJ2: version 2.0.0-rc-11.
>>
>>
>>             Great to hear the IJ2 has reached rc11.
>>
>>             Looking at the New Features page, I think the collection of
>>             usage data
>>             should
>>             be made an OPT IN option and so be set OFF by default.
>>
>>             Users might like to exercise some privacy by not being
>>             tracked, specially
>>             about the "etc." mentioned in that page (?!).
>>
>>             Cheers
>>
>>             Gabriel
>>
>>             --
>>             ImageJ mailing list: http://imagej.nih.gov/ij/list.__html
>>             <http://imagej.nih.gov/ij/list.html>
>>
>>
>>         --
>>         ImageJ mailing list: http://imagej.nih.gov/ij/list.__html
>>         <http://imagej.nih.gov/ij/list.html>
>>
>>
>>     --
>>     ImageJ mailing list: http://imagej.nih.gov/ij/list.__html
>>     <http://imagej.nih.gov/ij/list.html>
>>
>>
>>
> --
> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 released

Gabriel Landini
In reply to this post by Mark Hiner-2
On Monday 11 Aug 2014 14:18:11 Mark Hiner wrote:
> > SCIFIO was opt in, but usage tracking is opt out? It does not make sense.
>
> To be clear, SCIFIO is enabled by default.. you have to uncheck a box to
> disable SCIFIO, so it is opt out.

Right, but it was impossible to miss as I had to answer the SCIFIO dialog when
the update came.
What is the problem in showing a similar dialog and let people know what will
be going on?

> I think there is a difference in the questions "what do you do with the
> software" and "what do users do with the software". I don't believe we will
> ever ask the former question.

Mark, what you or me personally *believe* somebody will ask in the future does
not matter. It is the process of getting informed consent on the data
collection; IJ2 is assuming and makes it less obvious than it could be.

> we can ask:
>  "How many times was Bio-Formats used with Java 7"?
> we *can not* ask:
>  "how many times did Gabriel Landini run Bio-Formats?"

Even with my poor knowledge of network traffic I can imagine that it might
trivial to script something using time stamps and ip addresses of the
uploading machine as well as plenty of emails also ip addresses from users.
Not that I remotely think that the devel team would have the time or
inclination to do this, but if we are talking about what is impossible, I
suspect it is not. So whether that is potentially identifiable information is
probably debatable. If there is then you would be effectively logging in a
database their location every hour (!) IJ2 runs. Doesn't that sound a bit
creepy?

My issue was (and remains) that data collection needs to be fully informed
before it takes place, not to be On by default.

> >If this happens to be something people want to adhere to, then there is
> > nothing to worry about as there will be lots of users opting in when given
> > the chance.
 
> I believe this is actually hard to predict.

Ask the users in a similar way SCIFIO was done and you will have the answer.
Then we would not be having this conversation.
 
> If usage statistics were presented similarly - with a pop up on launch and
> an options menu - my expectations for opt-in numbers would be very low. Not
> because people don't want to contribute but because we created a barrier to
> the process.

The issue that does not seem to stick after all this typing is that IJ2 should
not make that decision for the users. IJ2 is not the owner of the processes
happening in a user's computer. You need to ask, not assume, that people will
be happy for their computers to contact a database every hour and letting it
know they are there and doing this or that.

> A more successful alternative might be, when statistics are actually being
> uploaded, to display a dialog asking to proceed or not - with yes/no/don't
> ask me again options. That sounds promising, but also potentially annoying
> or confusing to get that pop up, and we can still expect statistics
> reporting to drop.

But if the reporting statistics drop, that would have to do. Make estimates
instead of collecting all possible data.

> So since we are not sending or storing use-specific data, and provided and
> publicized the opt-out mechanism, we decided to go with the option that was
> un-disruptive at the workflow level and maximized data collection.

Yes, you said that before, and I am sure I am not alone thinking it is not the
desirable way of doing it.

> Especially given, as you mentioned, that users ultimately need to agree to
> communicate with an external server to download these applications and
> updates.

But there is an obvious difference between the two situations. One is
requesting an update. The other is broadcasting to a database.

> I hope it's clear that I am not saying we are unwilling to change how
> permissions are exposed.. but if we can circumvent that need via discussion
> it would certainly be my preference. And if we do end up making any
> changes, I would like them to be as minimally damaging to the quality of
> the data gathering as possible.

I sounds like it is preferable not to ask people about the data collection.
That is in my view an error of judgement that can be resolved easily.

> To me, there has to be actual user data being exposed to be a matter of
> privacy. Can you clarify what you believe to be the concern here?

Sure: that the process of collecting usage data is not made clear from the
beginning and it should have informed consent before the collection starts.

If there are no privacy issues, why is the function to switch it Off called
"Privacy"?

Regards

Gabriel

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 released

Robert Dougherty
In reply to this post by Mark Hiner-2
Mark,
In order to win and retain users of Image2, you have to earn their confidence. Spying on them and arguing with them are not ways to do that. The features of the program have to be designed to meet their needs, not yours. One of the many beauties of ImageJ is the it does not come with strings attached. No cost. No usage restrictions. No bizarre copyleft. And certainly no spying.
Bob

Sent from my iPhone

On Aug 11, 2014, at 4:25 PM, Mark Hiner <[hidden email]> wrote:

>>
>> It can _easily_ be manipulated.
>> We are dealing with Open Source and who can prevent someone to compile and
>> distribute an IJ-2 version where the options are just reversed
>
> I agree completely but do not see that as an argument against this
> functionality. In the scenario you describe, a user needs to choose to
> download some malicious code - correct? But there is already the window for
> that by nature of having a plugin mechanism in ImageJ.
>
>> Are you sure users check the exact sever?
>
> I'm sorry, I'm not sure what you mean...
>
>> How comes that Wayne was funded?
>
> I know nothing about the details of ImageJ 1.x funding. Info about ImageJ2
> funding is on the wiki <http://imagej.net/ImageJ2#Funding>, but I apologize
> if I conflated the issue by mentioning our funding. To be clear, this is a
> framework that automatically is useful for any plugin developer - not just
> the ImageJ2 team. I was using the ImageJ2 grant as an example to illustrate
> that funding is a non-trivial area where it is useful to be able to report
> usage statistics like those we are gathering... so there was genuine
> purpose behind our actions.
>
>> Why not use the number of members on the IJ-list?
>
> Answered this question in my response to Gabriel's last message - numbers
> like this aren't really granular enough to be meaningful to plugin
> developers. Also, these numbers are important for considering future
> development - e.g. if a plugin hasn't been used in over 2 years, should we
> really put time and effort into converting it to a new format? IJ
> subscriber data just can't inform a decision like that....
>
>> Oh well, the kind of computers, software, or whatever I use, is my private
> affair.
>
> If you download an application or plugin from a server, it seems reasonable
> to expect that there would be some record of that event: "there exists a
> person who downloaded an application from this server." The usage
> statistics being uploaded are no more detailed than this, saying "there
> exists a person who ran this plugin 10 times using Java 7 on Windows". To
> me, that seems less personal than sending an e-mail to this list.
>
> I do believe that the information "Herbie ran plugin X" is your private
> affair, as is "A person ran plugin X on dataset_243.tif". But the
> information "someone  ran plugin X" does not seem like it's violating
> anyone's privacy to me. If you feel differently, please help me to
> understand why.
>
> If you will indulge a metaphor, from my point of view, our usage statistics
> are like having data that says "A person wore a t-shirt and glasses on
> August 11th." That data is meaningless unless you are curious about t-shirt
> and glasses annual trends - and even then, you have no way to trace back
> the actions of a specific individual.
>
> I believe a violation of privacy would be "Mark Hiner is wearing a t-shirt
> and glasses on August 11th" - and we will not transmit or store that data.
>
> Thanks,
> Mark
>
>
>> On Mon, Aug 11, 2014 at 1:22 PM, Herbie <[hidden email]> wrote:
>>
>> Good day Mark,
>>
>> and thanks for your message!
>>
>>> On 11.08.14 18:51, Mark Hiner wrote:
>>>
>>> Hi Herbie,
>>>
>>> If user data is gathered/transmitted by IJ-2, I shall, if possible,
>>> comment out this portion of code. An OFF-option is insufficient.
>>>
>>> Can you explain why an OFF option is insufficient..?
>>
>> It can _easily_ be manipulated.
>> We are dealing with Open Source and who can prevent someone to compile and
>> distribute an IJ-2 version where the options are just reversed. This is one
>> possible scenario.
>> In fact you can always distribute manipulated IJ-2 versions but the
>> described method is extremely easy to realize...
>>
>> Anyway, like everything in the ImageJ-2 framework, usage uploads come
>>> from a Service
>>> <https://github.com/scijava/scijava-common/blob/master/
>>> src/main/java/org/scijava/service/Service.java>
>>>
>>> plugin providing the indicated functionality - in this case, the
>>> UsageUploadService
>>> <https://github.com/imagej/imagej-usage/blob/master/src/
>>> main/java/net/imagej/usage/UsageUploadService.java>.
>>
>> Are you sure users check the exact sever?
>> Here in Germany we are presently confronted with sophisticated SPAM (with
>> attachments) from highjacked official servers (Courts, State
>> administrations etc.)
>>
>> All services can be overridden by creating another implementation with a
>>> higher priority - for usage statistics, you can use the default
>>> implementation <
>>> https://github.com/imagej/imagej-usage/blob/master/src/
>>> main/java/net/imagej/usage/DefaultUsageUploadService.java>
>>>
>>> as a guide. You could even distribute your disabled usage service on an
>>> update site <http://fiji.sc/How_to_set_up_and_populate_an_update_site>,
>>>
>>> to share with other users.
>>
>> One should do that in general.
>> An individual code change means the last resort (for me).
>>
>> this is an unacceptable reason for gathering user data and I fear
>>> there is no honest reason at all to do so.
>>>
>>> I personally do not write grants, but I am funded by them (along with
>>> the rest of my coworkers). It is not enough to say "hey we're going to
>>> make the next generation ImageJ framework... thanks for the money bye!"
>>> We need the ability to measure and report our successes and failures,
>>> and use that information to guide future funding and development
>>> priorities.
>>
>> I've been 40 years in science and research and I had never to provide
>> comparable information and if somebody had asked me to, I had refused the
>> request or the project.
>>
>> How comes that Wayne was funded?
>>
>> Why not use the number of members on the IJ-list?
>> It gives an estimate about a factor of 2...5 below the real number of
>> IJ-users.
>>
>> Please consider that some level of usage statistics for Fiji have been
>>> collected for nearly 5 years already <http://imagej.net/Fiji_Usage>.
>>>
>>> These latest changes are just more granular, allowing us to track
>>> individual plugins and update site use.
>>
>> This is one of many reasons that I don't use Fiji.
>>
>>> Storing user data is a no no!
>>>
>>> Agreed completely. I would not call what we are tracking "user data." We
>>> are tracking "plugin data", and we are really just tracking plugin use
>>> counts for a given environment.
>>
>> Oh well, the kind of computers, software, or whatever I use, is my private
>> affair.
>>
>> So, I don't understand how honesty comes into the picture... perhaps you
>>> could explain your feelings here? How do you think this data could be
>>> misused?
>>
>> I wrote with respect to gathering user data:
>> "there is no honest reason at all to do so"
>>
>> The reason for this is that my data (including my computer platform, the
>> software I use etc.) is private, except I decide to make it available for
>> someone or a group.
>>
>> Collecting information from my personal computer isn't honest.
>> If others do it is no reason to do it too.
>>
>> Thanks for the response,
>>> Mark
>>
>> Thank you for asking and have a good day
>>
>> Herbie
>>
>> ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
>>
>>> On Mon, Aug 11, 2014 at 11:00 AM, Herbie <[hidden email]
>>> <mailto:[hidden email]>> wrote:
>>>
>>>    Sorry  Mark,
>>>
>>>    but
>>>
>>>
>>>> This data will certainly be used to strengthen future
>>>> funding requests to continue the support of Fiji/ImageJ and its
>>>> collaborations.
>>>
>>>    this is an unacceptable reason for gathering user data and I fear
>>>    there is no honest reason at all to do so.
>>>
>>>> "keeping these statistics anonymous"
>>>
>>>    Well, I'm convinced that you and your co-workers will but...
>>>
>>>    Storing user data is a no no!
>>>    Some members of the ImageJ-2 Team are criticizing Google an other
>>>    data collecting organizations since long and now...
>>>
>>>    If user data is gathered/transmitted by IJ-2, I shall, if possible,
>>>    comment out this portion of code. An OFF-option is insufficient.
>>>
>>>    Just my opinion because you were asking for further comments
>>>
>>>    Herbie
>>>
>>>    ::::::::::::::::::::::::::::::__::::::
>>>
>>>
>>>    On 11.08.14 16:11, Mark Hiner wrote:
>>>
>>>        Hi Gabriel,
>>>
>>>            specially about the "etc." mentioned in that page (?!).
>>>
>>>
>>>        I wrote this section and left it vague at the time because
>>>        Curtis Rueden
>>>        was in the process of refining the database schema and setting
>>>        up exactly
>>>        what was tracked. I realize that I should certainly not have
>>>        used vague
>>>        language when discussing user privacy though, and apologize for
>>>        that!
>>>
>>>        The release wiki page is updated to specify exactly what metadata
>>> is
>>>        stored: country, java version, language, operating system, time
>>>        zone,
>>>        update site.
>>>
>>>            I think the collection of usage data should be made an OPT
>>>            IN option and
>>>
>>>        so be set OFF by default.
>>>
>>>        This was a tough decision for us. We certainly don't want anyone
>>>        to feel
>>>        uncomfortable using ImageJ. But we are committed to keeping these
>>>        statistics anonymous, and this is extremely important
>>>        information for an
>>>        open source project. This data will certainly be used to
>>>        strengthen future
>>>        funding requests to continue the support of Fiji/ImageJ and its
>>>        collaborations.
>>>
>>>        So we decided to make it opt out to get as realistic a view of
>>>        usage as we
>>>        can, while still allowing anyone who is uncomfortable to disable
>>>        this
>>>        functionality.
>>>
>>>        Of course, any further discussion or objections to this feature
>>> are
>>>        welcome... I, personally, would like to better understand any
>>>        specific
>>>        concerns people may have.
>>>
>>>        Thanks for the feedback,
>>>        Mark
>>>
>>>        On Sat, Aug 9, 2014 at 12:44 PM, Gabriel Landini
>>>        <[hidden email] <mailto:[hidden email]>>
>>>
>>>        wrote:
>>>
>>>            On Friday 08 Aug 2014 16:47:33 you wrote:
>>>
>>>                Today the ImageJ team is proud to announce a new public
>>>                release candidate
>>>                for ImageJ2: version 2.0.0-rc-11.
>>>
>>>
>>>            Great to hear the IJ2 has reached rc11.
>>>
>>>            Looking at the New Features page, I think the collection of
>>>            usage data
>>>            should
>>>            be made an OPT IN option and so be set OFF by default.
>>>
>>>            Users might like to exercise some privacy by not being
>>>            tracked, specially
>>>            about the "etc." mentioned in that page (?!).
>>>
>>>            Cheers
>>>
>>>            Gabriel
>>>
>>>            --
>>>            ImageJ mailing list: http://imagej.nih.gov/ij/list.__html
>>>            <http://imagej.nih.gov/ij/list.html>
>>>
>>>
>>>        --
>>>        ImageJ mailing list: http://imagej.nih.gov/ij/list.__html
>>>        <http://imagej.nih.gov/ij/list.html>
>>>
>>>
>>>    --
>>>    ImageJ mailing list: http://imagej.nih.gov/ij/list.__html
>>>    <http://imagej.nih.gov/ij/list.html>
>> --
>> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>
> --
> ImageJ mailing list: http://imagej.nih.gov/ij/list.html

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Usage data collection (was: Re: ImageJ 2.0.0-rc-11 released)

Saalfeld, Stephan
In reply to this post by Gabriel Landini
I fully support Gabriel and Herbie in their strong rejection of this
sort of data collection and, in particular, the way it aims to dupe the
clueless, introducing it default on.

Nobody questions that this kind of data is extremely interesting and
useful.  I strongly believe that every plugin developer drools over
knowing how much their plugins got used, and when, and where.  This
lust, however, does by now means justify to actually do it.

I am also pretty sure that your intentions about this procedure are
perfectly honorable and that you're not planning any evil.  However,
that does not mean that everybody else does at any time in the future
and the data will be there, exposed to those hacking into your servers.

Please ask yourself whether you would feel comfortable about your
operating system reporting back which applications you've used when,
where and how often.  I would feel spied on.

It would be great if you could consider switching that functionality off
by default and offer interested users the choice to contribute
consciously and voluntarily.  That way, you would consciously make the
decision to gather significantly less data but in the most honorable
way, withstanding the (understandable) desire to get more faster.  In
addition to that, I would ask you to, in spirit of full Open Source and
Open Policy make the collected data (the data, not the derived
statistics) read-accessible to everybody in full and license it under an
Open Data license, e.g.

http://opendatacommons.org/licenses/odbl/

or one of the CCs

http://creativecommons.org/licenses/

Everybody can then test whether the data is truly harmless, but I
actually believe that we may find interesting ways to identify
individuals by their ImageJ usage patterns.

If you would change the usage data collection policy in this spirit, I
would consider switching it on, for a while.

I am very sorry to be so negative in this particular aspect.  I
appreciate a lot the immense amount of work you are spending to make
ImageJ2 a better analysis tool freely available to the community.

Best regards,
Stephan





On Mon, 2014-08-11 at 22:53 +0100, Gabriel Landini wrote:

> On Monday 11 Aug 2014 14:18:11 Mark Hiner wrote:
> > > SCIFIO was opt in, but usage tracking is opt out? It does not make sense.
> >
> > To be clear, SCIFIO is enabled by default.. you have to uncheck a box to
> > disable SCIFIO, so it is opt out.
>
> Right, but it was impossible to miss as I had to answer the SCIFIO dialog when
> the update came.
> What is the problem in showing a similar dialog and let people know what will
> be going on?
>
> > I think there is a difference in the questions "what do you do with the
> > software" and "what do users do with the software". I don't believe we will
> > ever ask the former question.
>
> Mark, what you or me personally *believe* somebody will ask in the future does
> not matter. It is the process of getting informed consent on the data
> collection; IJ2 is assuming and makes it less obvious than it could be.
>
> > we can ask:
> >  "How many times was Bio-Formats used with Java 7"?
> > we *can not* ask:
> >  "how many times did Gabriel Landini run Bio-Formats?"
>
> Even with my poor knowledge of network traffic I can imagine that it might
> trivial to script something using time stamps and ip addresses of the
> uploading machine as well as plenty of emails also ip addresses from users.
> Not that I remotely think that the devel team would have the time or
> inclination to do this, but if we are talking about what is impossible, I
> suspect it is not. So whether that is potentially identifiable information is
> probably debatable. If there is then you would be effectively logging in a
> database their location every hour (!) IJ2 runs. Doesn't that sound a bit
> creepy?
>
> My issue was (and remains) that data collection needs to be fully informed
> before it takes place, not to be On by default.
>
> > >If this happens to be something people want to adhere to, then there is
> > > nothing to worry about as there will be lots of users opting in when given
> > > the chance.
>  
> > I believe this is actually hard to predict.
>
> Ask the users in a similar way SCIFIO was done and you will have the answer.
> Then we would not be having this conversation.
>  
> > If usage statistics were presented similarly - with a pop up on launch and
> > an options menu - my expectations for opt-in numbers would be very low. Not
> > because people don't want to contribute but because we created a barrier to
> > the process.
>
> The issue that does not seem to stick after all this typing is that IJ2 should
> not make that decision for the users. IJ2 is not the owner of the processes
> happening in a user's computer. You need to ask, not assume, that people will
> be happy for their computers to contact a database every hour and letting it
> know they are there and doing this or that.
>
> > A more successful alternative might be, when statistics are actually being
> > uploaded, to display a dialog asking to proceed or not - with yes/no/don't
> > ask me again options. That sounds promising, but also potentially annoying
> > or confusing to get that pop up, and we can still expect statistics
> > reporting to drop.
>
> But if the reporting statistics drop, that would have to do. Make estimates
> instead of collecting all possible data.
>
> > So since we are not sending or storing use-specific data, and provided and
> > publicized the opt-out mechanism, we decided to go with the option that was
> > un-disruptive at the workflow level and maximized data collection.
>
> Yes, you said that before, and I am sure I am not alone thinking it is not the
> desirable way of doing it.
>
> > Especially given, as you mentioned, that users ultimately need to agree to
> > communicate with an external server to download these applications and
> > updates.
>
> But there is an obvious difference between the two situations. One is
> requesting an update. The other is broadcasting to a database.
>
> > I hope it's clear that I am not saying we are unwilling to change how
> > permissions are exposed.. but if we can circumvent that need via discussion
> > it would certainly be my preference. And if we do end up making any
> > changes, I would like them to be as minimally damaging to the quality of
> > the data gathering as possible.
>
> I sounds like it is preferable not to ask people about the data collection.
> That is in my view an error of judgement that can be resolved easily.
>
> > To me, there has to be actual user data being exposed to be a matter of
> > privacy. Can you clarify what you believe to be the concern here?
>
> Sure: that the process of collecting usage data is not made clear from the
> beginning and it should have informed consent before the collection starts.
>
> If there are no privacy issues, why is the function to switch it Off called
> "Privacy"?
>
> Regards
>
> Gabriel
>
> --
> ImageJ mailing list: http://imagej.nih.gov/ij/list.html

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: Usage data collection (was: Re: ImageJ 2.0.0-rc-11 released)

Stephan Preibisch
Hi everyone,

after reading through the email-thread I agree with Herbie, Gabriel and Stephan being myself very skeptical of any kind of data collection that I do not explicitly agree on and was not given the chance to opt out. But I feel that at the same time everyone agrees that this is extremely helpful and many users are not that concerned about useage statistics - right dear gmail users? ;)

Therefore, why not show a Scifio-like dialog in the beginning as suggested by Gabriel. As a privacy-aware person I would enjoy the following options:

a) Please collect my data, I love to help this Open-Source project as much as possible as it has helped me a lot as well.
b) Please do not collect my data, but still let the project know that I opted out (no other data sent).
c) Please do not send any data about me at all.

Note: This choice can be changed under Options > Privacy.
I would even add some hint like: "If you choose (c) you might not want to use the updater but instead use anonymization tools like TOR or JAP to download the current Fiji build from the website."

OK - (do a,b,c)
CANCEL (or default) - c

I highly appreciate as well all the work that went into this project and I hope that this will be a constructive discussion that resolves existing doubts and helps the developers as good as possible at the same time.

Have a great day and thanks to everyone,
Stephan
---

Dr. Stephan Preibisch
HFSP Fellow
Robert H. Singer / Eugene Myers lab

Albert Einstein College of Medicine / HHMI Janelia Farm / MPI-CBG

email: [hidden email] / [hidden email] / [hidden email]
web: http://www.preibisch.net/

On Aug 11, 2014, at 18:54 , Stephan Saalfeld <[hidden email]> wrote:

> I fully support Gabriel and Herbie in their strong rejection of this
> sort of data collection and, in particular, the way it aims to dupe the
> clueless, introducing it default on.
>
> Nobody questions that this kind of data is extremely interesting and
> useful.  I strongly believe that every plugin developer drools over
> knowing how much their plugins got used, and when, and where.  This
> lust, however, does by now means justify to actually do it.
>
> I am also pretty sure that your intentions about this procedure are
> perfectly honorable and that you're not planning any evil.  However,
> that does not mean that everybody else does at any time in the future
> and the data will be there, exposed to those hacking into your servers.
>
> Please ask yourself whether you would feel comfortable about your
> operating system reporting back which applications you've used when,
> where and how often.  I would feel spied on.
>
> It would be great if you could consider switching that functionality off
> by default and offer interested users the choice to contribute
> consciously and voluntarily.  That way, you would consciously make the
> decision to gather significantly less data but in the most honorable
> way, withstanding the (understandable) desire to get more faster.  In
> addition to that, I would ask you to, in spirit of full Open Source and
> Open Policy make the collected data (the data, not the derived
> statistics) read-accessible to everybody in full and license it under an
> Open Data license, e.g.
>
> http://opendatacommons.org/licenses/odbl/
>
> or one of the CCs
>
> http://creativecommons.org/licenses/
>
> Everybody can then test whether the data is truly harmless, but I
> actually believe that we may find interesting ways to identify
> individuals by their ImageJ usage patterns.
>
> If you would change the usage data collection policy in this spirit, I
> would consider switching it on, for a while.
>
> I am very sorry to be so negative in this particular aspect.  I
> appreciate a lot the immense amount of work you are spending to make
> ImageJ2 a better analysis tool freely available to the community.
>
> Best regards,
> Stephan
>
>
>
>
>
> On Mon, 2014-08-11 at 22:53 +0100, Gabriel Landini wrote:
>> On Monday 11 Aug 2014 14:18:11 Mark Hiner wrote:
>>>> SCIFIO was opt in, but usage tracking is opt out? It does not make sense.
>>>
>>> To be clear, SCIFIO is enabled by default.. you have to uncheck a box to
>>> disable SCIFIO, so it is opt out.
>>
>> Right, but it was impossible to miss as I had to answer the SCIFIO dialog when
>> the update came.
>> What is the problem in showing a similar dialog and let people know what will
>> be going on?
>>
>>> I think there is a difference in the questions "what do you do with the
>>> software" and "what do users do with the software". I don't believe we will
>>> ever ask the former question.
>>
>> Mark, what you or me personally *believe* somebody will ask in the future does
>> not matter. It is the process of getting informed consent on the data
>> collection; IJ2 is assuming and makes it less obvious than it could be.
>>
>>> we can ask:
>>> "How many times was Bio-Formats used with Java 7"?
>>> we *can not* ask:
>>> "how many times did Gabriel Landini run Bio-Formats?"
>>
>> Even with my poor knowledge of network traffic I can imagine that it might
>> trivial to script something using time stamps and ip addresses of the
>> uploading machine as well as plenty of emails also ip addresses from users.
>> Not that I remotely think that the devel team would have the time or
>> inclination to do this, but if we are talking about what is impossible, I
>> suspect it is not. So whether that is potentially identifiable information is
>> probably debatable. If there is then you would be effectively logging in a
>> database their location every hour (!) IJ2 runs. Doesn't that sound a bit
>> creepy?
>>
>> My issue was (and remains) that data collection needs to be fully informed
>> before it takes place, not to be On by default.
>>
>>>> If this happens to be something people want to adhere to, then there is
>>>> nothing to worry about as there will be lots of users opting in when given
>>>> the chance.
>>
>>> I believe this is actually hard to predict.
>>
>> Ask the users in a similar way SCIFIO was done and you will have the answer.
>> Then we would not be having this conversation.
>>
>>> If usage statistics were presented similarly - with a pop up on launch and
>>> an options menu - my expectations for opt-in numbers would be very low. Not
>>> because people don't want to contribute but because we created a barrier to
>>> the process.
>>
>> The issue that does not seem to stick after all this typing is that IJ2 should
>> not make that decision for the users. IJ2 is not the owner of the processes
>> happening in a user's computer. You need to ask, not assume, that people will
>> be happy for their computers to contact a database every hour and letting it
>> know they are there and doing this or that.
>>
>>> A more successful alternative might be, when statistics are actually being
>>> uploaded, to display a dialog asking to proceed or not - with yes/no/don't
>>> ask me again options. That sounds promising, but also potentially annoying
>>> or confusing to get that pop up, and we can still expect statistics
>>> reporting to drop.
>>
>> But if the reporting statistics drop, that would have to do. Make estimates
>> instead of collecting all possible data.
>>
>>> So since we are not sending or storing use-specific data, and provided and
>>> publicized the opt-out mechanism, we decided to go with the option that was
>>> un-disruptive at the workflow level and maximized data collection.
>>
>> Yes, you said that before, and I am sure I am not alone thinking it is not the
>> desirable way of doing it.
>>
>>> Especially given, as you mentioned, that users ultimately need to agree to
>>> communicate with an external server to download these applications and
>>> updates.
>>
>> But there is an obvious difference between the two situations. One is
>> requesting an update. The other is broadcasting to a database.
>>
>>> I hope it's clear that I am not saying we are unwilling to change how
>>> permissions are exposed.. but if we can circumvent that need via discussion
>>> it would certainly be my preference. And if we do end up making any
>>> changes, I would like them to be as minimally damaging to the quality of
>>> the data gathering as possible.
>>
>> I sounds like it is preferable not to ask people about the data collection.
>> That is in my view an error of judgement that can be resolved easily.
>>
>>> To me, there has to be actual user data being exposed to be a matter of
>>> privacy. Can you clarify what you believe to be the concern here?
>>
>> Sure: that the process of collecting usage data is not made clear from the
>> beginning and it should have informed consent before the collection starts.
>>
>> If there are no privacy issues, why is the function to switch it Off called
>> "Privacy"?
>>
>> Regards
>>
>> Gabriel
>>
>> --
>> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>
> --
> ImageJ mailing list: http://imagej.nih.gov/ij/list.html


--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: Usage data collection (was: Re: ImageJ 2.0.0-rc-11 released)

Mark Hiner-2
In reply to this post by Saalfeld, Stephan
Stephan:

>   In addition to that, I would ask you to, in spirit of full Open Source
and Open Policy make the collected data (the data, not the derived statistics)
read-accessible to everybody in full and license it under an Open Data
license

This is a fantastic idea and I wish we had come up with it earlier...
issue'd here: https://github.com/imagej/imagej-usage/issues/2

>I am very sorry to be so negative in this particular aspect.

No need to be sorry for being honest. If we release something it's because
we' think it's a good idea, but we are extremely lucky to have this
community already installed and willing to provide feedback.

>Please ask yourself whether you would feel comfortable about your operating
system reporting back which applications you've used when, where and how
often

Because you asked...

I'm starting to feel that after 4 years of open source software development
as a public employee, I may be the wrong person to answer this question.
All of my work is open to the public <http://fiji.sc/User:Hinerm>, my
salary is public record <http://host.madison.com/data/uw_salaries/>, and my
GitHub profile is basically my work schedule <https://github.com/Hinerm>. I
shop on Amazon.com and use google a lot, so I'm fairly certain I've
contributed counts to at least a few database entries, somewhere, and every
software tool I've worked on has been to facilitate the sharing of data
between scientists. So my honest answer is no, I would only feel spied upon
if there was actually personally identifying data that was being stored.

But  of course, those are my personal feelings. I appreciate that others do
not feel that way; I would love it if I could both fully understand those
reservations and put them at ease... but those are not necessary
prerequisites for me to see that something should probably change.

I do appreciate the concerns for the database being hacked, and I hope I
made it clear that we do not put identifying data in the database. So even
if the database was hacked, there should not be a way to abuse this data
(another reason why I think making it open source is a cool idea).

Gabriel:

>What is the problem in showing a similar dialog and let people know what
will
>be going on?

So, this is embarrassing, but I completely forgot about my own commit
<https://github.com/imagej/imagej/commit/49d6bc1f8491a4fe41101f88588895dc851041ad>
describing the usage statistics addition to the ImageJ welcome message that
gets displayed any time a new ImageJ2 is uploaded. I know it's not the same
as a dedicated dialog, but I put that text there because I did want to have
something pop up to inform people of what was going on.

I had mixed feelings about whether the dialog should look like a static
document or a changelog, and it was "easier" to just insert the update at
the time. Now I feel like that was a mistake and it should be a revision
log, with the relevant changes separated out by version with most recent at
the top. I was also thinking the welcome message should be modal,
<https://github.com/scijava/scijava-common/issues/115> to ensure users have
to see it... but I don't know if that's too annoying? Anyway, my hope is
that this would be a general solution to raise the visibility of all the
changes we make. (I have been seriously distressed at how many bug reports
we received where people didn't realize they should disable SCIFIO to
restore some lost behavior)

Gabriel / Preibisch :

You both mention asking for the upload permission at startup because that's
how we did it with SCIFIO, but I just wanted to clarify - would it matter
if permission was requested at upload time instead? I like the idea that
the dialog would be saying
"this is happening right now" instead of asking for an ambiguous pass to
upload in the future.. but I wasn't sure if I was missing something that
makes on startup more desirable? See this issue for permissions refactoring
discussion: https://github.com/imagej/imagej-usage/issues/3

Thanks to all for baring with us/me during this feedback period!

- Mark

P.S. Fun fact: last Friday, my biggest concern for our release was that the
statistics uploading would make quitting too slow... ;)

On Mon, Aug 11, 2014 at 5:54 PM, Stephan Saalfeld <
[hidden email]> wrote:

> I fully support Gabriel and Herbie in their strong rejection of this
> sort of data collection and, in particular, the way it aims to dupe the
> clueless, introducing it default on.
>
> Nobody questions that this kind of data is extremely interesting and
> useful.  I strongly believe that every plugin developer drools over
> knowing how much their plugins got used, and when, and where.  This
> lust, however, does by now means justify to actually do it.
>
> I am also pretty sure that your intentions about this procedure are
> perfectly honorable and that you're not planning any evil.  However,
> that does not mean that everybody else does at any time in the future
> and the data will be there, exposed to those hacking into your servers.
>
> Please ask yourself whether you would feel comfortable about your
> operating system reporting back which applications you've used when,
> where and how often.  I would feel spied on.
>
> It would be great if you could consider switching that functionality off
> by default and offer interested users the choice to contribute
> consciously and voluntarily.  That way, you would consciously make the
> decision to gather significantly less data but in the most honorable
> way, withstanding the (understandable) desire to get more faster.  In
> addition to that, I would ask you to, in spirit of full Open Source and
> Open Policy make the collected data (the data, not the derived
> statistics) read-accessible to everybody in full and license it under an
> Open Data license, e.g.
>
> http://opendatacommons.org/licenses/odbl/
>
> or one of the CCs
>
> http://creativecommons.org/licenses/
>
> Everybody can then test whether the data is truly harmless, but I
> actually believe that we may find interesting ways to identify
> individuals by their ImageJ usage patterns.
>
> If you would change the usage data collection policy in this spirit, I
> would consider switching it on, for a while.
>
> I am very sorry to be so negative in this particular aspect.  I
> appreciate a lot the immense amount of work you are spending to make
> ImageJ2 a better analysis tool freely available to the community.
>
> Best regards,
> Stephan
>
>
>
>
>
> On Mon, 2014-08-11 at 22:53 +0100, Gabriel Landini wrote:
> > On Monday 11 Aug 2014 14:18:11 Mark Hiner wrote:
> > > > SCIFIO was opt in, but usage tracking is opt out? It does not make
> sense.
> > >
> > > To be clear, SCIFIO is enabled by default.. you have to uncheck a box
> to
> > > disable SCIFIO, so it is opt out.
> >
> > Right, but it was impossible to miss as I had to answer the SCIFIO
> dialog when
> > the update came.
> > What is the problem in showing a similar dialog and let people know what
> will
> > be going on?
> >
> > > I think there is a difference in the questions "what do you do with the
> > > software" and "what do users do with the software". I don't believe we
> will
> > > ever ask the former question.
> >
> > Mark, what you or me personally *believe* somebody will ask in the
> future does
> > not matter. It is the process of getting informed consent on the data
> > collection; IJ2 is assuming and makes it less obvious than it could be.
> >
> > > we can ask:
> > >  "How many times was Bio-Formats used with Java 7"?
> > > we *can not* ask:
> > >  "how many times did Gabriel Landini run Bio-Formats?"
> >
> > Even with my poor knowledge of network traffic I can imagine that it
> might
> > trivial to script something using time stamps and ip addresses of the
> > uploading machine as well as plenty of emails also ip addresses from
> users.
> > Not that I remotely think that the devel team would have the time or
> > inclination to do this, but if we are talking about what is impossible, I
> > suspect it is not. So whether that is potentially identifiable
> information is
> > probably debatable. If there is then you would be effectively logging in
> a
> > database their location every hour (!) IJ2 runs. Doesn't that sound a bit
> > creepy?
> >
> > My issue was (and remains) that data collection needs to be fully
> informed
> > before it takes place, not to be On by default.
> >
> > > >If this happens to be something people want to adhere to, then there
> is
> > > > nothing to worry about as there will be lots of users opting in when
> given
> > > > the chance.
> >
> > > I believe this is actually hard to predict.
> >
> > Ask the users in a similar way SCIFIO was done and you will have the
> answer.
> > Then we would not be having this conversation.
> >
> > > If usage statistics were presented similarly - with a pop up on launch
> and
> > > an options menu - my expectations for opt-in numbers would be very
> low. Not
> > > because people don't want to contribute but because we created a
> barrier to
> > > the process.
> >
> > The issue that does not seem to stick after all this typing is that IJ2
> should
> > not make that decision for the users. IJ2 is not the owner of the
> processes
> > happening in a user's computer. You need to ask, not assume, that people
> will
> > be happy for their computers to contact a database every hour and
> letting it
> > know they are there and doing this or that.
> >
> > > A more successful alternative might be, when statistics are actually
> being
> > > uploaded, to display a dialog asking to proceed or not - with
> yes/no/don't
> > > ask me again options. That sounds promising, but also potentially
> annoying
> > > or confusing to get that pop up, and we can still expect statistics
> > > reporting to drop.
> >
> > But if the reporting statistics drop, that would have to do. Make
> estimates
> > instead of collecting all possible data.
> >
> > > So since we are not sending or storing use-specific data, and provided
> and
> > > publicized the opt-out mechanism, we decided to go with the option
> that was
> > > un-disruptive at the workflow level and maximized data collection.
> >
> > Yes, you said that before, and I am sure I am not alone thinking it is
> not the
> > desirable way of doing it.
> >
> > > Especially given, as you mentioned, that users ultimately need to
> agree to
> > > communicate with an external server to download these applications and
> > > updates.
> >
> > But there is an obvious difference between the two situations. One is
> > requesting an update. The other is broadcasting to a database.
> >
> > > I hope it's clear that I am not saying we are unwilling to change how
> > > permissions are exposed.. but if we can circumvent that need via
> discussion
> > > it would certainly be my preference. And if we do end up making any
> > > changes, I would like them to be as minimally damaging to the quality
> of
> > > the data gathering as possible.
> >
> > I sounds like it is preferable not to ask people about the data
> collection.
> > That is in my view an error of judgement that can be resolved easily.
> >
> > > To me, there has to be actual user data being exposed to be a matter of
> > > privacy. Can you clarify what you believe to be the concern here?
> >
> > Sure: that the process of collecting usage data is not made clear from
> the
> > beginning and it should have informed consent before the collection
> starts.
> >
> > If there are no privacy issues, why is the function to switch it Off
> called
> > "Privacy"?
> >
> > Regards
> >
> > Gabriel
> >
> > --
> > ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>
> --
> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: Usage data collection (was: Re: ImageJ 2.0.0-rc-11 released)

Albert Cardona-2
Mark,

At this point your only sensible option is to isolate all user tracking commits into a separate branch, reverting them from master and from the updater.

From the conversation in this mailing list, it is clear to me that the ImageJ 2 team has not thought through the wide implications of user and usage tracking.

The defensive tone of your replies suggests that a reflexive period of cooling, thought, planning and careful public disclosure is in order, or you risk alienating this very vibrant and extremely diverse community.

We appreciate very much the work that the ImageJ 2 team is putting in. Lets keep it at the highest standard.

Best,

Albert
--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 released

gankaku
In reply to this post by Gabriel Landini
Hi Mark, all,

If I also may comment on this topic... I see it similar to Gabriel. If I
would not have payed attention to the release note from Curtis and Gabriels
first mail, I actually would have missed this whole issue. I guess a lot of
users not using the mailing list are thus not informed about the "data"
collection. I see, that there might be some general advantage in the
collection concerning future funding etc., so I would not argument for a
complete removal of the feature. For me it would for example also be
interesting to see how often the tools I created where used!
Anyway, nowadays more of our "private" data are collected and send
somewhere by all our electronic devices (especially Smartphones....). But
one needs to think about tools created in best willingness and without
hidden agenda might be misused simply due to their existence by malicious
subjects, not assuming that this can happen or will be possible in this
specific case.

Therefore, I would also suggest to make the tool opt-in with an initial
popup message after the first run of Fiji/IJ2, like for SCIFIO. This
potentially "interrupts" the first workflow, but I think this is not a big
issue when you see this once in the lifetime of your current Fiji/IJ2
version. The updater actually does the same with the 3 options
"update/remind me later/never" which to me seems a good possibility for
your tool. Then everybody is informed and can make a sound decision for
her-/himself. Obviously, this will result in a drop of N in your statistics
but I guess you will still get sufficient people who contribute to that.
Just out of curiosity, how was the map existing already since long ago (
http://fiji.sc/Fiji_Usage) actually created having no such "collection"
tool (download statistics?)?
Additionally, the list of data collected is only shortly mentioned in
brackets on the release page. Is this a comprehensive list or only
exemplary. It would be nice to show the users which data are collected
exactly.

Performance wise, I had the impression that it really makes Fiji slower
(without being able to put this in numbers). Therefore, minimizing data
transfer (only at the end of a session) and collection should be the goal.
Because if IJ2 is designed for dealing with big data which need way more
calculation power (without exactly knowing how it is able to process higher
amounts faster and if this is affected by the collection) etc., potentially
slowing the performance time down by a process like the data collection
might not be advantageous.

I think this is a topic where you will never find a common consense (not
even inside this very same e-mail ;-) ), but to conclude, informing the
user and presenting options is preferrable in my opinion.

Besides that, thanks for all the effort of your team and trying to make it
performing better every day.

Jan



2014-08-11 23:53 GMT+02:00 Gabriel Landini <[hidden email]>:

> On Monday 11 Aug 2014 14:18:11 Mark Hiner wrote:
> > > SCIFIO was opt in, but usage tracking is opt out? It does not make
> sense.
> >
> > To be clear, SCIFIO is enabled by default.. you have to uncheck a box to
> > disable SCIFIO, so it is opt out.
>
> Right, but it was impossible to miss as I had to answer the SCIFIO dialog
> when
> the update came.
> What is the problem in showing a similar dialog and let people know what
> will
> be going on?
>
> > I think there is a difference in the questions "what do you do with the
> > software" and "what do users do with the software". I don't believe we
> will
> > ever ask the former question.
>
> Mark, what you or me personally *believe* somebody will ask in the future
> does
> not matter. It is the process of getting informed consent on the data
> collection; IJ2 is assuming and makes it less obvious than it could be.
>
> > we can ask:
> >  "How many times was Bio-Formats used with Java 7"?
> > we *can not* ask:
> >  "how many times did Gabriel Landini run Bio-Formats?"
>
> Even with my poor knowledge of network traffic I can imagine that it might
> trivial to script something using time stamps and ip addresses of the
> uploading machine as well as plenty of emails also ip addresses from users.
> Not that I remotely think that the devel team would have the time or
> inclination to do this, but if we are talking about what is impossible, I
> suspect it is not. So whether that is potentially identifiable information
> is
> probably debatable. If there is then you would be effectively logging in a
> database their location every hour (!) IJ2 runs. Doesn't that sound a bit
> creepy?
>
> My issue was (and remains) that data collection needs to be fully informed
> before it takes place, not to be On by default.
>
> > >If this happens to be something people want to adhere to, then there is
> > > nothing to worry about as there will be lots of users opting in when
> given
> > > the chance.
>
> > I believe this is actually hard to predict.
>
> Ask the users in a similar way SCIFIO was done and you will have the
> answer.
> Then we would not be having this conversation.
>
> > If usage statistics were presented similarly - with a pop up on launch
> and
> > an options menu - my expectations for opt-in numbers would be very low.
> Not
> > because people don't want to contribute but because we created a barrier
> to
> > the process.
>
> The issue that does not seem to stick after all this typing is that IJ2
> should
> not make that decision for the users. IJ2 is not the owner of the processes
> happening in a user's computer. You need to ask, not assume, that people
> will
> be happy for their computers to contact a database every hour and letting
> it
> know they are there and doing this or that.
>
> > A more successful alternative might be, when statistics are actually
> being
> > uploaded, to display a dialog asking to proceed or not - with
> yes/no/don't
> > ask me again options. That sounds promising, but also potentially
> annoying
> > or confusing to get that pop up, and we can still expect statistics
> > reporting to drop.
>
> But if the reporting statistics drop, that would have to do. Make estimates
> instead of collecting all possible data.
>
> > So since we are not sending or storing use-specific data, and provided
> and
> > publicized the opt-out mechanism, we decided to go with the option that
> was
> > un-disruptive at the workflow level and maximized data collection.
>
> Yes, you said that before, and I am sure I am not alone thinking it is not
> the
> desirable way of doing it.
>
> > Especially given, as you mentioned, that users ultimately need to agree
> to
> > communicate with an external server to download these applications and
> > updates.
>
> But there is an obvious difference between the two situations. One is
> requesting an update. The other is broadcasting to a database.
>
> > I hope it's clear that I am not saying we are unwilling to change how
> > permissions are exposed.. but if we can circumvent that need via
> discussion
> > it would certainly be my preference. And if we do end up making any
> > changes, I would like them to be as minimally damaging to the quality of
> > the data gathering as possible.
>
> I sounds like it is preferable not to ask people about the data collection.
> That is in my view an error of judgement that can be resolved easily.
>
> > To me, there has to be actual user data being exposed to be a matter of
> > privacy. Can you clarify what you believe to be the concern here?
>
> Sure: that the process of collecting usage data is not made clear from the
> beginning and it should have informed consent before the collection starts.
>
> If there are no privacy issues, why is the function to switch it Off called
> "Privacy"?
>
> Regards
>
> Gabriel
>
> --
> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>



--

CEO: Dr. rer. nat. Jan Brocher
phone:  +49 (0)6234 917 03 39
mobile: +49 (0)176 705 746 81
e-mail: [hidden email]
info: [hidden email]
inquiries: [hidden email]
web: www.biovoxxel.de

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: Usage data collection (was: Re: ImageJ 2.0.0-rc-11 released)

Pavel Tomancak
In reply to this post by Albert Cardona-2
Dear All,

I would like to provide a perspective from someone who is not so concerned about privacy (I use Gdocs, shop on Amazon etc.). In my opinion Mark has put forward some very sensible arguments, there is a big difference between logging "Someone in Dresden just updated Fiji" and "Pavel Tomancak in Dresden just updated Fiji".

I have had these discussions with my computer science friends and colleagues a lot. It is clear that the more knowledge of what is possible to deduce from an IP address you have, the more concerned about privacy you will be. The opt-in suggestion put forward is a good one and I support it.

I would be however strongly opposed to disabling usage tracking altogether. As mentioned before, Fiji does this since the very beginning. While for someone this is one of the reasons not to use Fiji, for  others this is one of the reasons why they do. There are literally thousands of academic open source projects out there, most don't take off the ground simply because people have an attitude that this is just another pet project of XY to get a paper out. I keep using what I am used to no matter how advanced the functionality of the new software might be. Usage tracking in the form of a world map of updates has been instrumental in lifting Fiji above the threshold in the bioimaging community. I have been showing this map all over the world and I am not ashamed of it. It does not violate anyones privacy. Many other project do this kind of usage tracking. And yes, funding is an issue. I also admit I am a bit addicted to this usage map ;-).

Its good we have a discussion about this topic, but I would also encourage readers who, similarly to me, don't share these burning privacy concerns to voice their opinion.

All the best

PAvel
-----------------------------------------------------------------------------------
Pavel Tomancak, Ph.D.

Group Leader
Max Planck Institute of Molecular Cell Biology and Genetics
Pfotenhauerstr. 108
D-01307 Dresden Tel.: +49 351 210 2670
Germany Fax: +49 351 210 2020
email: [hidden email]
homepage: http://www.mpi-cbg.de/research/research-groups/pavel-tomancak.html
twitter: @PavelTomancak
-----------------------------------------------------------------------------------


--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
Reply | Threaded
Open this post in threaded view
|

Re: ImageJ 2.0.0-rc-11 released

Krs5
In reply to this post by gankaku
Dear Mark,

It looks to me that you (or the Fiji team) have opened a whole can of worms. I have no background in law but my feeling is that you have to be very careful with what data you are going to collect, how you are going to store, manage and use this data and how the users are informed about this. It is not just what the Fiji team "thinks" about this, there are laws. Part of your users, including myself, live in the EU and as you will collect data from computers in the EU I think there might be a problem. I did a quick search online and found:

According to a ruling of the Court of Justice of the EU on 24 November 2011 IP addresses are personal data (http://www.mondaq.com/x/162538/Copyright/ECJ+Confirms+That+IP+Addresses+Are+Personal+Data). This has been confirmed by the European Commission on 14 March 2013 (http://www.privacy-europe.com/blog/ip-tracking-is-it-compliant-with-data-protection-laws/). It seems it does not matter what somebody does with the data but what somebody could do with the data.

Then I can imagine that there are also issues if this data is leaving the EU. The Fiji.sc website seems to be registered in Germany but part of the development team is in the US (and I am not sure where the download servers are) but I came across something that the US recipient of the data has to be signed up to the US Department of Commerce Safe Harbor Scheme.

I really cannot tell you if (part of) these rules cover the collection of the data by Fiji but it seems to me that some more thoughts have to go into the legal framework behind this data collection.

Best wishes

Kees


Dr Ir K.R. Straatman
Senior Experimental Officer
Advanced Imaging Facility
Centre for Core Biotechnology Services
University of Leicester
http://www2.le.ac.uk/colleges/medbiopsych/facilities-and-services/cbs/lite/aif



2014-08-11 23:53 GMT+02:00 Gabriel Landini <[hidden email]>:

> On Monday 11 Aug 2014 14:18:11 Mark Hiner wrote:
> > > SCIFIO was opt in, but usage tracking is opt out? It does not make
> sense.
> >
> > To be clear, SCIFIO is enabled by default.. you have to uncheck a
> > box to disable SCIFIO, so it is opt out.
>
> Right, but it was impossible to miss as I had to answer the SCIFIO
> dialog when the update came.
> What is the problem in showing a similar dialog and let people know
> what will be going on?
>
> > I think there is a difference in the questions "what do you do with
> > the software" and "what do users do with the software". I don't
> > believe we
> will
> > ever ask the former question.
>
> Mark, what you or me personally *believe* somebody will ask in the
> future does not matter. It is the process of getting informed consent
> on the data collection; IJ2 is assuming and makes it less obvious than
> it could be.
>
> > we can ask:
> >  "How many times was Bio-Formats used with Java 7"?
> > we *can not* ask:
> >  "how many times did Gabriel Landini run Bio-Formats?"
>
> Even with my poor knowledge of network traffic I can imagine that it
> might trivial to script something using time stamps and ip addresses
> of the uploading machine as well as plenty of emails also ip addresses from users.
> Not that I remotely think that the devel team would have the time or
> inclination to do this, but if we are talking about what is
> impossible, I suspect it is not. So whether that is potentially
> identifiable information is probably debatable. If there is then you
> would be effectively logging in a database their location every hour
> (!) IJ2 runs. Doesn't that sound a bit creepy?
>
> My issue was (and remains) that data collection needs to be fully
> informed before it takes place, not to be On by default.
>
> > >If this happens to be something people want to adhere to, then
> > >there is  nothing to worry about as there will be lots of users
> > >opting in when
> given
> > > the chance.
>
> > I believe this is actually hard to predict.
>
> Ask the users in a similar way SCIFIO was done and you will have the
> answer.
> Then we would not be having this conversation.
>
> > If usage statistics were presented similarly - with a pop up on
> > launch
> and
> > an options menu - my expectations for opt-in numbers would be very low.
> Not
> > because people don't want to contribute but because we created a
> > barrier
> to
> > the process.
>
> The issue that does not seem to stick after all this typing is that
> IJ2 should not make that decision for the users. IJ2 is not the owner
> of the processes happening in a user's computer. You need to ask, not
> assume, that people will be happy for their computers to contact a
> database every hour and letting it know they are there and doing this
> or that.
>
> > A more successful alternative might be, when statistics are actually
> being
> > uploaded, to display a dialog asking to proceed or not - with
> yes/no/don't
> > ask me again options. That sounds promising, but also potentially
> annoying
> > or confusing to get that pop up, and we can still expect statistics
> > reporting to drop.
>
> But if the reporting statistics drop, that would have to do. Make
> estimates instead of collecting all possible data.
>
> > So since we are not sending or storing use-specific data, and
> > provided
> and
> > publicized the opt-out mechanism, we decided to go with the option
> > that
> was
> > un-disruptive at the workflow level and maximized data collection.
>
> Yes, you said that before, and I am sure I am not alone thinking it is
> not the desirable way of doing it.
>
> > Especially given, as you mentioned, that users ultimately need to
> > agree
> to
> > communicate with an external server to download these applications
> > and updates.
>
> But there is an obvious difference between the two situations. One is
> requesting an update. The other is broadcasting to a database.
>
> > I hope it's clear that I am not saying we are unwilling to change
> > how permissions are exposed.. but if we can circumvent that need via
> discussion
> > it would certainly be my preference. And if we do end up making any
> > changes, I would like them to be as minimally damaging to the
> > quality of the data gathering as possible.
>
> I sounds like it is preferable not to ask people about the data collection.
> That is in my view an error of judgement that can be resolved easily.
>
> > To me, there has to be actual user data being exposed to be a matter
> > of privacy. Can you clarify what you believe to be the concern here?
>
> Sure: that the process of collecting usage data is not made clear from
> the beginning and it should have informed consent before the collection starts.
>
> If there are no privacy issues, why is the function to switch it Off
> called "Privacy"?
>
> Regards
>
> Gabriel
>
> --
> ImageJ mailing list: http://imagej.nih.gov/ij/list.html
>



--

CEO: Dr. rer. nat. Jan Brocher
phone:  +49 (0)6234 917 03 39
mobile: +49 (0)176 705 746 81
e-mail: [hidden email]
info: [hidden email]
inquiries: [hidden email]
web: www.biovoxxel.de

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html

--
ImageJ mailing list: http://imagej.nih.gov/ij/list.html
12