Can OpenRefine be used with private data?


luca gilardi

Sep 24, 2019, 10:13:22 AM
to OpenRefine

I would like to know whether my data is stored somewhere on the web when I use OpenRefine, or whether it can also be used with sensitive information. Thanks in advance.

Thad Guidry

Sep 24, 2019, 10:19:13 AM
to openr...@googlegroups.com
Yes, your data stays local on your laptop or PC. That is why so many folks like OpenRefine for working with their private data: it stays private.
Your data CAN BE stored on the web, BUT ONLY if you export it and choose to upload it to your Google Drive or somewhere else.
OpenRefine does not (and never will) automatically store your private data on the web or expose it externally on the internet.




--
You received this message because you are subscribed to the Google Groups "OpenRefine" group.
To unsubscribe from this group and stop receiving emails from it, send an email to openrefine+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/openrefine/5aa82fb2-03c9-40fd-b757-ee99a0bdf3be%40googlegroups.com.

Isao Matsunami

Sep 24, 2019, 2:09:32 PM
to openr...@googlegroups.com
When you use OpenRefine in your browser, YOUR computer acts as (in
fact, IS) the data processing server that you feel you are communicating
with. OpenRefine uses your browser only as a user interface, not as
a tool for communicating over the internet. So you can use OR even when
your PC is offline.
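
To make the "your computer is the server" idea concrete, here is a minimal sketch in Python. It is not OpenRefine's code (OpenRefine itself is a Java application); the handler, the helper names `start_local_server` and `is_loopback`, and the response text are all made up for illustration. It only shows the general pattern: a small web server bound to the loopback address 127.0.0.1 can be reached from the same machine, with no internet connection involved.

```python
import http.client
import http.server
import ipaddress
import threading


class LocalOnlyHandler(http.server.BaseHTTPRequestHandler):
    """Hypothetical handler: answers every GET with a fixed text body."""

    def do_GET(self):
        body = b"served from this machine only\n"
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        # Keep the example quiet; the default logs each request to stderr.
        pass


def start_local_server():
    """Start a server bound to 127.0.0.1 on a free port chosen by the OS."""
    server = http.server.ThreadingHTTPServer(("127.0.0.1", 0), LocalOnlyHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server


def is_loopback(host):
    """True if the address refers to this machine only (e.g. 127.0.0.1)."""
    return ipaddress.ip_address(host).is_loopback


if __name__ == "__main__":
    server = start_local_server()
    host, port = server.server_address

    # Play the role of the browser: talk to the server on the same machine.
    conn = http.client.HTTPConnection(host, port, timeout=5)
    conn.request("GET", "/")
    print(conn.getresponse().read().decode(), end="")
    conn.close()

    print("bound to a loopback address:", is_loopback(host))
    server.shutdown()
    server.server_close()
```

Because the server listens only on a loopback address, the request and the response travel between two programs on the same computer, which is why this kind of setup keeps working with the network unplugged.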


Peter Nõu

Sep 25, 2019, 4:49:17 AM
to openr...@googlegroups.com
This is an important communications challenge for every software/system that uses a browser front end. Increasingly so over time, as this question highlights.

Obvious to ‘IT professionals’, maybe, but far less so to a wider audience.

/ P e t e r N õ u @dkmj

Gustavo Rotondaro

Sep 25, 2019, 5:56:23 PM
to OpenRefine
This is so very true!  Especially for a tool like this that reaches a global audience (literally and professionally).

I may have missed it, but I didn't see anything on the OpenRefine website that explains this in non-technical terms.



Thad Guidry

Sep 25, 2019, 6:01:05 PM
to openr...@googlegroups.com
Gustavo,

We have that info in the first paragraph of the installation instructions. 

I guess we could put a bit of that paragraph into the introduction on the OpenRefine.org website?

Welcome!

OpenRefine (formerly Google Refine) is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.

OpenRefine is a desktop application in that you download it, install it, and run it on your own computer. However, unlike most other desktop applications, it runs as a small web server on your own computer and you point your web browser at that web server in order to use Refine. So, think of Refine as a personal and private web application.  


What do you think of the above?




Ettore RIZZA

Sep 25, 2019, 6:06:25 PM
to openrefine
Maybe someone will have to take the time one day to clean up and rewrite the FAQ. Over time, it has been cluttered with technical details that would be better elsewhere, while some essential information is not included.

Ettore Rizza


Gustavo Rotondaro

Sep 25, 2019, 8:01:33 PM
to OpenRefine
Hey Thad,

I knew it was somewhere but I think highlighting that in the OpenRefine.org website is important.

Ettore also makes a good point about the language.  I, myself, am not an IT person but know enough to be dangerous or know at least who to ask to help me translate the technical language (access that many people may not have).  For many, using the web browser to interact with OpenRefine automatically means they are sending their data over the Internet.

Here are some thoughts for you to consider from a non-technical person:

1. Break it down into a few sections/questions
2. Avoid long paragraphs. Keep the language very simple
3. Address common concerns people may have about their data privacy/security
4. Perhaps a dedicated page on this topic that evolves over time
5. Potential questions:

-> Where is my data stored when I upload it to OpenRefine?
-> Does my data ever leave my computer for any reason?
-> Is OpenRefine secure enough for my data? An opportunity here to clarify that people's own security practices are what matter most. Offer some simple tips: use a strong password, do not share your password because that could allow other people to access your data, scan your system for viruses regularly, enforce a screen lock when away from your computer, etc.
-> Does OpenRefine collect any activity logs from me?

I'm sure there are other essential or technical matters I'm missing, but from a user perspective, I'd like to know these things upfront.

Here is an interesting article about privacy policies that may offer other ideas: https://tinyurl.com/y2p238o9

Just a thought, hope it helps.
-G.


Thad Guidry

Sep 25, 2019, 10:17:59 PM
to openr...@googlegroups.com
Thanks Gustavo !

We're extra busy and could really use community help here!

Would you and Ettore be willing to improve our FAQ with those questions, or better yet, write a Wiki page dedicated to data privacy issues? (You can do it! It's just a Wiki!)

In the meantime, I've just created a pull request to add a small note about data locality and privacy to help users understand more quickly.
 



Peter Nõu

Sep 26, 2019, 6:59:52 AM
to openr...@googlegroups.com
I might be able to help with writing the FAQ. I will look at the existing material and see whether I can make a contribution.

That the application is local is one thing; the key point to make is that the DATA is local, unless you take an explicit step to share it. That should reassure users who are unclear about such architectural distinctions.


/ P e t e r N õ u @dkmj




Thad Guidry

Sep 26, 2019, 8:08:28 AM
to openr...@googlegroups.com
Thanks Peter!

Yeah, our idea is that we can also add a "Click to read more..." link right after the new paragraph on data privacy that we just pushed to the OpenRefine website. Take a look:

OpenRefine always keeps your data private on your own computer until YOU want to share or collaborate. Your private data never leaves your computer unless you want it to. (It works by running a small server on your computer and you use your web browser to interact with it) <LINK TO WIKI PAGE WOULD GO HERE>

I have also filed feature request #2172 to warn users about certain features in OpenRefine that expose parts of their data as part of their operation, and to let them opt in or not.
Folks are welcome to chime in on that feature with comments.



Isao Matsunami

Sep 26, 2019, 8:59:49 AM
to openr...@googlegroups.com
It's great!
Let's celebrate the "non-professionals" who use OpenRefine. They ARE
professionals in other fields.

This lady uses OR to check spelling variations in modern poetry.
https://twitter.com/erinannmcc/status/1172259896890257409
Nerdy guys wouldn't come up with this kind of usage.

Maybe in the future rap singers will add a "rhyme clustering algorithm",
or people with dyslexia may want "non-character clustering", such as
grouping "seven" and "7".


Isao Matsunami