Help with file upload (endlessly "Inspecting selected files")

418 views
Skip to first unread message

A D

unread,
Aug 19, 2014, 11:02:20 AM8/19/14
to openr...@googlegroups.com
Hi,

I hit a wall with trying to upload a .xlsx file. Open Refine (2.5) simply gets stuck at "Inspecting selected files" and then nothing happens, except I have to quit Firefox (31.0) and/or Chrome (36.0) and then also Open Refine. I have uploaded a different .xlsx file from the same source without any problems, however with this one nothing does the trick. I saved it in .csv and uploaded it fine, but was then unable to change character encoding as it doesn't get saved btw xls and csv. I need to have the correct character encoding, though, and so I need to find a way to upload the .xlsx file. I am running Mac OS X 10.6.8

Please, please, please advise about possible solutions, this is work related. Thank you!!

Anuška


PS: not a big techie, just a journo :)

SanjayKumar Rajbhar

unread,
Aug 19, 2014, 11:44:42 AM8/19/14
to openr...@googlegroups.com
download fresh copy of openrefine. this will do the trick. or move  "Open refine"  folder  some where other path. and run it.(that contain set up and all the supporting ) 


--
You received this message because you are subscribed to the Google Groups "OpenRefine" group.
To unsubscribe from this group and stop receiving emails from it, send an email to openrefine+...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.



--
--
Sanjay

Thad Guidry

unread,
Aug 19, 2014, 2:26:14 PM8/19/14
to openr...@googlegroups.com
Adelic,

He is suggesting to download and use OpenRefine beta 2.6, (not 2.5) which contains many bug fixes to help with your problem.

A D

unread,
Aug 19, 2014, 3:54:59 PM8/19/14
to openr...@googlegroups.com
Hi, thank you both.

I was under the impression the 2.6 was still unstable and I should use 2.5?

Tx,
Anuška

Thad Guidry

unread,
Aug 19, 2014, 4:23:08 PM8/19/14
to openr...@googlegroups.com
2.6 MIGHT have a few bugs still, yes...but 2.5 had even more bugs.  As with all software, there are always bugs lurking that have not been found by users yet.


A D

unread,
Aug 20, 2014, 4:21:36 AM8/20/14
to openr...@googlegroups.com
Alright! Thanks, will DL.

Nick Heynen

unread,
Nov 19, 2014, 12:09:21 PM11/19/14
to openr...@googlegroups.com
I had the same problem despite running OpenRefine 2.6-beta.1. I was trying to upload a xlsx spreadsheet (Excel 2007) with 139,882 rows. It got stuck on "Inspecting selected files" with no error message in both Chrome (38.0.2125.111 m) and Firefox (33.1.1). The only way I could get it to work was changing the file to a csv. That worked, but there does seem to be some kind of problem with formatted xlsx files. I'm also just a journo, so not sure if any of that is helpful, but thought I'd share anyway.

Thad Guidry

unread,
Nov 19, 2014, 2:54:47 PM11/19/14
to openrefine
Apache POI , which we use for extracting the XLSX files...has been updated...in particular, there are 4 known issues with XLSX handling that have been patched since POI 3.8 FINAL, the version we use for dealing iwth XLSX files.

We might want to drop in the new jar file for POI.

Until then Nick, if you want, you might also try getting the new POI 3.11 beta jars and replacing the 3.8 jars we have under \OpenRefine\main\webapp\WEB-INF\lib
and see if that helps and let us know.

Also, can you open a Github issue for us to track this ?  and possibly upload your XLSX file that had problems to that issue, if your file is not private ?


On Wed, Nov 19, 2014 at 11:09 AM, Nick Heynen <nhe...@madison.com> wrote:
I had the same problem despite running OpenRefine 2.6-beta.1. I was trying to upload a xlsx spreadsheet (Excel 2007) with 139,882 rows. It got stuck on "Inspecting selected files" with no error message in both Chrome (38.0.2125.111 m) and Firefox (33.1.1). The only way I could get it to work was changing the file to a csv. That worked, but there does seem to be some kind of problem with formatted xlsx files. I'm also just a journo, so not sure if any of that is helpful, but thought I'd share anyway.

--

Nick Heynen

unread,
Nov 19, 2014, 4:55:46 PM11/19/14
to openr...@googlegroups.com
I've never opened a Github issue, so I probably am not the right person to get that going. Here's the xlsx file that causes the problem: https://drive.google.com/file/d/0B-Or4A-7hQ80ZGgwbHJKSUpnckU/view?usp=sharing

Can you provide more detailed instructions for updating the POI? I'm new to OpenRefine and I'm afraid I still need things spelled out pretty explicitly. Thanks for the suggestion.

Thad Guidry

unread,
Nov 24, 2014, 2:45:08 PM11/24/14
to openrefine
Nick,

It was not necessary to update POI to the latest version.  The fix is that you need to give more memory for OpenRefine to work with your file.

I was able to launch OpenRefine at the Windows 7 command line with:

  C:\Users\eguitha\git\OpenRefine>refine /m 4096m

which gives OpenRefine 4096 megabytes (4 gigs) of memory to work with.  I am using a 64bit Java JDK installation on my Windows 7 laptop.

It took about 20 seconds for "Inspecting selected files" to finish and was able to see the rows/column in our Preview grid.
I then continued to import the file, and then exported it as an OpenRefine project file (automatically gzipped) for your use.

I have attached the project file via Google Drive which you should be able to import into OpenRefine using "Import Project" :

On Wed, Nov 19, 2014 at 3:55 PM, Nick Heynen <nhe...@madison.com> wrote:
I've never opened a Github issue, so I probably am not the right person to get that going. Here's the xlsx file that causes the problem: https://drive.google.com/file/d/0B-Or4A-7hQ80ZGgwbHJKSUpnckU/view?usp=sharing

Can you provide more detailed instructions for updating the POI? I'm new to OpenRefine and I'm afraid I still need things spelled out pretty explicitly. Thanks for the suggestion.

--

Nick Heynen

unread,
Dec 4, 2014, 1:21:15 PM12/4/14
to openr...@googlegroups.com
Tried this with up to 8 gigs of memory and no dice. Maybe I'm doing it wrong at the command line? Does this look wrong? 
C:\Users\mniheynn\OpenRefine\openrefine /m 8192m

Nick Heynen

unread,
Dec 4, 2014, 1:23:38 PM12/4/14
to openr...@googlegroups.com
By "no dice" I mean I keep getting the same, endless inspecting files response noted above.

Nick Heynen

unread,
Dec 4, 2014, 1:26:48 PM12/4/14
to openr...@googlegroups.com
Here's my system profile, if it helps: Win 7 SP 1, 64 bit OS, Intel i7-3770 CPU 3.4GHz 3.4GHz, 16GB RAM.

Thad Guidry

unread,
Dec 4, 2014, 10:35:39 PM12/4/14
to openrefine
Nick,

Try to use the ./refine.bat or ./refine.sh command ... rather than the /openrefine.exe

Just look under the openrefine folder and you should see the commands as well.
You can even hack on the refine.bat file with a text editor to change to your liking for defaults.

On Thu, Dec 4, 2014 at 12:26 PM, Nick Heynen <nhe...@madison.com> wrote:
Here's my system profile, if it helps: Win 7 SP 1, 64 bit OS, Intel i7-3770 CPU 3.4GHz 3.4GHz, 16GB RAM.

--
Reply all
Reply to author
Forward
0 new messages