jaco...@sas.upenn.edu
unread,Apr 15, 2016, 12:43:31 PM4/15/16Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to sqldf
I am trying to use read.csv.sql using Linux Ubuntu 14.04. My raw csv file is larger than RAM, but I do extensive SQL cleaning within the function. When the file is fully cleaned, it is much smaller than RAM (compressed from 20+ GB to ~30 MB). Nonetheless, I still get memory errors when I try to run the command on my machine.
My co-author, running OS X, is able to run the commands with no problem, despite the fact that we have the same amount of underlying RAM.
For reference, here is the command I am trying to run:
read.csv.sql(dataSetName,
sql = " select SYMBOL, DATE, min(TIME) as TIME, PRICE from file where
SUBSTR(Time, 5,1) IN ( '0', '1','2','3','4','5', '6','7','8','9') GROUP BY
SYMBOL, DATE, SUBSTR(TIME,2,4)", dbname = 'sqldb' )