I've been attempting to pull in a sizable data set in with the drRead.table set of commands. This data set is still small enough to pull into the local session with the regular read.table API and then I can convert the collection to a ddf, however, attempting to load the data with drRead.table using the same parameters I pass to read.table results in column class errors. The commands I'm comparing in particular are:
lab.data <- drRead.table("/path/to/file", output="/path/to/results", sep="|", header=TRUE, stringsAsFactors=FALSE) # this fails with: scan() expected a 'real' got 'E.U./DL'
lab.data <- read.table("/path/to/file", output="/path/to/results", sep="|", header=TRUE, stringsAsFactors=FALSE) # this succeeds, and I create a ddf using lab.ddf <- ddf(lab.data)
I should also note that for other identically structured data sets, the errors will include things like scan() expected 'an integer', got '0.4'.
Any thoughts on this apparent inconsistency in the API?
Regards,
Alek