Hi!
I can't find documentation for these options and how/when to use them?
Reason I ask is that I am unable to parse a 2.7 G csv file. I let it run for 15 minutes on a 96 GB machine, I got this warning after 2 or 3 minutes, but it never finished:
In [2]: df = pd.read_csv(fname, parse_dates=[1])
/u/paige/maye/Enthought/Canopy_64bit/User/lib/python2.7/site-packages/pandas-0.12.0_1100_g0c30665-py2.7-linux-x86_64.egg/pandas/io/parsers.py:1033: DtypeWarning: Columns (15,18,19) have mixed types. Specify dtype option on import or set low_memory=False.
data = self._reader.read(nrows)
Playing with low_memory=False and memory_map produces core dump crashes unfortunately, that's why I am wondering how to use them properly.
Cheers,
Michael