modin and read_csv() skiprows

17 views
Skip to first unread message

jshe...@nygenome.org

unread,
Dec 10, 2019, 5:47:09 PM12/10/19
to modin-dev
I'm exploring modin now and I have a question. When I load a CSV file using skiprows modin does not skip the rows. Is this expected and if so should it come with a warning?

import modin.pandas as pd

data
= pd.read_csv(baf, skiprows=2, names = ['CHR', 'POS', 'REF_COUNT', 'ALT_COUNT', 'BAF'], dtype = {'CHR' : str}, sep='\t')

data
.head()
    CHR    POS  REF_COUNT  ALT_COUNT                  BAF
0   CHR    POS  REF_COUNT  ALT_COUNT                  BAF
1  chrFAKE  FAKE1         FAKE          FAKE               0
2  chrFAKE  FAKE2         FAKE         FAKE   0
3  chrFAKE  FAKE3         FAKE         FAKE  0

Thanks

Devin Petersohn

unread,
Dec 10, 2019, 6:21:33 PM12/10/19
to jshe...@nygenome.org, modin-dev
Thanks for posting this!

I was able to reproduce the issue, but only when the `names` is also passed in, this is a bug. If you are able, please report this on the GitHub so it can be tracked and fixed: https://github.com/modin-project/modin/issues

Thanks again for asking!

Devin

--
You received this message because you are subscribed to the Google Groups "modin-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to modin-dev+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/modin-dev/7ab265de-5d96-45c1-838c-4a87a812fe8d%40googlegroups.com.
Reply all
Reply to author
Forward
0 new messages