Error occur during group_significance.py using rarefied BIOM data

17 views

Skip to first unread message

Yoko Nagai

unread,

Jan 10, 2018, 1:28:41 AM1/10/18

to Qiime 1 Forum

Hello,

I got an error using QIIME 1.9.1 group_significance.py for rarefied BIOM data, whether original BIOM data went without any problem or error.

I used unziped BIOM data that is created by core_diversity_analsyses.py

Command:

group_significance.py -i unzip_table_even5000_from_core_diversity_analysis.biom -m map.txt -c Group -s mann_whitney_u -o out.txt

Error:

Traceback (most recent call last):

File "/usr/local/bin/group_significance.py", line 344, in <module>

main()

File "/usr/local/bin/group_significance.py", line 326, in main

GROUP_TEST_CHOICES, int(opts.permutations))

File "/usr/local/lib/python2.7/dist-packages/qiime/otu_significance.py", line 147, in run_group_significance_test

test_stat, pval = test_choices[test](row[0], row[1])

File "/usr/local/lib/python2.7/dist-packages/qiime/stats.py", line 1882, in mw_t

u, pval = mannwhitneyu(x, y, continuity)

File "/usr/local/lib/python2.7/dist-packages/scipy/stats/stats.py", line 4092, in mannwhitneyu

raise ValueError('All numbers are identical in amannwhitneyu')

ValueError: All numbers are identical in amannwhitneyu

Do I need to re-create rarefied dataset?

Colin Brislawn

unread,

Jan 15, 2018, 12:22:37 AM1/15/18

to Qiime 1 Forum

Hello Yoko,

Thank you for posting the full command and error.

Based on this error "All numbers are identical in amannwhitneyu" it looks like the stat test cannot be performed because all the numbers are identical!

The group_significance.py script should work just fine on normalized data, but the results can be strange and make errors if the rarefaction depth is too low or if you have too few samples. Also, rarefaction removes samples under your rarefaction level (so samples under 5000 reads would be dropped), and this is could also make you have two few samples.

How many samples are in that even5000 table? Does this script work if you rarify to 1000 reads per sample (instead of 5000)?

Thanks,

Colin

Reply all

Reply to author

Forward

0 new messages