For clarity of viewing my 2 columns in the genome lengths file, I had 3 tab separators between chromosome ID and chromosome length, in my genome file.
When I changed it to just 1 tab separator, things work as expected. Sorry about this mix-up from my end. But thanks for your help.
MEANING OF WARNING?
However, for my closest and fisher runs, that DO run now, I see warnings, as shown below from the STDOUT.
Is there reason to look into and try to fix my input GFF3 files? genometools' gt gff3validator returned with "input is valid gf3" for both these gff3 files.
So I am not sure what these warnings are about. Should I worry or no? Please advice. Thanks!
closestBed -d -a Acoerulea_322_v3_Hel_phmmert_l20_REsorted.gff3 -b Acoerulea_322_v3_Rep_phmmert_l20_REsorted.gff3 -g Acoerulea_322_v3.fa.shIDscleaned-up_IDs_SeqLen
Several output lines
.
.
.
.
***** WARNING: File Acoerulea_322_v3_Rep_phmmert_l20_REsorted.gff3 has inconsistent naming convention for record:
scaffold_8 phmmert_PIF1_PF05970.13 ORF 194959 195993 0 + 0 ID=ORF80_phmmert_PIF1_PF05970.13
.
.
.
More output lines
.
.
.
***** WARNING: File Acoerulea_322_v3_Rep_phmmert_l20_REsorted.gff3 has inconsistent naming convention for record:
scaffold_8 phmmert_PIF1_PF05970.13 ORF 194959 195993 0 + 0 ID=ORF80_phmmert_PIF1_PF05970.13
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bedtools fisher -a Acoerulea_322_v3_Hel_phmmert_l20_REsorted.gff3 -b Acoerulea_322_v3_Rep_phmmert_l20_REsorted.gff3 -g Acoerulea_322_v3.fa.shIDscleaned-up_IDs_SeqLen
***** WARNING: File Acoerulea_322_v3_Hel_phmmert_l20_REsorted.gff3 has inconsistent naming convention for record:
scaffold_8 phmmert_Helitron_like_N_PF14214.5 ORF 192792 193016 6.4e-08 + 2 ID=ORF254_phmmert_Helitron_like_N_PF14214.5
# Number of query intervals: 375
# Number of db intervals: 777
# Number of overlaps: 25
# Number of possible intervals (estimated): 197997
# phyper(25 - 1, 375, 197997 - 375, 777, lower.tail=F)
# Contingency Table Of Counts
#_________________________________________
# | in -b | not in -b |
# in -a | 25 | 350 |
# not in -a | 752 | 196870 |
#_________________________________________
# p-values for fisher's exact test
left right two-tail ratio
1 8.3946e-23 8.3946e-23 18.700
***** WARNING: File Acoerulea_322_v3_Hel_phmmert_l20_REsorted.gff3 has inconsistent naming convention for record:
scaffold_8 phmmert_Helitron_like_N_PF14214.5 ORF 192792 193016 6.4e-08 + 2 ID=ORF254_phmmert_Helitron_like_N_PF14214.5
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Links to the 3 input files are shown below if you wish to replicate the warnings. Thanks a ton!