Hello,
I am new to transcriptomics and am trying to use transabyss for the first time, but running into error with input files (error listed below).
Here is the command I use (with four input files):
transabyss --pe Cyto_92_1M_S7_R1_001.fastq.gz Cyto_92_1M_S7_R2_001.fastq.gz Cyto_92_1P_S8_R1_001.fastq.gz Cyto_92_1P_S8_R2_001.fastq.gz
I get the following error:
Reading `/Users/.../RNAseq/Cyto_92_1M_S7_R1_001.fastq.gz'...
/Users/.../RNAseq/Cyto_92_1M_S7_R1_001.fastq.gz:1: error: Expected either `>' or `@' or 11 fields
and saw `' and 1 fields near
?BC?>ŝ?r?J?l??S??s?#?`l
n??͉}刽??I??Ȓ,~$ٽz}?ڍ?T*\?̙?J??O??z???\=?V??????????}}z}\-_?O???r????????x???:||?סi??ph???9?k??}ծd??ڗ?W??????????V?4???ᚪ??.??G?v`}?q?????j??vS????_????};?5?6WN?+?2???zE_??O·~l??>>??'2b?/???jw8`???5y???%?}=??z?6?z???????\W??n_???b_????Ǐ??z?ǧͶe?j1$??h??H???[y??$\?`'v?hwn??
make: *** [transabyss-1.fa] Error 1
ERROR: CMD ended with status code 2
It seems to me that the program cannot read input files properly. When I open the files the reads look as follows (and have "@" in the beginning):
File 1:
@D00635:134:C95LWANXX:4:2312:1436:1874 1:N:0:CGCTCATT+AGGCGAAG
NGGACATCGGCCAGGAGGCTGGACTGGGGAACACCCTTCCAGAAGTCGGTGGAGGAAGCGGCCTTGGTGGCAACGGTCAGGATGTCGGTGATGGCCTCGATCTTGGGAGCAATGTTGACGAGATCG
+
#=A@BFGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGDGGGGGGGGGGGGGGGGGGGGGGGGGFGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGEGGGGGGGGGGGGGGGGGGGGGGGGG
@D00635:134:C95LWANXX:4:2312:1626:1922 1:N:0:CGCTCATT+AGGCGAAG
NCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCAGA
+
#:3<AEFGGGGGGGGGGFGGGFGGGGGGGGGGGGGGGGGFGGGGGGGGGGGGGGGFGGGGGFGGGBGGGGGEGGGGFEGGGGEGGBGC>BGBF0@>FBE>@BEFC0<<@FE0800@C@F<<../86
@D00635:134:C95LWANXX:4:2312:2489:1830 1:N:0:CGCTCATT+AGGCGAAG
NATCGTTAGTGCCTCGGTCCAAGGGTGCTTGTGGAGGTTCATCAACATATTCTCCTCAAGGGCTGTCTTGCGGTAGTTGATGCCAATGGAGTAGTAGTGTCGGTTCAGGCCATGAATCAGCGCTTG
+
......
File 2:
@D00635:134:C95LWANXX:4:2312:1436:1874 2:N:0:CGCTCATT+AGGCGAAG
CGTCAACATTGCTCCCAAGATCGAGGCCATCACCGACATCCTGACCGTTGCCACCAAGGCCGCTTCCTCCACCGACTTCTGGAAGGGTGTTCCCCAGTCCAGCCTCCTGGCCGATGTCCTAGATCG
+
CCCCCGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGFGGGGGGEGGGGGGGGGGGGGGGGGGGGGGBGGGGGGGGGGG
@D00635:134:C95LWANXX:4:2312:1626:1922 2:N:0:CGCTCATT+AGGCGAAG
CTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTGGCTCTTGCCCTGG
+
CCCCCGGGGGGGGGGGGGFFGGGGGGGGGGD>CFGGGGGGGGG@@FGGGFC@DDC11@GGC@1CFE>1:F:F11=<CD11111D11CB100=0<@00=:0F0008>0<000C0000000000880<
@D00635:134:C95LWANXX:4:2312:2489:1830 2:N:0:CGCTCATT+AGGCGAAG
CCTCAACAAGCCCTCTATTCAAGCGCTGATTCATGGCCTGAACCGACACTACTACTCCATTGGCATCAACTACCGCAAGACAGCCCTTGAGGAGAATATGTTGATGAACCTCCACAAGCACCCTTG
+
......
I went through older messages in the group regarding similar problems with input files. I see in my files there is 1 and 2 indicated in reads names, but not in the format /1, /2 (as was mentioned in older conversations). Can this format be the issue? If so, what should I do about it? Will be grateful for any suggestions.
Thanks,
Olga