[Thread-8] ERROR net.maizegenetics.analysis.gbs.FastqToTagCountPlugin - The filename does not contain either 3, 4, or 5 underscore-delimited values.
[Thread-8] ERROR net.maizegenetics.analysis.gbs.FastqToTagCountPlugin - Expect: flowcell_lane_fastq.txt.gz OR flowcell_s_lane_fastq.txt.gz OR code_flowcell_s_lane_fastq.txt.gz
Thread-10] INFO net.maizegenetics.analysis.gbs.FastqToTagCountPlugin - Total barcodes found in lane:96
[Thread-10] ERROR net.maizegenetics.analysis.gbs.FastqToTagCountPlugin - Your system doesn't have enough memory to store the number of sequencesyou specified. Try using a smaller value for the minimum number of reads.
“Or does it just need to match the values in the flowcell column of the key file?”
Yes. You can rename the fastq file to conform. The “lane” part should be a number (the corresponding lane number on the flowcell = 7 for your case).
The “flowcell” part doesn’t HAVE to match anything inside the file. But it wouldn’t hurt if it matched the Illumina ID for the flowcell, which you should be able to see like this:
$ gunzip -c PotatoBeetlePlate1_NoIndex_L007_R1_001.fastq.gz | head -12
The Tassel-GBS pipeline is not set up for paired-end data (which it looks like you have). Try putting the R1 file (PotatoBeetlePlate1_NoIndex_L007_R1_001.fastq.gz) in a folder by itself (and then rename it to <FLOWCELL>_7_fastq.gz.
Best,
Jeff
--
Jeff Glaubitz
Project Manager
Biology of Rare Alleles in Maize and its Wild Relatives
National Science Foundation award IOS-1238014
http://www.panzea.org
Institute for Genomic Diversity
Cornell University
175 Biotechnology Bldg
Ithaca, NY 14853
Phone: 607-255-1386
jcg...@cornell.edu
From: tas...@googlegroups.com [mailto:tas...@googlegroups.com]
On Behalf Of Michael Crossley
Sent: Wednesday, February 18, 2015 12:24 PM
To: tas...@googlegroups.com
Cc: mcros...@gmail.com; Lynn Carol Johnson
Subject: Re: [TASSEL-Group] issue getting started with fastq input
I tried renaming the keyfile and values in the flowcell column of the key file and still get no response from TASSEL.
Is it important for the "flowcell_lane_" in the file name match something specific in the sequence fastq files - or does it just need to match the values in the flowcell column of the key file?
I am really wondering if there is something wrong with my fastq files that is preventing TASSEL from recognizing them in my input directory.Is it normal for them not to appear when selecting the input directory?
On Tuesday, February 17, 2015 at 2:06:36 PM UTC-6, Lynn Carol Johnson wrote:
The problem is the name of your file, and the name of your flow cells in the key file. The flow cell name should NOT have underscores. TASSEL code uses the underscores to identify the flow cell and the lane, then maps to the key file. Can you remove the underscores your flow cell lane name in both your key file and in the file name ? Tassel expects one of the following formats: (you don’t need the “txt”)
flowcell_lane_fastq.txt.gz OR flowcell_s_lane_fastq.txt.gz OR code_flowcell_s_lane_fastq.txt.gz
Thanks - Lynn
From:
"mcros...@gmail.com" <mcros...@gmail.com>
Reply-To: "tas...@googlegroups.com" <tas...@googlegroups.com>
Date: Tuesday, February 17, 2015 at 1:49 PM
To: "tas...@googlegroups.com" <tas...@googlegroups.com>
Cc: "mcros...@gmail.com" <mcros...@gmail.com>, Lynn Carol Johnson <lc...@cornell.edu>
Subject: Re: [TASSEL-Group] issue getting started with fastq input
Here is a snapshot of my keyfile - the only thing I am in doubt of is my "Flowcell" designation - does this need to match a particular aspect of the fastq file?
Thanks for your time & attention,
On Tuesday, February 17, 2015 at 10:18:20 AM UTC-6, Lynn Carol Johnson wrote:
Michael -
You don’t need the .txt extension. I am able to run with a file of this name: D0D7RACXX_1_fastq.gz
What values are you using for Max Good Reads/Min Tag Count? The defaults are these:
I find if I use the default of 300000000 with Min Tag Count of 1, I run out of memory. When I lower the default for Max Good Reads , or increase the MinTag Count value, the operation completes successfully.
Can you show me what
$ ls -Fl ‘GBS/fastq/CPB R1 fastq/’
outputs?
Please paste the output, instead of a screen shot.
Best,
Jeff
You received this message because you are subscribed to the Google Groups "TASSEL - Trait Analysis by Association, Evolution and Linkage" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
tassel+un...@googlegroups.com.
To post to this group, send email to tas...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tassel/dd7fe6c9-8331-40c6-bc31-b3b71bb981eb%40googlegroups.com.
...
I am thinking that the “Hide extensions of known file types” default in Windows is preventing you from seeing the actual name of the file. Perhaps it is “C5TA4ACXX_7_fastq.gz.gz” or something.
Either turn off that option in Windows, or try listing the directory in a DOS window (Google the analogous DOS commands for cd and ls).
Jeff
From: tas...@googlegroups.com [mailto:tas...@googlegroups.com]
On Behalf Of Michael Crossley
Sent: Wednesday, February 18, 2015 3:49 PM
To: tas...@googlegroups.com
Subject: Re: [TASSEL-Group] issue getting started with fastq input
I am working from the Windows GUI, so I don't think I can do that. (The screenshot of the file was taken from an ubuntu terminal, where I am storing a copies of my files for other analyses, but I'm not running TASSEL from there).
Or did you mean for me to run this from the Windows command prompt?
On Wednesday, February 18, 2015 at 2:26:50 PM UTC-6, Jeff Glaubitz wrote:
Can you show me what
$ ls -Fl ‘GBS/fastq/CPB R1 fastq/’
outputs?
Please paste the output, instead of a screen shot.
Best,
Jeff
From:
tas...@googlegroups.com [mailto:tas...@googlegroups.com]
On Behalf Of Michael Crossley
Sent: Wednesday, February 18, 2015 1:32 PM
To: tas...@googlegroups.com
Subject: Re: [TASSEL-Group] issue getting started with fastq input
When I tried fastq to tag count with this file, I got no response.
But when I added .gz to the file name (so that it read C5TA4ACXX_7_fastq.gz) I got the following error message.
But my file should end with _fastq.gz
...