Hi all,
This is the first time I am using tassel. I am trying to run the GBSSeqToTagDBPlugin but I keep getting the following error msg:
disk I/O error
java.sql.SQLException: disk I/O error
at org.sqlite.core.NativeDB.throwex(NativeDB.java:397)
at org.sqlite.core.NativeDB._exec(Native Method)
at org.sqlite.jdbc3.JDBC3Statement.executeUpdate(JDBC3Statement.java:116)
at net.maizegenetics.dna.tag.TagDataSQLite.<init>(TagDataSQLite.java:98)
at net.maizegenetics.analysis.gbs.v2.GBSSeqToTagDBPlugin.processData(GBSSeqToTagDBPlugin.java:232)
at net.maizegenetics.plugindef.AbstractPlugin.performFunction(AbstractPlugin.java:110)
at net.maizegenetics.plugindef.AbstractPlugin.dataSetReturned(AbstractPlugin.java:1631)
at net.maizegenetics.plugindef.ThreadedPluginListener.run(ThreadedPluginListener.java:29)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
Start processing batch 1
Enzyme: PstI
Enzyme: PstI
java.lang.ArrayIndexOutOfBoundsException: 0
at net.maizegenetics.analysis.gbs.v2.GBSUtils.initializeBarcodeTrie(GBSUtils.java:159)
at net.maizegenetics.analysis.gbs.v2.GBSSeqToTagDBPlugin.processFastQFile(GBSSeqToTagDBPlugin.java:303)
at net.maizegenetics.analysis.gbs.v2.GBSSeqToTagDBPlugin.lambda$processData$86(GBSSeqToTagDBPlugin.java:243)
at net.maizegenetics.analysis.gbs.v2.GBSSeqToTagDBPlugin$$Lambda$19/231886004.accept(Unknown Source)
at java.util.stream.ForEachOps$ForEachOp$OfRef.accept(ForEachOps.java:184)
at java.util.ArrayList$ArrayListSpliterator.forEachRemaining(ArrayList.java:1374)
at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:512)
at java.util.stream.ForEachOps$ForEachTask.compute(ForEachOps.java:291)
at java.util.concurrent.CountedCompleter.exec(CountedCompleter.java:731)
at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289)
at java.util.concurrent.ForkJoinTask.doInvoke(ForkJoinTask.java:401)
at java.util.concurrent.ForkJoinTask.invoke(ForkJoinTask.java:734)
at java.util.stream.ForEachOps$ForEachOp.evaluateParallel(ForEachOps.java:160)
at java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateParallel(ForEachOps.java:174)
at java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:233)
at java.util.stream.ReferencePipeline.forEach(ReferencePipeline.java:418)
at java.util.stream.ReferencePipeline$Head.forEach(ReferencePipeline.java:583)
at net.maizegenetics.analysis.gbs.v2.GBSSeqToTagDBPlugin.processData(GBSSeqToTagDBPlugin.java:241)
at net.maizegenetics.plugindef.AbstractPlugin.performFunction(AbstractPlugin.java:110)
at net.maizegenetics.plugindef.AbstractPlugin.dataSetReturned(AbstractPlugin.java:1631)
at net.maizegenetics.plugindef.ThreadedPluginListener.run(ThreadedPluginListener.java:29)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
[pool-1-thread-1] INFO net.maizegenetics.plugindef.AbstractPlugin - Finished net.maizegenetics.analysis.gbs.v2.GBSSeqToTagDBPlugin: time: May 23, 2016 11:46:59
[pool-1-thread-1] INFO net.maizegenetics.pipeline.TasselPipeline - net.maizegenetics.analysis.gbs.v2.GBSSeqToTagDBPlugin: time: May 23, 2016 11:46:59: progress: 100%
Does anybody know what all that mean?
I appreciated your help.
Andrea
--
Andrea C. Varella Ph.D.
Montana State University
Plant Sciences and Plant Pathology Department
Linfield Hall - Bozeman MT
Hi Lynn,
I have tassel-5-standalone and tassel 5.0.0 on the university server. I tried to run the GBSSeqToTagDBPlugin with both of them, but I keep getting the same error message. I know I am creating the db file on a disk that I have write permission, so that should not be the problem.
I guess this is is a memory problem then. I will talk to the IT people from the university and see if we can make it work.
Thanks for your help!
Andrea
Hello,I recently ran Tassel Version 5.2.33 (January 12, 2017) on a run composed of 8 Illumina HiSeq 2000 lanes, with a total of 1,412,833,762 reads.1) Allocating Tassel 20 GB of ram (via -Xmx) resulted with the GBSSeqToTagDBPlugin to fail with:[pool-1-thread-1] ERROR net.maizegenetics.plugindef.ThreadedPluginListener - Out of Memory: GBSSeqToTagDBPlugin could not complete task:2) Re-running with 30 GB of ram (via -Xmx) resulted with the GBSSeqToTagDBPlugin to fail with:
java.sql.SQLException: [SQLITE_IOERR] Some kind of disk I/O error occurred (disk I/O error)3) Re-running with 42 GB of ram (via -Xmx) resulted with the GBSSeqToTagDBPlugin to fail with numerous:
tagInsertPS.executeBatch() 100001Followed by:java.sql.SQLException: disk I/O error4) I finally re-ran with -Xmx allocating 200 GB of RAM, and watching `top` showed that the GBSSeqToTagDBPlugin step used approximately 106 GB, which then the plugin completed successfully. (The machine this ran on had 32 cores and 256 GB of RAM).Questions:1) Is there a protocol to determine before I start a run ("back of the envelope calculation") how much RAM I'll need to allocate Tassel (via -Xmx) to complete the GBSSeqToTagDBPlugin step successfully? e.g., based upon the number of reads in the FASTQ files I'm providing?
You can find this info at Tassel 3 manua, maybe you can use it as a reference: