Hi,
I'm new to sequenza and try to evaluate the tool.
Unfortunately I have troubles to use sequenza.extract on a windowed seqz file. The error is:
test2 <- sequenza.extract("xxxxxx_small.seqz.gz")
Processing chr1: Error in scan(file = file, what = what, sep = sep, quote = quote, dec = dec, :
scan() expected 'an integer', got '15.3921568627'
the first lines in xxxxxx_small.seqz.gz are:
chromosome position base.ref depth.normal depth.tumor depth.ratio Af Bf zygosity.normal GC.percent good.reads AB.normal AB.tumor tumor strand
chr1 12928 N 15.3921568627 17.0 1.084 1.0 0 hom 63 51 N . 0
chr1 12947 C 13 14 1.077 0.929 0 hom 63 14 C A0.071 A1.0
xxxxxx_small.seqz.gz was created using
sequenza-utils bam2seqz [...] --chromosome chr1 > xxxxxx
sequenza-utils seqz_binning -w 50 -s xxxxxx | pigz -p 12 > xxxxxx_small.seqz.gz
Is there anything I'm doing wrong?
I'm using sequenza-utils downloaded from git on May 5th 2017.
I should note that I had to apply the following change to wig.py in order to be able to run bam2seqz:
--- wig.py 2017-04-27 10:48:48.000000000 +0200
+++ /usr/local/bioinf/python/python-2.7.13/lib/python2.7/site-packages/sequenza_utils-2.2.0-py2.7-linux-x86_64.egg/sequenza/wig.py 2017-05-08 10:12:12.000000000 +0200
@@ -27,7 +27,7 @@
self._end = self._start + self._span
return ((self._chromosome, self._start, self._end), (self._val,))
except ValueError:
- line, chrom, span = wig_line.strip().split()
+ line, chrom, span = wig_line.strip().split(None, 2)
self._chromosome = chrom.split('chrom=')[1]
self._span = int(span.split('span=')[1])
pos, self._val = next(self._wig).strip().split('\t', 1)
It seems that the wig header line created from my bams has more than 3 fields.
Thanks
Dietmar