Hi Matt,
Interestingly, this error doesn't actually indicate a problem with the quality encoding. SolexaQA correctly recognizes the file as a Sanger variant and applies the correct quality scores.
(There is some debate about what comprises a 'new' FASTQ encoding. Some people prefer to describe every little change as a new variant, while others tend to recognize just a few major classes of very closely related variants. From a coding perspective, the Sanger and Illumina 1.8+ formats can be parsed with exactly the same algorithm, so I guess this puts me with the lumpers. Why Illumina even insists on fiddling in minor ways with the standard FASTQ formats, I'll never know...)
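To illustrate the point about the shared algorithm: both Sanger and Illumina 1.8+ store each quality score as a single ASCII character with an offset of 33, so the same decoding step serves both. The sketch below is only an illustration in Python, with a function name of my own choosing. It is not code taken from SolexaQA.

```python
def decode_phred33(quality_string):
    """Convert a FASTQ quality string to Phred scores, assuming an ASCII offset of 33."""
    return [ord(ch) - 33 for ch in quality_string]


# '!' is ASCII 33 (Q0), '5' is 53 (Q20), '?' is 63 (Q30), 'I' is 73 (Q40).
print(decode_phred33("!5?I"))  # [0, 20, 30, 40]
```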
In any case, the error you are seeing is something different. It reflects another annoying habit - fiddling with the read header lines.
Unlike many other QC programs, SolexaQA explicitly determines quality per tile. The price of this is quite high - the program has to determine the tile number for every read. Unfortunately, Illumina and others continually change the header format (although the company deserves considerable credit for mostly sticking with standard formats over the last couple of years).
There are now so many header formats, all very different, that it is practically impossible to parse the tile number from all of them. The philosophy we have taken is for SolexaQA to support all of the major variants (especially those coming off the latest generation of Illumina machines).
In your example, the Sequence Read Archive (SRA) has modified the read headers in their files:
@SRR797058.1 HWI-ST600:227:C0WR4ACXX:7:1101:16297:2000/1
This is where your error comes from - with the SRA accession prepended to the header, SolexaQA can no longer determine the tile number. (This particular error is actually more often caused by an extra empty line at the end of the file, hence the wording of the error message.)
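To make the failure concrete, here is a rough Python sketch of how a strict, pattern-based header parser behaves. The pattern and the function name are my own assumptions for illustration and are not SolexaQA's actual code. It assumes the colon-separated layout instrument:run:flowcell:lane:tile:x:y with an optional /1 or /2 suffix, which is the layout of the original header in your file.

```python
import re

# Hypothetical strict pattern for one header style:
# @instrument:run:flowcell:lane:tile:x:y, optionally followed by /1 or /2.
HEADER = re.compile(r"^@[\w-]+:\d+:\w+:\d+:(\d+):\d+:\d+(?:/[12])?$")


def tile_from_header(header):
    """Return the tile number as an int, or None if the header does not match."""
    match = HEADER.match(header.strip())
    return int(match.group(1)) if match else None


# The original Illumina header matches and yields the tile number.
print(tile_from_header("@HWI-ST600:227:C0WR4ACXX:7:1101:16297:2000/1"))  # 1101

# The SRA-modified header no longer matches the expected layout.
print(tile_from_header("@SRR797058.1 HWI-ST600:227:C0WR4ACXX:7:1101:16297:2000/1"))  # None
```

A looser parser could be written to tolerate the prefix, but every extra header variant it accepts makes it easier to misread a field in some other format.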
The good news is that the solution is simple. Strip out the extra characters added by the SRA (using sed, awk or your preferred alternative), so that the header reads:
@HWI-ST600:227:C0WR4ACXX:7:1101:16297:2000/1
This reverts the file to a standard Illumina header format and SolexaQA runs the file just fine.
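If you would rather not use sed or awk, the same clean-up can be sketched in Python. This is a minimal example under a few assumptions of mine: each record occupies exactly four lines (no wrapped sequences), the SRA prefix looks like an accession followed by a read index and a space (as in your header), and the regular expression and function name are hypothetical. The sequence and quality lines in the demo record are made up.

```python
import re

# Assumed shape of the SRA prefix: '@' or '+', an accession such as SRR797058,
# a dot, the read index, then whitespace. Only the leading '@' or '+' is kept.
SRA_PREFIX = re.compile(r"^([@+])[SED]RR\d+\.\d+\s+")


def strip_sra_prefix(lines):
    """Yield FASTQ lines with the SRA prefix removed from header and '+' lines.

    Assumes four lines per record. Sequence and quality lines are left alone,
    which matters because a quality string may itself begin with '@' or '+'.
    """
    for index, line in enumerate(lines):
        if index % 4 in (0, 2):
            line = SRA_PREFIX.sub(r"\1", line)
        yield line


# Demo record: header taken from the post, sequence and qualities invented.
record = [
    "@SRR797058.1 HWI-ST600:227:C0WR4ACXX:7:1101:16297:2000/1\n",
    "ACGT\n",
    "+\n",
    "IIII\n",
]
print("".join(strip_sra_prefix(record)), end="")
```

To process a real file, pass an open file handle to strip_sra_prefix and write the yielded lines to a new file, then run SolexaQA on the result.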
Best
-Murray