Hi Taj,
Most files are intermediary files produced by Zorro pipeline. If Zorro finishes without errors or warnings, you can safely ignore/delete them. These files are useful to debug a failed Zorro run. In the next version, I will add a parameter to clean those files after Zorro finishes successfully.
Zorro outputs the final assembly into <prefix>.ZORRO.fasta. To assess the merge, look at this file. If the merge was good, you will find that:
->HybridContigs are contigs that were formed using contigs from both input assemblies and, even if the original input contigs contain repetitive regions, there was enough overlap of unique (non repeat) regions that allowed to merge them unambiguously. HybridContigs are the most confident contigs. If both assemblies are of good quality and they cover, individually, most of the genome and the genome is not ultra-repetitive, HybridContigs will be large and contain most of the assembly bases.
->RepHybridContigs are contigs that were formed using contigs from both input assemblies, but the overlaps between input contigs are in repetitive regions. These contigs are suspicious and could be misassembled.
->Singlets are contigs that appear in only one of the input assemblies. If assembly1 and assembly2 were produced using different sequencing technologies, each will potentially have different coverage bias. Thus, each assembly, could cover parts of the genome that the other assembly missed. They could also be generated by errors in the input assemblies.
The ids of the sequences in <prefix>.ZORRO.fasta can be used to discriminate HybridContigs, RepHybridContigs and Singlets. I usually remove contigs shorter than 300 because, empirically, I have found that many of them are misassembled and will cause problems to scaffolding tools. However, its up to you to decide if you will discard short contigs and the contig length cutoff.
After running Zorro, and before scaffolding, I suggest to run REAPR or PILON in the zorro contigs. These tools will align input reads against the assemblies and look for misassemblies and suspect regions. These regions could be broken, corrected or masked.
After that, you can run your scaffolding tool.
After scaffolding, I suggest to use a gap closure tool (if your scaffolding tool hasn't done it yet). After that, check and correct your assembly with either REAPR or PILON again.
Hope that this message is useful to you and other Zorro users until it is better documented.
Best regards,
Gustavo Gilson Lacerda Costa
Bioinformatician at State University of Campinas, Brazil