I tried to simply concatenate the two fasta files for human and mouse that I got from here:
Mouse
ftp://ftp.ensembl.org/pub/release-90/fasta/mus_musculus/cdna/Mus_musculus.GRCm38.cdna.all.fa.gzHuman
ftp://ftp.ensembl.org/pub/release-90/fasta/homo_sapiens/cdna/Homo_sapiens.GRCh38.cdna.all.fa.gzI extracted both to get the .fa files and then ran:
cat mmusculus.fa hsapiens.fa > combined.fa
Then I ran salmon index as follows:
salmon index -t combined.fa -i combined_index
When I then try to run salmon quant as follows:
#!/bin/bash
for f1 in *_1_sequence.txt.gz
do
f2=${f1%%_1_sequence.txt.gz}"_2_sequence.txt.gz"
salmon quant -i ../cdna/combined_index -l A --gcBias \
-1 $f1 \
-2 $f2 \
-p 36 -o quants_combined/${f1}_quant
done
I get an error telling me that versionInfo.json is missing:
[Error: The index version file ../cdna/combined_index/versionInfo.json doesn't seem to exist. Please try re-building the salmon index.]
And indeed, the file versionInfo.json is missing in the combined_index folder while it is present in the indexes created for the mouse and human transcriptomes separately.
How can I create a combined index for both human and mouse that salmon quant will accept/will be compatible?