Hi Seth, thank you for writing ...
Firstly, I'm assuming all your NGS reads comes from chloroplast DNA, right?
If true, I think you can use the GBS-SNP-CROP exactly as suggested on User Manual. I mean, if your raw data comes from chloroplast genomes, when you build the mock reference and then map the high-quality and demultiplexed reads, you are able to call SNPs specific to chloroplast. So, I guess there is no reason to use blastn and any other analysis...
However, if your raw reads are not from chloroplast specific DNA, I think your strategy can works, but instead to blast the mock reference to a related species's complete chloroplast genome, seems to me more interesting you map the high-quality demultiplexed reads (outputs from step 3) and then pull out only reads that mapped. With those reads that matched you can build your mock reference and move forward normally... using your genome version of mock reference, for example.
To answer your question "why use the mock reference genome and not the centroids file", we have saw some SNPs were wrong called due to the mis-alignment of the end of a GBS read across the boundary between two adjacent clusters in the Mock Reference. To minimize such errors, adjacent centroids are now separated from one another by a string of 20 high-quality A’s, an adenine-based boundary found to enhance alignment accuracy. If N clusters are used to build the mock reference, the mock reference will contain 20*(N-1) adenine boundary bases. The coordinates of these bases (found in PosToMask.txt) should be used to mask the final list of putative SNPs in order to avoid calling false SNPs within these artificial boundaries. However, the number of false SNPs are dependent of data, but it isn't so much. You can use the centroid file instead the genome mock reference, but unfortunately you are not able to find those false SNPs.
In order to increase your final number of SNPs called, I encourage you work in a more relaxed way in a SAMTools flags and also in a genotyping criteria on step 7. In addition, you can try to identify your reads that match to chloroplast genome before to build the mock reference.
Please, let me know about your progress.
Best,
Arthur