It is sometimes confusing with these terminology:
Base is the summary statistics, and it should only be used for the effect size estimates and should be independent from the target data
Target data is where we usually perform PRS on. Generally, people will estimate the best p-value threshold from it and just report the R2
However, it will be best to use a separate independent data set call the validation data set to estimate the R2 to avoid overfitting and stuff.
As a result, I am not completely certain what you mean by getting the highest R2 in the base file (as you will still need the genotype to calculate the R2 and stuff)