Hi Yi,
Thank you for reaching out.
For your first question, the total sequenced sample count is 250,018 samples; however, not all samples harbor a mutation. The data_mutations_extended.txt file (MAF) only includes samples that have at least one called mutation, so the number of unique samples in that file will always be lower.
For your second question, the sample count difference could be related to the reason above. The variant difference, by default, cBioPortal filters out Silent, Intron, IGR, 3'UTR, 5'UTR, 3'Flank, and 5'Flank variants, except for TERT promoter mutations. So you might see more alterations in the MAF file than what is displayed on the portal.
Also, when cBioPortal calculates mutation frequency, it uses the subset of samples that were actually profiled for that gene, not the total number of NSCLC samples. That is:
# of samples with a mutation in this gene that are also profiled for mutations: 15,738/# of samples profiled for mutations in this gene: 35,560
Let me know if you have any other questions.
Thanks,
Ritika