Dear Akanchha,
It's a very valid and common concern, particularly in fields where historical practices may not have always strictly adhered to statistical methodologies.
To determine a statistically sound sample size, you'll need to consider several factors relevant to your research design:
1. Population Size: While often large, knowing the approximate size of the population you're drawing from can be helpful, especially if it's a finite population.
2. Margin of Error (Confidence Interval): This is the acceptable range within which you expect your true population parameter to fall. A smaller margin of error requires a larger sample size.
3. Confidence Level: This indicates how confident you want to be that your sample results accurately reflect the population. Common confidence levels are 90%, 95%, or 99%. A higher confidence level demands a larger sample size.
4. Expected Proportion (or Standard Deviation for continuous data): If your research involves proportions (e.g., success rate of your new method), you'll need an estimate of this proportion. If you don't have a prior estimate, using 0.5 (50%) is often a conservative choice as it maximizes the required sample size. For continuous data, you would need an estimate of the population standard deviation.
5. Type of Data and Analysis: The nature of your data (categorical or continuous) and the statistical tests you plan to use will influence the appropriate formula.
Given your focus on developing a new method for unclear fingerprints, you might be looking at a study involving proportions (e.g., success rate of the new method in clarifying fingerprints).
A common formula for calculating sample size for proportions is:
n = (Z^2 * p * (1-p)) / E^2
Where:
n = required sample size
Z = Z-score (from the standard normal distribution, corresponding to your desired confidence level)
p = estimated proportion of the population
E = desired margin of error
For example, for a 95% confidence level, the Z-score is approximately 1.96. If you expect a 50% success rate (p=0.5) and want a margin of error of 5% (E=0.05), the calculation would be:
n = (1.96^2 * 0.5 * (1-0.5)) / 0.05^2
n = (3.8416 * 0.25) / 0.0025
n = 0.9604 / 0.0025
n = 384.16
So, you would need a sample size of approximately 385.
If your research involves comparing means between groups (e.g., your new method vs. an existing one), different formulas would apply, often incorporating the standard deviation and effect size.
I would strongly recommend consulting with a statistician who can help you define these parameters accurately based on your specific research questions and the nature of fingerprint data. They can guide you in choosing the most appropriate formula and help you justify your sample size to your research committee.
I hope this provides a helpful starting point for addressing the expert's questions.
Best wishes
Neeraj