We understand that that the model was built based on the TOP 200 apps selling on app store. We did a distribution fitting using @Risk according to the formula you derived. The model fits a LogNormal distribution perfectly based on the Anderson-Darling statistic, which is what it is supposed to be, as your model is based on the Pareto distribution. However, this only holds for up to 200 data sets, and when we extend to more than 3000, the model would not fit. As you might know, there are more than 130,000 game apps on the apple store, we wonder to what extent do you think your model can predict the number of downloads according to the ranking?
We endeavoured hard to exploit the publicly available data, but as a matter of fact, they are very limited. By the time we are fitting our distribution for the Android apps, we cannot move forward. As we discovered that you have estimated the Pareto parameter on Android system but no scaling parameter is estimated. We wonder if any information can be provided to us for further investigation (e.g, scaling parameter, or the number of total downloads for each app)?
(4/10/13)