I found your contact information on CMU's site and would like to ask a question about your recently published paper.
I have been referencing your paper "Inferring App Demand from Publicly Available Data" for a project I am working on. I have tested the model successfully (for Drp) on a small sample of paid apps. When I apply the model to free apps, however, the result (for Drf) seems disproportionately large. For free apps, I used the formulas in the Free section of your paper, with theta set below 1, Bg based on total daily free downloads, and Drf = Bf*Rf^(-Af). I must be doing something wrong. Would you have any advice?
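For reference, the relation Drf = Bf*Rf^(-Af) mentioned above can be written out as a small Python sketch. The function name and the numeric values of Bf and Af are hypothetical placeholders chosen for illustration; they are not estimates from the paper.

```python
def downloads_from_rank(rank: float, scale: float, shape: float) -> float:
    """Power-law demand: downloads = scale * rank ** (-shape)."""
    if rank < 1:
        raise ValueError("rank must be >= 1")
    return scale * rank ** (-shape)


# Hypothetical parameter values, for illustration only (not from the paper).
B_f = 50_000.0  # scale: implied downloads of the rank-1 free app
A_f = 0.9       # shape: how quickly downloads fall as rank increases

for r in (1, 10, 100):
    print(r, round(downloads_from_rank(r, B_f, A_f), 1))
```

Since Drf is linear in Bf, any overestimate of Bf carries through to every rank by the same factor, so an implausibly large Drf may simply reflect an inflated Bf.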
I assume you are estimating beta0 and beta1 in a regression on your end, and that you are considering only the free apps that also appear in the grossing app list. I also assume you are taking the log of ranks in base 10 and are therefore using base 10 in the Bf formula.
The effect of theta is relevant, but its value can vary over time, and I am sure it has changed over the last few years (with more emphasis on the in-app purchase option). Theta provides an estimate of the revenue stream from in-app purchases, and you may need to re-estimate theta from the paid app list to correctly estimate Bf using the formula Bf = Bg*exp(Beta0*Ag)/Theta.
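To make the role of theta concrete, here is a minimal Python sketch of the formula Bf = Bg*exp(Beta0*Ag)/Theta as quoted above. The function name and all input values are hypothetical; the only point is that Bf scales with 1/Theta, so a theta below 1 inflates Bf.

```python
import math


def free_scale(b_g: float, a_g: float, beta0: float, theta: float) -> float:
    """Bf = Bg * exp(Beta0 * Ag) / Theta, as written in the message above."""
    if theta <= 0:
        raise ValueError("theta must be positive")
    return b_g * math.exp(beta0 * a_g) / theta


# Hypothetical inputs, for illustration only (not estimates from the paper).
b_g, a_g, beta0 = 10_000.0, 0.8, 0.5

for theta in (1.0, 0.5, 0.25):
    print(theta, round(free_scale(b_g, a_g, beta0, theta), 1))
```

Halving theta doubles Bf, and with it every Drf derived from that Bf, which is why an outdated or poorly estimated theta could plausibly account for a disproportionately large result.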
I have 3 questions/comments after reading your email:
1. I have actually been using exp rather than base 10, as you did in the Bf formula. I initially wrote it with base 10 to match the formula in your paper.
2. Using equation 9, I do not understand how theta comes into play. It is not in equation 9, or in the calculation of Af, Rf or Df. Do I need to add it somehow to equation 9?
3. Just to confirm: am I correct in assuming that, to estimate Bf with equation 9, I should use for Df the total number of downloads of the free apps that are ranked?
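On point 1, the choice between exp and base 10 matters numerically. The Python sketch below only illustrates the general identity 10^x = exp(x*ln 10); the helper name and the exponent value are hypothetical, and which base the paper's formula intends is not settled in this exchange.

```python
import math


def pow10_via_exp(x: float) -> float:
    """Compute 10**x through the natural exponential: exp(x * ln 10)."""
    return math.exp(x * math.log(10))


# Hypothetical exponent, standing in for a product such as Beta0*Ag.
x = 1.5

print(math.exp(x))       # natural-base result
print(10 ** x)           # base-10 result
print(pow10_via_exp(x))  # same as 10**x up to floating-point rounding
```

If the ranks were logged in base 10 during the regression, the back-transformation would need the same base; mixing the two changes the result by a factor that grows with the size of the exponent.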
2. Theta is not part of equation 9. The estimation of theta is on page 19, where the model becomes non-linear. You need to rewrite eqs. 3 and 4 with theta and re-estimate the model parameters.
As for 2, I believe I understand: I would need to re-estimate Beta0 and Beta1 (Beta2 being irrelevant since the price is 0). If so, I will look into this further and email you if I have follow-up questions.