On Wed, Oct 19, 2016 at 8:10 PM, Robert Garrison II <
garrison...@gmail.com> wrote:
> I understand that this may be an elementary question, but bear with me.
>
> I have a 16M record dataframe, and I want to perform regressions on 60
> subsets of the dataframe defined by Variable_A. I created a list of the
> subsets (e.g. subsets=list(set(df['Variable_A'])) ), that I want to iterate
> through.
>
> I set up the following regression loop:
>
> for i in subsets:
> i=smf.ols('response ~ C(variable_1) + C(variable_2)
> +...+C(variable_N)',data=df.loc[(df[Variable_A]==str(i)])]).fit()