Hi,
I am working with chunked df and it's time consuming to write code for querying chunked df
for example if I would like to perform:
then I have to write:
values = set()
for df_chunk in df_chunked:
values.union(df.unique())
len(values)
maybe it would be a good idea to create mechanism to perform these functions out of the box?
I have an idea to create object inside of dataframe that would hold logic with looping through all chunks like:
would perform code with loop from code example 2.
I am not sure if it's doable in this way and if it would have value?
Regards,
Łukasz