Hello there.
I have a table with around 1000 columns of type Nullable(Float64). Most of them are NULL for most of the rows. Whether a column contains only NULLs or actual values depends on the column sensor_type in the same table. I partition by YYYYMM and by sensor_type, so within any given part most of the columns contain only NULL values. Still, the files for these columns are (altogether) very large. I've experimented a little with different codecs, but unfortunately they don't save much. If I could remove the "all-NULL" column files from the parts, the size of this table on disk would drop from 500 GB to 6 GB.
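In case it helps to reproduce the measurement, here is a minimal sketch of how the per-column footprint can be listed from the system.columns table. The database name "default" and the table name "sensor_data" are placeholders, not my real names, and the helper function is only for illustration; it just builds the query text, which can then be run in clickhouse-client.

```python
def column_size_query(database: str, table: str) -> str:
    """Build a query listing the on-disk size of every column of one table.

    Uses the data_compressed_bytes, data_uncompressed_bytes and marks_bytes
    columns of system.columns. The database and table names passed in are
    placeholders chosen by the caller.
    """
    return (
        "SELECT name, data_compressed_bytes, data_uncompressed_bytes, marks_bytes "
        "FROM system.columns "
        f"WHERE database = '{database}' AND table = '{table}' "
        "ORDER BY data_compressed_bytes DESC"
    )


# Hypothetical names, for illustration only.
print(column_size_query("default", "sensor_data"))
```

Summing the byte counts over all columns and comparing that with the total for the columns that actually hold values gives the kind of before/after estimate quoted above.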
Is there a way to prevent ClickHouse from writing "all-NULL" or "all-zero" columns into a part, in order to reduce storage costs and replication latency? If not, would you consider it as a future improvement?
Best,
Maxim