So the problem, is I am switch from SQL and relational understanding,
and I did forget to change my Primary Key back down to what I really need
which is now
Primary Key ((mutation name), anchor_tm, ID
which would be limiting the number of keys
and if I am understanding this properly
Currently, I have the tables of the gene
then am partitioning that by the mutation_name or cds known by people in the field which will be about on average 142
which that number varies
or
would it be better to separate into the table gene then partition on that gene name? as this will limit the number of partitions and move the mutation name as a clustering key?