Hello Orca users,
I am currently experimenting with orca for my studies. I would like to train it with the data I have, so I follow the steps writing under train README. It looks like the commands you discuss here are deprecated.
I'm using cooltools version 0.5.4, and in this version `compute-expected` is changed to `expected-cis` and `-trans` commands. Now it doesn't have the `--drop-diags`, but instead `--ignore-diags` option. I tried the new commands with and without `--ignore-diags 0` option and got the exact same file without any `diags` column. This causes `KeyError: 'diag'` error in `process_expectation.py` step as `expected.groupby(["diag"]).agg({"n_valid": "sum", "balanced.sum": "sum"})` line doesn't apply anymore. Is there an updated version of the python script? I'd appreciate your help.
P.S.: What pd.read_csv returns is as follows
1 region1 region2 dist n_valid count.sum balanced.sum count.avg balanced.avg
0 chr1 chr1 0 54285 NaN NaN NaN NaN
1 chr1 chr1 1 53806 NaN NaN NaN NaN
2 chr1 chr1 2 53739 9314132.0 1762.522873 173.321647 0.032798
3 chr1 chr1 3 53699 5232769.0 988.150333 97.446303 0.018402
4 chr1 chr1 4 53673 3614801.0 679.551476 67.348592 0.012661
Best,
Mesut