--
You received this message because you are subscribed to the Google Groups "Beancount" group.
To unsubscribe from this group and stop receiving emails from it, send an email to beancount+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/beancount/58ec4e60-530f-4055-990b-03e374251a29n%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/beancount/CACDkcs-w4K8p3a2v_k2sCbhJOvGd4%2Bwm%2B3H8em5fJnS8xO7%2B1w%40mail.gmail.com.
Very interesting, thank you all. I wonder if a single user's journal would suffice for a learning dataset in this case. For me, expenses across categories of interest are those have been stable for years. Plus, I’m willing to deal with false positives (but preferably not false negatives).
There is a kind of machine learning problem called outlier detection. I think sciki-learn library is a good starting point
Excellent, thank you for the helpful pointers! A quick search brought up these, which I’ve noted down to look into when I have time(TM):
https://scikit-learn.org/stable/modules/outlier_detection.html
https://scikit-learn.org/stable/auto_examples/neighbors/plot_lof_outlier_detection.html
--
You received this message because you are subscribed to the Google Groups "Beancount" group.
To unsubscribe from this group and stop receiving emails from it, send an email to beancount+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/beancount/d1cffa4f-4c32-484c-b4f8-c6d4cbf748cen%40googlegroups.com.