Heya,
I'm using the built-in CSV importer (beancount.ingest.importers.csv)
with bean-extract and, in spite of being documented as bare bone, it
works perfectly fine for my need :)
The only issue I'm facing is that I want to customize the behavior of
beancount.ingest.similar.SimilarityComparator and I didn't find a way to
do so.
(In short, I've a special metadata key, bank-label, which I import from
my CSV files and which I trust as quasi-unique ID for deduplicating
transactions. That key + transaction date would be my ideal
deduplication criteria. SimilarityComparator() is both more strict,
e.g., it requires dates to be relatively near in time, without a way to
pass a different time window; and more lax, e.g., allow amounts to vary
a bit; than what I want.)
Ideally, I'd like to write my own SimilarityComparator and pass it down
to bean-extract via the importer configuration, but the configuration
API doesn't allow to do so ATM. Would such a generalization be welcome
to you, Martin? (as bug report and/or patch)
Cheers
--
Stefano Zacchiroli .
za...@upsilon.cc .
upsilon.cc/zack . . o . . . o . o
Computer Science Professor . CTO Software Heritage . . . . . o . . . o o
Former Debian Project Leader & OSI Board Director . . . o o o . . . o .
« the first rule of tautology club is the first rule of tautology club »