Hi everyone,
Hope you’re all doing well. I’m currently exploring solutions for processing Arabic financial statements including OCR, parsing, structuring, and data extraction and I was wondering if anyone here knows of any open source repositories, research projects, or internal tools related to this.
If you’ve come across anything useful (libraries, models, datasets, or full pipelines), I’d really appreciate any pointers or recommendations.
Best,
Bushra
--
You received this message because you are subscribed to the Google Groups "SIGARAB: Special Interest Group on Arabic Natural Language Processing" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sigarab+u...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/sigarab/CAH_AL8%2BhUR2O2aZwk4-s0PMXTCFHVM%3DZrvqKad3rEgB2TZAFgQ%40mail.gmail.com.
Assalamu alaikum,
If you want to self-host, you can look at: https://github.com/datalab-to/surya
Because financial statements are predictable and structured already, the OCR should, in theory, be relatively easier to achieve.
All the best.
--
You received this message because you are subscribed to the Google Groups "SIGARAB: Special Interest Group on Arabic Natural Language Processing" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sigarab+u...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/sigarab/CAH_AL8%2BhUR2O2aZwk4-s0PMXTCFHVM%3DZrvqKad3rEgB2TZAFgQ%40mail.gmail.com.
-- Find me at: https://www.kentoseth.com https://fosstodon.org/web/@kentoseth