Arabic financial statements document processing


Bushra Asseri

Nov 19, 2025, 12:14:10 AM
to sig...@googlegroups.com

Hi everyone,


Hope you’re all doing well. I’m currently exploring solutions for processing Arabic financial statements, including OCR, parsing, structuring, and data extraction, and I was wondering whether anyone here knows of any open-source repositories, research projects, or internal tools related to this.

If you’ve come across anything useful (libraries, models, datasets, or full pipelines), I’d really appreciate any pointers or recommendations.


Best,

Bushra 

Samhaa El-Beltagy

Nov 19, 2025, 5:02:08 AM
to Bushra Asseri, sig...@googlegroups.com
Hi Bushra, 
We've looked into a lot of tools for this, and the one that worked for us was Google's Gemini 2.5 Flash model. We use API calls to upload the statements and prompt the model to extract the fields we want, returned as JSON. The free daily quota should cover your needs unless you plan to upload thousands of statements.
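
For readers who want to try this, here is a minimal sketch of the upload-and-prompt flow, assuming the `google-genai` Python SDK and an API key in a `GEMINI_API_KEY` environment variable. The prompt wording, the JSON keys, and the helper names `parse_json_reply` and `extract_statement` are illustrative assumptions, not the setup described above.

```python
import json
import os

# Hypothetical prompt; adjust the requested keys to the fields you need.
PROMPT = (
    "Extract every line item from this Arabic financial statement. "
    "Return only JSON: a list of objects with the keys "
    '"label_ar", "period", and "amount".'
)

FENCE = "`" * 3  # Markdown code fence that models sometimes wrap around JSON


def parse_json_reply(text: str):
    """Parse a model reply as JSON, tolerating an optional Markdown fence."""
    cleaned = text.strip()
    if cleaned.startswith(FENCE):
        # Drop the opening fence line, then everything from the closing fence on.
        cleaned = cleaned.split("\n", 1)[1] if "\n" in cleaned else ""
        cleaned = cleaned.rsplit(FENCE, 1)[0]
    return json.loads(cleaned)


def extract_statement(pdf_path: str, model: str = "gemini-2.5-flash"):
    """Upload one statement and ask the model for structured JSON."""
    # Imported lazily so parse_json_reply works without the SDK installed.
    from google import genai
    from google.genai import types

    client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])
    uploaded = client.files.upload(file=pdf_path)
    response = client.models.generate_content(
        model=model,
        contents=[uploaded, PROMPT],
        # Ask for a JSON response body instead of free text.
        config=types.GenerateContentConfig(response_mime_type="application/json"),
    )
    return parse_json_reply(response.text)
```

Calling `extract_statement("statement.pdf")` (the file name is a placeholder) would return the parsed Python object. The extracted figures are still worth spot-checking against the source document.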

Best of luck, 



--
You received this message because you are subscribed to the Google Groups "SIGARAB: Special Interest Group on Arabic Natural Language Processing" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sigarab+u...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/sigarab/CAH_AL8%2BhUR2O2aZwk4-s0PMXTCFHVM%3DZrvqKad3rEgB2TZAFgQ%40mail.gmail.com.

Mohamed H.

Nov 19, 2025, 5:57:01 AM
to sig...@googlegroups.com

Assalamu alaikum,

If you want to self-host, you can look at: https://github.com/datalab-to/surya

Because financial statements already follow a predictable structure, the OCR should, in theory, be relatively easy to get right.
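
One way that predictable structure can help, sketched here under assumptions: after OCR, the amounts in a section can be parsed and checked against the reported total, which flags misread digits. The code is independent of any particular OCR tool; `parse_amount`, `total_matches`, and the tolerance value are hypothetical names and choices for illustration.

```python
import re

# Map Arabic-Indic digits (U+0660..U+0669) and separators to ASCII.
_TO_ASCII = {0x0660 + i: str(i) for i in range(10)}
_TO_ASCII[0x066B] = "."  # Arabic decimal separator
_TO_ASCII[0x066C] = ","  # Arabic thousands separator


def parse_amount(raw: str) -> float:
    """Convert an OCR'd amount string to a float.

    Assumes parentheses mark a negative value, a common accounting convention.
    """
    text = raw.strip().translate(_TO_ASCII)
    negative = text.startswith("(") and text.endswith(")")
    # Keep digits and the decimal point; drop commas, parentheses, spaces.
    text = re.sub(r"[^\d.]", "", text)
    value = float(text)
    return -value if negative else value


def total_matches(items, reported_total, tolerance=0.5):
    """Return True if the line items sum to the reported total (within rounding)."""
    computed = sum(parse_amount(item) for item in items)
    return abs(computed - parse_amount(reported_total)) <= tolerance
```

A failed check does not say which figure was misread, but it narrows manual review to the sections where the arithmetic does not hold.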

All the best.
