Hello,
I'm currently working with PDFium's text extraction features in pypdfium2. Extraction works well for Latin script, but I have problems with Hindi text.
When trying to extract text from the attached PDF, I get something like "मၝघोषणाᆸपुჹऔर सहमत ီ ჸकᇆ", although it should be "मैं घोषणा, पुष्टि और सहमत हूँ कि:" instead.
I'm wondering whether this is a known limitation of PDFium itself, or whether I'm decoding the data returned by PDFium incorrectly. This is my current code:
```python3
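import ctypes
# Buffer for n_chars UTF-16 code units plus one zero-initialized spare slot
c_array = (ctypes.c_ushort * (n_chars+1))()
pdfium.FPDFText_GetBoundedText(*args, ctypes.cast(c_array, ctypes.POINTER(ctypes.c_ushort)), n_chars)
# Decode the raw buffer as UTF-16LE and drop the spare trailing code unit
text = bytes(c_array).decode("utf-16-le")[:-1]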
```
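To narrow this down, here is a minimal sketch that takes PDFium out of the picture and exercises only the buffer handling and decoding step. It fills the same kind of `c_ushort` array with the expected string and decodes it again, then lists the code points of the garbled output. The helper name `decode_utf16_buffer` is hypothetical and only for illustration, and the Devanagari range check (U+0900 to U+097F) is my own assumption about what the output should mostly contain.

```python
import ctypes

def decode_utf16_buffer(c_array, n_units):
    """Decode the first n_units UTF-16 code units of a ctypes c_ushort array."""
    raw = bytes(c_array)[: 2 * n_units]
    return raw.decode("utf-16-le")

# Round trip: put the expected text into a buffer shaped like the one
# passed to PDFium, then decode it the same way as in my code above.
expected = "मैं घोषणा, पुष्टि और सहमत हूँ कि:"
units = expected.encode("utf-16-le")
n_chars = len(units) // 2
c_array = (ctypes.c_ushort * (n_chars + 1))()
ctypes.memmove(c_array, units, len(units))
roundtrip = decode_utf16_buffer(c_array, n_chars)

# Inspect the garbled result: which characters lie outside Devanagari?
garbled = "मၝघोषणाᆸपुჹऔर सहमत ီ ჸकᇆ"
outside = [
    f"U+{ord(ch):04X}"
    for ch in garbled
    if not ch.isspace() and not 0x0900 <= ord(ch) <= 0x097F
]
print(roundtrip == expected)
print(outside)
```

If the round trip holds, the decoding itself seems unlikely to be the culprit, and the stray code points would already be present in the data PDFium hands back. I can't tell from this alone whether that comes from PDFium or from the PDF's own character mapping.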
Thanks!