one of the file is enclosed. It is subtitle of a TV Series, extracted
from video of the same. Any rights belong to Zee I guess.
I had searched on the net and found that it is about utf-8 text getting
saved as latin-1959 or something. Now whatever encoding you change in
the file, you cannot get back the original hindi character as the ANSI
characters that are displayed are very much valid and no change of
encoding would want to convert that.
There is a site:
https://www.branah.com/unicode-converter
if you put the first jumbled line in the file:
शॠकॠरिया, करॠनल।
into the fourth box "UTF-8 Text", it will show "शुक्रिया, कर्नल।" in the
top box.
similarly, second line
ओह, माफ़ कीजियेगा।
gives "ओह, माफ़ कीजियेगा।"
So, it is obvious that conversion logic and methods are there.
from the page source, the relevant code is:
<input type="button" id="btntext" value="Convert" style="width:100px">
Unicode text (Example: a 中 Я)<br>
<textarea id="text" rows="4" cols="75"></textarea><br><input
type="button" id="separate" value="Add spaces"
style="width:100px"><input type="button" id="combine" value="Remove
spaces" style="margin-left:1em;width:100px"><label
style="margin-left:1em"><input type="checkbox" id="whitespace"> Convert
whitespace characters</label><label style="margin-left:1em"><input
type="checkbox" id="endian"> Little Endian</label></p>
Just that it is not possible to convert many such lines in a 31kb file.
by pasting in that box one by one manually.
I want a method that a software and differentiate between other ANSI
characters and these jumbled character and converts only these
characters to unicode as above, copying entire other ANSI things intact,
giving me entire file that I can add to my movie player and see correct
text.
Thanks.
--
Rawat
On 11-Dec-23 10:13 AM, Ravishankar Shrivastava wrote:
> Probably, the file is corrupt or the encoding is set wrong while
> file-saving, and if it is so, data is hard to recover. Still, if you can
> share the file, further analysis can be done on the file to recover data.
>
> Ravi
>
>
> On Mon, 11 Dec, 2023, 02:03 , <
technic...@googlegroups.com
> <mailto:
technic...@googlegroups.com>> wrote:
>
>
technic...@googlegroups.com
> <
https://groups.google.com/forum/?utm_source=digest&utm_medium=email#!forum/technical-hindi/topics>
> Google Groups
> <
https://groups.google.com/forum/?utm_source=digest&utm_medium=email/#!overview>
> <
https://groups.google.com/forum/?utm_source=digest&utm_medium=email/#!overview>
>
> विषय डाइजेस्ट
> सभी विषय देखें
> <
https://groups.google.com/forum/?utm_source=digest&utm_medium=email#!forum/technical-hindi/topics>
>
>
> * Any offline-online tool to convert such text
> <#m_8134792511179162147_group_thread_0> - 1 अपडेट
>
> Any offline-online tool to convert such text
> <
http://groups.google.com/group/technical-hindi/t/3bcc03f02f7790bb?utm_source=digest&utm_medium=email>
>
> V S Rawat <
vsr...@gmail.com <mailto:
vsr...@gmail.com>>: Dec 11
> वापस ऊपर <#m_8134792511179162147_digest_top>
> आपको यह डाइजेस्ट मिला, क्योंकि आपने इस समूह के अपडेट की सदस्यता ली है. आप समूह
> सदस्यता पेज
> <
https://groups.google.com/forum/?utm_source=digest&utm_medium=email#!forum/technical-hindi/join>
> <mailto:
technical-hin...@googlegroups.com> को ईमेल भेजें.
>
> --
> आपको यह मैसेज इसलिए मिला है क्योंकि आपने Google Groups के "Scientific and
> Technical Hindi (वैज्ञानिक तथा तकनीकी हिन्दी)" ग्रुप की सदस्यता ली है.
> इस समूह की सदस्यता खत्म करने और इससे ईमेल पाना बंद करने के लिए,
>
technical-hin...@googlegroups.com
> <mailto:
technical-hin...@googlegroups.com> को ईमेल भेजें.
> वेब पर यह चर्चा देखने के लिए,
>
https://groups.google.com/d/msgid/technical-hindi/CAAX3pZ4nvehTZSurmHk6hvDzvGfddpOmcGyn%3DT3%3DTvYMAisirA%40mail.gmail.com
> <
https://groups.google.com/d/msgid/technical-hindi/CAAX3pZ4nvehTZSurmHk6hvDzvGfddpOmcGyn%3DT3%3DTvYMAisirA%40mail.gmail.com?utm_medium=email&utm_source=footer>
> पर जाएं.