The problem with parsing a Cyrillic site

Элона Кулешова

Oct 24, 2021, 5:16:51 PM
to Web Scraping
Hello. After parsing, the CSV file shows a jumble of letters instead of Russian characters. I suspect an encoding problem, although I never ran into this on earlier projects. What could be causing it, and how can I fix it?

[Attachment: 1.PNG]

Andrew11

Oct 24, 2021, 5:24:22 PM
to Web Scraping
If you have the Visual Studio Code text editor, open the CSV file there and change the encoding to UTF-8 with BOM. After that, double-clicking the file should open it in Excel with the letters displayed correctly. This only happens in Excel; the CSV file itself is not the problem. Let me know if it doesn't work.
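
If you would rather script the same change than use an editor, here is a minimal sketch in Python. It assumes the exported file is already valid UTF-8 and only lacks the byte-order mark; the function name `add_utf8_bom` and the file names in the usage note are made up for illustration.

```python
import codecs
from pathlib import Path


def add_utf8_bom(src: str, dst: str) -> None:
    """Copy a UTF-8 CSV to dst, making sure it starts with a UTF-8 BOM.

    Hypothetical helper: it works on raw bytes so that line endings and
    field contents are left exactly as they were.
    """
    data = Path(src).read_bytes()

    # Strip an existing BOM first so the function is safe to run twice.
    if data.startswith(codecs.BOM_UTF8):
        data = data[len(codecs.BOM_UTF8):]

    # Assumption check: fail loudly (UnicodeDecodeError) if the file is not
    # UTF-8 at all, because prepending a BOM would not help in that case.
    data.decode("utf-8")

    # The BOM is the three bytes EF BB BF at the very start of the file.
    Path(dst).write_bytes(codecs.BOM_UTF8 + data)
```

For example, `add_utf8_bom("run_results.csv", "run_results_bom.csv")` would write a copy that you can then try opening by double-click.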

sh...@parsehub.com

Oct 25, 2021, 11:53:45 AM
to Web Scraping
Hi,

If you are using spreadsheet software such as Excel, you may need to change the import settings so that it can handle the Cyrillic characters. This guide shows how to do that in Excel when importing your .csv from ParseHub:
https://help.parsehub.com/hc/en-us/articles/115001263913-My-CSV-Excel-file-is-formatted-incorrectly
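
Before changing import settings, it can help to confirm how the file is actually encoded, so you know which encoding to pick in the import dialog. The sketch below is only an illustration in Python: `inspect_csv_bytes` and `legacy_view` are hypothetical helper names, and the use of Windows-1251 as the fallback code page is an assumption about a Russian-locale setup, not something taken from the guide.

```python
import codecs


def inspect_csv_bytes(data: bytes) -> tuple[bool, bool]:
    """Return (has_bom, is_valid_utf8) for the raw bytes of a CSV file."""
    has_bom = data.startswith(codecs.BOM_UTF8)
    try:
        # "utf-8-sig" accepts UTF-8 both with and without a leading BOM.
        data.decode("utf-8-sig")
        is_valid_utf8 = True
    except UnicodeDecodeError:
        is_valid_utf8 = False
    return has_bom, is_valid_utf8


def legacy_view(data: bytes, codepage: str = "cp1251") -> str:
    """Show roughly what a program sees if it misreads the bytes.

    Assumption: the program falls back to a single-byte legacy code page.
    Bytes that the code page does not define are replaced, not raised.
    """
    return data.decode(codepage, errors="replace")


if __name__ == "__main__":
    sample = "Привет,мир\n".encode("utf-8")
    print(inspect_csv_bytes(sample))  # BOM flag and UTF-8 validity
    print(legacy_view(sample))        # scrambled letters, as in the report
```

If the file turns out to be valid UTF-8 without a BOM, choosing UTF-8 as the file origin during import is the setting to look for.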

Let us know if that helps!
Cheers,
Shan