Hi Haidee,
I think the issue would not just be spaces, but where to put spaces. In English, we surround words with spaces or punctuation+space, so it is easy for a problem to turn the words in a text file into tokens. But in character based languages, it's not always 1 character per word. Characters build into words contextually, something a human reader is good at, but a computer a little less so. I think you will need to tokenize your text first, and then use AntConc. That might be a challenge.
-Bill