I have been importing several old Toolbox databases into Flex, but I am having some problems using regular expressions with Notepad++. I needed to add “\ps v” to all entries that began with “*”. The result looked good in Notepad++ but then Flex did not see that “\ps v” was on a new line and merged the information with the “\x” field. When I went back to Notepad++ and went to “Viewà show symbolsà end of line”, most lines had “CR” “LF”. The problem records do not have “CR” only “LR” . What is the correct regular expression symbols for carriage return? I need to improve my expressions for the next data set to be imported.
I want to alert others that Notepad++, while free and good in many ways, does not display a Unicode text correctly in all cases.
Jeff Shrum
Language Technology Consultant
SIL Southern Africa
+258 82 300 8461
In Malawi: +265 99 373 3153
\r\n matches crlf
--
You are subscribed to the publicly accessible group "FLEx list".
Only members can post but anyone can view messages on the website.
---
You received this message because you are subscribed to the Google Groups "FLEx list" group.
To unsubscribe from this group and stop receiving emails from it, send an email to flex-list+...@googlegroups.com.
To post to this group, send email to flex...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/000001cec8ee%24b9e5cd60%242db16820%24%40org.
For more options, visit https://groups.google.com/groups/opt_out.
I have been importing several old Toolbox databases into Flex, but I am having some problems using regular expressions with Notepad++. I needed to add “\ps v” to all entries that began with “*”. The result looked good in Notepad++ but then Flex did not see that “\ps v” was on a new line and merged the information with the “\x” field. When I went back to Notepad++ and went to “Viewàshow symbolsà end of line”, most lines had “CR” “LF”. The problem records do not have “CR” only “LR” . What is the correct regular expression symbols for carriage return? I need to improve my expressions for the next data set to be imported.
I want to alert others that Notepad++, while free and good in many ways, does not display a Unicode text correctly in all cases.In Malawi: +265 99 373 3153
Beth,
Yes, the file I have is a mess. It seems to have lots of hidden tab characters and LF’s without CR’s. I think I made it worse without knowing it. The file I was given is actually a .doc file that has had who knows what done to it. I have learned to scrutinize files more before I start working on them. Things are not always what they seem. I think I can find all of the LF’s without CR’s and add the CR’s with a regular expression now that I know what the problem is. One of the interesting things that happens on importing a \lx field without out a CR on the end is that Flex marks even single word lexemes as phrases. Next time I see this, I will know what the underlying problem with the SFM file is.
Jeff S.
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/143020A9-AA28-4365-A77A-96ED707737ED%40sil.org.
Thanks Jim. These look like the strings that I was taught to use, but when the result looked correct in Notepad++ I did not add them to the expression that I used.
Jeff S.
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/83ddeca7b26de44faf8d2bcba4857817%40sil.org.
In this data there are lines with only LF and no CR, but I cannot seem to write and expression that will only capture those cases. Whatever I have tried captures both lines with LF and CRLF. Anyone know how to do this?
Jeff S.
From: flex...@googlegroups.com [mailto:flex...@googlegroups.com] On Behalf Of Jeff Shrum
Sent: Monday, October 14, 2013 5:05 PM
To: flex...@googlegroups.com
Subject: [FLEx] Carriage return or new line
I have been importing several old Toolbox databases into Flex, but I am having some problems using regular expressions with Notepad++. I needed to add “\ps v” to all entries that began with “*”. The result looked good in Notepad++ but then Flex did not see that “\ps v” was on a new line and merged the information with the “\x” field. When I went back to Notepad++ and went to “Viewà show symbolsà end of line”, most lines had “CR” “LF”. The problem records do not have “CR” only “LR” . What is the correct regular expression symbols for carriage return? I need to improve my expressions for the next data set to be imported.
--
Robert,
Yes, I see that a two step process could work. Thankyou. I am just surprised that something that has symbol “\n” cannot be treated in isolation. Probably dates back to a weakness in DOS or KPl-M that no one ever bothered to correct.
Jeff S.
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/9F84FA6A40E240B087D58DF3CD61F6A6%40RobertHedinger.
To view this discussion on the web visit https://groups.google.com/d/msgid/flex-list/001301cec9e4%2462ca88d0%24285f9a70%24%40org.