Text processing

6 views
Skip to first unread message

krishnasaibn

unread,
Apr 25, 2019, 3:11:37 AM4/25/19
to Lamp_tutorial

notepad_2019-04-25_12-32-52.png


My text file looks like the above image. I have a few text files they may not be in same pattern as this text file. I need to extract data of each subheading like(Notes,Rule Note, or other Note) alone in generic manner  and i need to pass to temp file further process continues from temp file. then next subheading note should pass temp file like till the end of the file


Can any one help with this one

udhay prakash pethakamsetty

unread,
Apr 28, 2019, 12:18:55 AM4/28/19
to krishnasaibn, Lamp_tutorial
using regular expressions will solve the problem, when the pattern is unique in all
regards

UDHAY PRAKASH


--
You received this message because you are subscribed to the Google Groups "Lamp_tutorial" group.
To unsubscribe from this group and stop receiving emails from it, send an email to lamp_tutoria...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Message has been deleted

krishnasaibn

unread,
Apr 29, 2019, 7:29:29 AM4/29/19
to Lamp_tutorial
Pattern is  unique but in some files it may miss one or two subheadings, or some may have only last subheading other Note

1. using regex missing the subheading match pattern at the end which is required for next iteration

2. content may also have the subheading words

I'm ok with also working with beautiful soup. and any approach is fine .
To unsubscribe from this group and stop receiving emails from it, send an email to lamp_t...@googlegroups.com.

udhay prakash pethakamsetty

unread,
May 7, 2019, 10:24:52 PM5/7/19
to krishnasaibn, Lamp_tutorial
If you have the previlage to sent the files, reply all the distincct usecases files. 
I will reply regex, if feasible
regards

UDHAY PRAKASH


On Mon, Apr 29, 2019 at 4:56 PM krishnasaibn <krishn...@gmail.com> wrote:
Pattern is  unique but in some files it may miss one or two subheadings, or some may have only last subheading other Note

1. using regex missing the subheading match pattern at the end which is required for next iteration

2. content may also have the subheading words


On Sunday, 28 April 2019 09:48:55 UTC+5:30, uday3prakash wrote:
To unsubscribe from this group and stop receiving emails from it, send an email to lamp_t...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.
Reply all
Reply to author
Forward
0 new messages