Hallo Mike,
thanks to your help I was help to put tags around metadata on my news article corpus.
Now I would like to ask you if there's a way with the text converter tool to delete every chunk of text within brakets from the files.
I can use the corpus with the tags, of course, but I would like also to have a clean version of the texts.
For example, I would like to remove all these kind of lines from multiple files. Is there a way for doing this?
<h>«Il digitale? All' Europa manca una piattaforma per
competere»</h>
<h>Corriere della Sera (Italy)</h>
<h>27 gennaio 2019 domenica</h>
<h>RIBATTUTA Edizione</h>
<h>Copyright 2019 RCS Mediagroup All Rights
Reserved</h>
<h> </h>
<h>Section: ECONOMIA; Pag. 27</h>