Query for tags (phrasal categories)

Ulrike

05.03.2015, 11:09:46
to ant...@googlegroups.com
Hello,

This is a really basic question, but I can't find it in the tutorial:

I have (well, will have) a corpus containing category tags like this:
<np>determiner noun another modifier</np>


but I can't figure out how to find all instances of NPs. I've tried <np>*</np> and other wildcards, but I don't get any results, so I suppose that's not the way to do it.

Any help would be appreciated!

Best regards,

Laurence Anthony

05.03.2015, 18:24:14
to ant...@googlegroups.com

Hi,

If you know the tag has four parts, just use four asterisks separated by spaces, instead of the one in your example.

That should work.

Laurence

--
You received this message because you are subscribed to the Google Groups "AntConc-discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to antconc+u...@googlegroups.com.
To post to this group, send email to ant...@googlegroups.com.
Visit this group at http://groups.google.com/group/antconc.
For more options, visit https://groups.google.com/d/optout.

Ulrike Albers

06.03.2015, 03:03:54
to ant...@googlegroups.com
Hello,

Thank you very much for your answer, Laurence.

The aim is to get all noun phrases (for instance). They might have one or more tokens; in theory, there is no limit to the number. Another example would be getting all utterances by speaker 1.

--
Kind regards,
Ulrike Albers
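
For readers who want to cross-check this goal outside AntConc, here is a minimal Python sketch that collects the contents of every tagged span, whatever the number of tokens inside. It is not an AntConc feature. The helper name find_tagged is hypothetical, and the sketch assumes tags without attributes that are not nested inside themselves.

```python
import re

def find_tagged(text, tag):
    """Return the contents of every <tag>...</tag> span in text.

    Assumption: the tags carry no attributes and are not nested inside
    themselves. This is a plain regular expression, not an XML parser.
    """
    pattern = re.compile(r"<{0}>(.*?)</{0}>".format(re.escape(tag)), re.DOTALL)
    return [content.strip() for content in pattern.findall(text)]

# Sample text modelled on the tagging shown in the first post.
sample = "<np>determiner noun another modifier</np> runs <np>lorem</np>"
print(find_tagged(sample, "np"))
```

The same call with a different tag name would, under the same assumptions, collect other categories such as utterances marked with a speaker tag.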

James Vanden Bosch

08.03.2015, 20:46:36
to ant...@googlegroups.com
Dear Ulrike,

If you're dealing with a tagged corpus, as in your original question, a search for "np" alone would pull up all the proper nouns. But perhaps you're looking for noun phrases instead? I guess I'm not sure that I understand your question.

Sincerely yours,
Jim

Laurence Anthony

08.03.2015, 21:02:39
to ant...@googlegroups.com
Hi again,

If you are unsure of how many items will be between the <np></np> tags, just use @ wildcards instead of * wildcards. A search for <np>@@@@@</np> would find noun phrases consisting of 1 to 5 elements. The <np> tag could be replaced by any tag of interest, too.

I hope that helps.

Laurence.

Ulrike Albers

09.03.2015, 08:05:42
to ant...@googlegroups.com
Hello,

Thank you so much for your help, Laurence! Unfortunately, it does not give me any hits.

With

<np>lorem</np>

I get results, but with

<np>@</np> or <np>@@@@@</np>

I don't get any.
I suppose something is wrong with my configuration (or my XML file? But then, why would <np>lorem</np> show results?). Maybe I should reinstall AntConc?

Best regards,

Laurence Anthony

09.03.2015, 09:03:48
to ant...@googlegroups.com

Hi,

Can you send a simple test file? That will let me check why you get no hits.

You don't need to reinstall AntConc. In fact, AntConc never needs installing. Just double-click on the icon. It runs as a portable app.

Laurence.

Ulrike Albers

09.03.2015, 09:21:43
to ant...@googlegroups.com
Hello Laurence,

Yes, of course. I forgot about this feature of your program!

Here is my test file.

Thank you very much!
test.xml

Ulrike

31.03.2015, 05:11:59
to ant...@googlegroups.com
Hello Laurence, hello everybody,

I have attached my file (see my last post). Have you had time to look at it? Or does anybody know why I have this issue?


Thank you very much!

Best regards,

Laurence Anthony

01.04.2015, 00:33:34
to ant...@googlegroups.com
Hi,

Sorry for the delay. Just got back to Japan after a year in the UK.

The solution is rather weird. The @ wildcard looks for non-word characters on either side. So, the following doesn't work:

<np>@@@@@</np>

because there are no non-word characters after the > of <np> or before the < of </np>. To get the search to work, you need to use the following:

<np@@@@@np>

This will find an np tag with up to five 'words' inside.

To get an exact match of, say, three words, you can use this:

<np###np>

It's not very intuitive, is it? I need to work on this.

Laurence.
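
To sanity-check the hit counts from these two queries, one option is to count the same spans outside AntConc. The following is a minimal Python sketch, not AntConc's own matching logic. The helper name np_by_length is hypothetical, and it assumes that 'words' are whitespace-separated tokens and that <np> tags have no attributes and are not nested.

```python
import re

# Non-greedy match of everything between <np> and </np>.
NP_SPAN = re.compile(r"<np>(.*?)</np>", re.DOTALL)

def np_by_length(text, min_words, max_words):
    """Return <np> contents whose token count lies in [min_words, max_words].

    Assumption: a 'word' is any whitespace-separated token, which may
    differ from the token definition configured in AntConc.
    """
    hits = []
    for content in NP_SPAN.findall(text):
        tokens = content.split()
        if min_words <= len(tokens) <= max_words:
            hits.append(" ".join(tokens))
    return hits

sample = (
    "<np>lorem</np> sit <np>determiner noun another modifier</np> "
    "amet <np>one two three</np>"
)
print(np_by_length(sample, 1, 5))  # roughly comparable to <np@@@@@np>
print(np_by_length(sample, 3, 3))  # roughly comparable to <np###np>
```

If the counts differ from what AntConc reports, the token definition is the first thing worth comparing.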



Ulrike Albers

08.04.2015, 09:37:30
to ant...@googlegroups.com
Hello Laurence,

I'm sorry for replying only now. Thank you so much for your help. It works fine this way!

It might be a little weird, but well, it does the job :) Thanks again!

--
Kind regards,
Ulrike Albers