Interpreting maui response

Thiago Henrique Martinelli

Nov 11, 2016, 7:05:25 AM
to Kea and Maui Support
Hi people.
I used Maui for automatic tagging. It gives me a response that contains some terms and several numbers.
How do I interpret this response? Does anyone know, or have any help material to send me?

Thanks


Richard Cyganiak

Nov 11, 2016, 7:11:19 AM
to kea-and-ma...@googlegroups.com
The terms are the generated tags. The numbers are Maui’s confidence scores. 0 for no confidence, 1 for very high confidence.

Richard



--
You received this message because you are subscribed to the Google Groups "Kea and Maui Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kea-and-maui-sup...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Thiago Henrique Martinelli

Nov 11, 2016, 2:06:08 PM
to Kea and Maui Support
Hi Richard.
I understand. 
Thanks very much.
One more thing: there are several numbers in the output. For example:

franc,France,0.006936,0,0,0,0.998484,0.998484,3,1,0,0,0,0,0,0,0.200217,3,True

What's the difference between these numbers? Is there documentation where I can find the meaning of each one? And what about the last parameter (True)?

Thank you

Thiago



Richard Cyganiak

Nov 14, 2016, 2:15:02 PM
to kea-and-ma...@googlegroups.com
Sorry Thiago, I misunderstood. I’m not seeing this in the output. Are you running MauiTopicExtractor? Is this in the output files or on the console or in the log? Is there anything else around it (that is, can you give a fuller example of the output)?

Richard




Thiago Henrique Martinelli

Nov 14, 2016, 2:26:44 PM
to Kea and Maui Support
Hi Richard.
I'm running MauiTopicExtractor with the -d (debug mode) option.
These numbers appear in the console.
Here is an example of the output:

thiago@lrm-prometheus:~/min/ingles/Maui1.2$ java maui.main.MauiTopicExtractor -l data/user/test/ -m test -d -v agrovoc_en -f skos
Extracting keyphrases with options: -l data/user/test/ -m test -v agrovoc_en -f skos -e default -i en -n 10 -t maui.stemmers.PorterStemmer -s maui.stopwords.StopwordsEnglish -d    
-- Loading the model... 
--- Loading the vocabulary...
--- Building the Vocabulary index from the SKOS file...
--- Statistics about the vocabulary: 
38201 terms in total
10028 non-descriptive terms
28172 terms have related terms
-- Extracting keyphrases... 
No existing topics for data/user/test/a-inter_cropping.txt
-- Reading instance
-- Converting instance for document a-inter_cropping
---- Extracting candidates... 
143 candidates 
0 positive; 143 negative instances
-- Processing document: a-inter_cropping
-- Keyphrases and feature values:
http://www.fao.org/aos/agrovoc#c_1971,'Cropping systems',0.003356,1.776492,0.005961,0.674837,0.674837,0,6,2,0,8,0,0,0,0,0.222895,3,False
http://www.fao.org/aos/agrovoc#c_7038,'Shifting cultivation',0.006711,1.57789,0.01059,0.084967,0.364379,0.279412,3,3,0,2,0,0,0,0,0.203846,5,False
http://www.fao.org/aos/agrovoc#c_34762,'Dairy farms',0.003356,3.291998,0.011047,0.047386,0.047386,0,1,2,0,2,0,0,0,0,0.194903,6,False
http://www.fao.org/aos/agrovoc#c_208,'Agroindustrial sector',0.003356,2.014903,0.006761,0.098039,0.098039,0,9,1,0,1,0,0,0,0,0.184876,8,False
http://www.fao.org/aos/agrovoc#c_6148,'Poultry farming',0.003356,3.193558,0.010717,0.150327,0.150327,0,4,2,0,2,0,0,0,0,0.183461,9,False
-- 0.0 correct
No existing topics for data/user/test/w-Virgil.txt
-- Reading instance
-- Converting instance for document w-Virgil
---- Extracting candidates... 
214 candidates 
0 positive; 214 negative instances
-- Processing document: w-Virgil
-- Keyphrases and feature values:
-- 0.0 correct
No existing topics for data/user/test/ab387e.txt
-- Reading instance
-- Converting instance for document ab387e
---- Extracting candidates... 
323 candidates 
0 positive; 323 negative instances
-- Processing document: ab387e
-- Keyphrases and feature values:
http://www.fao.org/aos/agrovoc#c_3050,'Forest resources',0.009498,1.178655,0.011195,0.020748,0.987415,0.966667,54,2,0,8,0,0,0,0,0.73422,1,False
http://www.fao.org/aos/agrovoc#c_3218,'Genetic resources',0.017639,1.59039,0.028053,0.00034,0.953401,0.953061,15,2,0,4,0,0,0,0,0.643501,2,False
http://www.fao.org/aos/agrovoc#c_28127,'Protective forests',0.008141,1.791759,0.014587,0.517347,0.803061,0.285714,26,2,0,2,0,0,0,0,0.274102,4,False
http://www.fao.org/aos/agrovoc#c_28075,'Forest protection',0.008141,1.791759,0.014587,0.517347,0.803061,0.285714,26,2,0,3,0,0,0,0,0.249988,5,False
http://www.fao.org/aos/agrovoc#c_28126,'Protected forests',0.004071,1.57789,0.006423,0.517347,0.803061,0.285714,14,2,0,3,0,0,0,0,0.218172,8,False
http://www.fao.org/aos/agrovoc#c_8426,'Wood industry',0.006106,2.054124,0.012542,0.282993,0.491837,0.208844,17,2,0,2,0,0,0,0,0.167405,9,False
http://www.fao.org/aos/agrovoc#c_3060,'Forestry policies',0.002714,1.717651,0.004661,0.747619,0.823129,0.07551,45,2,0,3,0,0,0,0,0.163501,10,False
-- 0.0 correct
No existing topics for data/user/test/a-cotton.txt
-- Reading instance
-- Converting instance for document a-cotton
---- Extracting candidates... 
684 candidates 
0 positive; 684 negative instances
-- Processing document: a-cotton
-- Keyphrases and feature values:
http://www.fao.org/aos/agrovoc#c_3337,'Gossypium barbadense',0.002458,5.966147,0.014663,0.006772,0.787157,0.780385,0,2,0,1,0,0,0,0,0.269513,3,False
http://www.fao.org/aos/agrovoc#c_3336,'Gossypium arboreum',0.002458,5.049856,0.012411,0.007014,0.587133,0.580119,0,2,0,1,0,0,0,0,0.172739,4,False
http://www.fao.org/aos/agrovoc#c_3339,'Gossypium hirsutum',0.002151,4.261399,0.009164,0.00653,0.786673,0.780143,0,2,0,1,0,0,0,0,0.166992,5,False
http://www.fao.org/aos/agrovoc#c_34005,'Cotton ginning',0.006759,3.225307,0.021799,0.008103,0.823074,0.814972,0,1,0,2,0,0,0,0,0.12021,8,False
http://www.fao.org/aos/agrovoc#c_49822,'Aral Sea',0.000922,4.261399,0.003928,0.339461,0.84073,0.50127,0,2,0,1,0,0,0,0,0.105326,9,False
http://www.fao.org/aos/agrovoc#c_761,'Bacillus thuringiensis',0.000922,4.174387,0.003847,0.347805,0.875317,0.527512,0,2,0,1,0,0,0,0,0.105326,10,False
-- 0.0 correct
No existing topics for data/user/test/a-crop_destruction.txt
-- Reading instance
-- Converting instance for document a-crop_destruction
---- Extracting candidates... 
60 candidates 
0 positive; 60 negative instances
-- Processing document: a-crop_destruction
-- Keyphrases and feature values:
http://www.fao.org/aos/agrovoc#c_34779,'European Union',0.018692,0.87855,0.016422,0.392857,0.75,0.357143,3,2,0,0,0,0,0,0,0.19344,3,False
http://www.fao.org/aos/agrovoc#c_13570,'Price fixing',0.009346,0.301452,0.002817,0.35,0.35,0,15,1,0,0,0,0,0,0,0.145045,4,False
http://www.fao.org/aos/agrovoc#c_28752,'Production quota',0.009346,3.886705,0.036324,0.675,0.675,0,0,2,0,0,0,0,0,0,0.134122,7,False
http://www.fao.org/aos/agrovoc#c_8678,'Agricultural products',0.028037,0.362185,0.010155,0.182143,0.978571,0.796429,6,2,0,1,0,0,0,0,0.114152,8,False
http://www.fao.org/aos/agrovoc#c_16118,'Crop residues',0.009346,1.839012,0.017187,0.289286,0.289286,0,2,2,0,0,0,0,0,0,0.108741,9,False
http://www.fao.org/aos/agrovoc#c_3029,'Food supply',0.009346,0.897243,0.008385,0.542857,0.542857,0,15,2,0,1,0,0,0,0,0.099001,10,False
-- 0.0 correct
No existing topics for data/user/test/w-james_joyce.txt
-- Reading instance
-- Converting instance for document w-james_joyce
---- Extracting candidates... 
419 candidates 
0 positive; 419 negative instances
-- Processing document: w-james_joyce
-- Keyphrases and feature values:
-- 0.0 correct
No existing topics for data/user/test/w-shakespeare.txt
-- Reading instance
-- Converting instance for document w-shakespeare
---- Extracting candidates... 
344 candidates 
0 positive; 344 negative instances
-- Processing document: w-shakespeare
-- Keyphrases and feature values:
-- 0.0 correct
No existing topics for data/user/test/w-nabokov.txt
-- Reading instance
-- Converting instance for document w-nabokov
---- Extracting candidates... 
328 candidates 
0 positive; 328 negative instances
-- Processing document: w-nabokov
-- Keyphrases and feature values:
http://www.fao.org/aos/agrovoc#c_8364,'Western Europe',0.00091,2.084583,0.001897,0.121988,0.121988,0,2,2,0,7,0,0,0,0,0.087908,5,False
http://www.fao.org/aos/agrovoc#c_4740,'Mental ability',0.015469,1.31696,0.020372,0.012425,0.941265,0.92884,1,1,0,1,0,0,0,0,0.070909,7,False
-- 0.0 correct
No existing topics for data/user/test/a-cover-crop.txt
-- Reading instance
-- Converting instance for document a-cover-crop
---- Extracting candidates... 
429 candidates 
0 positive; 429 negative instances
-- Processing document: a-cover-crop
-- Keyphrases and feature values:
http://www.fao.org/aos/agrovoc#c_1936,'Cover plants',0.041667,1.986465,0.082769,0,0.991353,0.991353,0,2,0,2,0,0,0,0,0.352947,1,False
http://www.fao.org/aos/agrovoc#c_3364,'Grassland management',0.000641,2.382628,0.001527,0.156767,0.156767,0,4,3,0,0,0,0,0,0,0.288163,2,False
http://www.fao.org/aos/agrovoc#c_7176,'Soil management',0.000641,2.204947,0.001413,0.040602,0.040602,0,5,2,0,5,0,0,0,0,0.126051,7,False
http://www.fao.org/aos/agrovoc#c_6148,'Poultry farming',0.000641,3.193558,0.002047,0.033459,0.033459,0,4,2,0,2,0,0,0,0,0.124415,8,False
-- 0.0 correct
No existing topics for data/user/test/a-agriculture.txt
-- Reading instance
-- Converting instance for document a-agriculture
---- Extracting candidates... 
978 candidates 
0 positive; 978 negative instances
-- Processing document: a-agriculture
-- Keyphrases and feature values:
http://www.fao.org/aos/agrovoc#c_2807,'Farming systems',0.01601,0.208823,0.003343,0.0058,0.953272,0.947472,12,1,0,8,0,0,0,0,0.228118,1,False
http://www.fao.org/aos/agrovoc#c_1666,'Climatic change',0.003958,1.776492,0.007031,0.034638,0.865861,0.831223,8,2,0,2,0,0,0,0,0.22588,2,False
http://www.fao.org/aos/agrovoc#c_438,'Animal products',0.002878,0.903552,0.002601,0.033004,0.780819,0.747815,16,2,0,7,0,0,0,0,0.22089,3,False
http://www.fao.org/aos/agrovoc#c_5976,'Plant production',0.002159,0.737716,0.001592,0.00384,0.876889,0.87305,21,2,0,7,0,0,0,0,0.150206,5,False
http://www.fao.org/aos/agrovoc#c_1971,'Cropping systems',0.00072,1.776492,0.001278,0.032677,0.216159,0.183482,6,2,0,12,0,0,0,0,0.124016,8,False
http://www.fao.org/aos/agrovoc#c_8325,'Water resources',0.001079,1.060872,0.001145,0.020341,0.800915,0.780573,10,2,0,5,0,0,0,0,0.123271,9,False
-- 0.0 correct

-- Evaluation results based on 10 documents:
Avg. number of correct keyphrases per document: 0 +/- 0
Precision: 0 +/- 0
Recall: 0 +/- 0
F-Measure: NaN

Richard Cyganiak

Nov 14, 2016, 3:20:07 PM
to kea-and-ma...@googlegroups.com
Oh, from the directory name in your command it looks like you’re using Maui 1.2? I’m using the latest code from GitHub (1.3 plus some small additional changes) and the output there looks different.

I’m having a look at the source code of Maui 1.2, and will guess that each value in the comma-separated list has the following meaning:

1. ID of the term in the thesaurus
2. A label of the term (either the preferred one, or the one that was matched to a candidate — I’m not sure)
3-16. Values for 14 features — see Alyona’s thesis for details about each:

private int tfIndex = 0; // term frequency
private int idfIndex = 1; // inverse document frequency
private int tfidfIndex = 2; // TFxIDF
private int firstOccurIndex = 3; // position of the first occurrence
private int lastOccurIndex = 4; // position of the last occurrence
private int spreadOccurIndex = 5; // spread of occurrences
private int domainKeyphIndex = 6; // domain keyphraseness
private int lengthIndex = 7; // term length
private int generalityIndex = 8; // generality

// Thesaurus features
private int nodeDegreeIndex = 9; // node degree

// Wikipedia features
private int semRelIndex = 10; // semantic relatedness
private int wikipKeyphrIndex = 11; // wikipedia keyphraseness
private int invWikipFreqIndex = 12; // inverse wikipedia frequency
private int totalWikipKeyphrIndex = 13; // total wikipedia keyphraseness

17. Probability score (0–1)
18. Rank of the keyword (1–10 if you generate 10 per document)
19. True if the term was given in the training data for this document, False otherwise

This is from a quick look at the source code, so no guarantee :-)
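
To make the column layout above concrete, here is a minimal Java sketch that splits one debug line into those 19 fields. It is not part of Maui: the class name MauiDebugLine, its field names, and the short feature names are made up for illustration, and the column order is the guess given above, not a documented format. It counts fields from the right so that a label containing a comma would still parse. In the sample line the third feature equals the product of the first two, and the sixth equals the fifth minus the fourth, which fits the TFxIDF and spread reading.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Maui: parses one line printed under
// "-- Keyphrases and feature values:" using the guessed column layout above.
class MauiDebugLine {
    // Short names (made up here) in the order of the Maui 1.2 index fields.
    static final String[] FEATURE_NAMES = {
        "tf", "idf", "tfidf", "firstOccur", "lastOccur", "spreadOccur",
        "domainKeyph", "length", "generality", "nodeDegree",
        "semRel", "wikipKeyphr", "invWikipFreq", "totalWikipKeyphr"
    };

    final String id;                 // column 1: term ID in the thesaurus
    final String label;              // column 2: a label of the term
    final double[] features;         // columns 3-16: the 14 feature values
    final double probability;        // column 17: probability score
    final int rank;                  // column 18: rank of the keyword
    final boolean inTrainingTopics;  // column 19: True/False flag

    private MauiDebugLine(String id, String label, double[] features,
                          double probability, int rank, boolean inTrainingTopics) {
        this.id = id;
        this.label = label;
        this.features = features;
        this.probability = probability;
        this.rank = rank;
        this.inTrainingTopics = inTrainingTopics;
    }

    static MauiDebugLine parse(String line) {
        String[] parts = line.trim().split(",");
        int n = parts.length;
        if (n < 19) {
            throw new IllegalArgumentException("expected at least 19 fields, got " + n);
        }
        // The last 17 fields never contain commas, so everything between the
        // ID and those fields is the label (rejoined in case it had a comma).
        String label = String.join(",", Arrays.copyOfRange(parts, 1, n - 17));
        if (label.length() >= 2 && label.startsWith("'") && label.endsWith("'")) {
            label = label.substring(1, label.length() - 1);
        }
        double[] features = new double[FEATURE_NAMES.length];
        for (int i = 0; i < features.length; i++) {
            features[i] = Double.parseDouble(parts[n - 17 + i]);
        }
        double probability = Double.parseDouble(parts[n - 3]);
        int rank = Integer.parseInt(parts[n - 2]);
        boolean flag = Boolean.parseBoolean(parts[n - 1]); // case-insensitive "true"
        return new MauiDebugLine(parts[0], label, features, probability, rank, flag);
    }

    public static void main(String[] args) {
        MauiDebugLine row = parse(
            "http://www.fao.org/aos/agrovoc#c_7038,'Shifting cultivation',"
            + "0.006711,1.57789,0.01059,0.084967,0.364379,0.279412,"
            + "3,3,0,2,0,0,0,0,0.203846,5,False");
        System.out.println(row.label + " (rank " + row.rank
            + ", probability " + row.probability + ")");
        for (int i = 0; i < FEATURE_NAMES.length; i++) {
            System.out.println("  " + FEATURE_NAMES[i] + " = " + row.features[i]);
        }
    }
}
```

The same method also handles the unquoted form from the earlier message (franc,France,...), since stripping the quotes is optional.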

Hope that helps,
Richard




Thiago Henrique Martinelli

Nov 14, 2016, 3:36:02 PM
to Kea and Maui Support
Richard.
Thanks very much for the answer. It will help me a lot!