IteratorFactory properties and the number of processed document

6 views
Skip to first unread message

Ibtehal Baazeem

unread,
May 9, 2015, 1:20:14 AM5/9/15
to s-space-re...@googlegroups.com
Dear S-Space Developers, 

I'm using Eclipse to write and run my code.
       
       1) I have a question regarding setting the property of the IteratorFactory, is this the correct way?
       
Properties preprocessProp=new Properties();
preprocessProp.setProperty("edu.ucla.sspace.text.TokenizerFactory.tokenCountLimit","20");
preprocessProp.setProperty("edu.ucla.sspace.text.TokenizerFactory.stemmer", "edu.ucla.sspace.text.EnglishStemmer");

//assign the properties to the IteratorFactory
IteratorFactory.setProperties(preprocessProp);
   
        lsa.processSpace(definedProp);
   
   
   2) How can I know the number of processed document? when I tried this method I got an error?
   System.out.print( "documentSpaceSize: "+ lsa.documentSpaceSize());

            
            3) when I tried:

 

       preprocessProp.setProperty("edu.ucla.sspace.text.IteratorFactory.tokenFilter", "exclude=lists/mysqlstopwords.txt");


        It seems doesn't work, I got the same number of words using lsa.getWords().size(). Could you please advise what might be the cause?





         Many Thanks,
Reply all
Reply to author
Forward
0 new messages