Major bug correction - indexing

17 views
Skip to first unread message

carlinho...@gmail.com

unread,
Aug 6, 2015, 2:22:48 PM8/6/15
to mwetoolkit
Hi everybody,

We just corrected a major bug in index.py. Now it is possible to index very large corpora, with more than 1G words like WaC, wikipedia, etc.

We welcome any feedback and suggestions.

Enjoy

Silvio and Carlos

Carlos Ramisch

unread,
Aug 10, 2015, 3:09:04 AM8/10/15
to mwetoolkit
Hi,

I forgot to mention, we corrected some code in C that must be recompiled. To be able to use this improved version, you should :
- svn up
- make

Best
Carlos

--
You received this message because you are subscribed to the Google Groups "mwetoolkit" group.
To unsubscribe from this group and stop receiving emails from it, send an email to mwetoolkit+...@googlegroups.com.
To post to this group, send email to mweto...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/mwetoolkit/c27c7123-8faf-42d9-b199-ab27ca82bec6%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.



--
 Carlos RAMISCH
 pageperso.lif.univ-mrs.fr/~carlos.ramisch

---------------------------------------------------------------------------------
 address:
    LIF-TALEP
    Parc Scientifique et Technologique de Luminy
    163, avenue de Luminy - Case 901
    13288 MARSEILLE CEDEX 9
    France
---------------------------------------------------------------------------------
Reply all
Reply to author
Forward
0 new messages