Hi,
did you make sure that the program actually terminated (by itself)? The transformation can take a while (maybe days).
If so, you could try with a smaller dump (e.g. a different language)to see whether that works and if so, maybe there is a problem with the particular dump you are trying to process.
Best,
Johannes
--
You received this message because you are subscribed to the Google Groups "jwpl-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
jwpl+uns...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Did you make sure there is enough space on your hard drive?
To unsubscribe from this group and stop receiving emails from it, send an email to jwpl+unsubscribe@googlegroups.com.
System.setProperty("jdk.xml.totalEntitySizeLimit", "500000000");
SAXParserFactory spf = SAXParserFactory.newInstance();
spf.setFeature(XMLConstants.FEATURE_SECURE_PROCESSING, false);
// Path where wiki dump is stored.
String[] arg = {"english", "Contents", "Disambiguation_pages", "/save/ngupta19/enwiki/20160501/"};
JWPLDataMachine.main(arg);This was able to process the dumps into the 11 .txt files. It took around 7 hours. Just for bookkeeping, making the SQL database and loading tables took around 2-3 hours and initial indexing in Java around 1 hour.Thanks,