Hi all,
I'm trying installing full english language to evaluate similarity between wikipedia categories.
I started yesterday the process from gui and it has gone well right a few time ago, then log window keep remaining to
mar 16, 2015 7:34:00 PM org.wikibrain.loader.SqlLinksLoader processOneLink
INFORMAZIONI: Processed link 893100000, found 541011312 interesting and 335216479 new
mar 16, 2015 7:34:01 PM org.wikibrain.loader.SqlLinksLoader processOneLink
INFORMAZIONI: Processed link 893200000, found 541011312 interesting and 335216479 new
mar 16, 2015 7:34:01 PM org.wikibrain.loader.SqlLinksLoader processOneLink
INFORMAZIONI: Processed link 893300000, found 541011312 interesting and 335216479 new
mar 16, 2015 7:34:01 PM org.wikibrain.loader.SqlLinksLoader processOneLink
INFORMAZIONI: Processed link 893400000, found 541011312 interesting and 335216479 new
mar 16, 2015 7:34:02 PM org.wikibrain.loader.SqlLinksLoader processOneLink
INFORMAZIONI: Processed link 893500482, found 541011312 interesting and 335216479 new
mar 16, 2015 7:34:03 PM org.wikibrain.loader.SqlLinksLoader processOneLink
INFORMAZIONI: Processed link 893600000, found 541011312 interesting and 335216479 new
mar 16, 2015 7:34:03 PM org.wikibrain.loader.SqlLinksLoader processOneLink
INFORMAZIONI: Processed link 893700000, found 541011312 interesting and 335216479 new
mar 16, 2015 7:34:04 PM org.wikibrain.loader.SqlLinksLoader processOneLink
INFORMAZIONI: Processed link 893800000, found 541011312 interesting and 335216479 new
mar 16, 2015 7:34:04 PM org.wikibrain.loader.SqlLinksLoader processOneLink
INFORMAZIONI: Processed link 893900000, found 541011312 interesting and 335216479 new
mar 16, 2015 7:34:05 PM org.wikibrain.utils.ParallelForEach$4 run
INFORMAZIONI: processing iterable 894000000
mar 16, 2015 7:34:05 PM org.wikibrain.loader.SqlLinksLoader processOneLink
INFORMAZIONI: Processed link 894000000, found 541011312 interesting and 335216479 new
mar 16, 2015 7:34:05 PM org.wikibrain.loader.SqlLinksLoader processOneLink
INFORMAZIONI: Processed link 894100000, found 541011312 interesting and 335216479 new
now are the 08:15 PM and it's still stopped at that point with java process using just the 0.7-2.0% of CPU ( 37.0% memory) instead of about 390% (I have 4 core cpu with 12 GB of RAM) like until rencently. RIght now, I have in my directory 98.8 GB of the stimated 152 GB requested. What could be happened? It's normal that it ishandling? Can I stop and restart the process from this point or, if I stop the process, I'll lost all the data computed right now and reboot all the process from the beginning?
So, if this, how can I avoid having again this situation?
This is the configuration I launched with gui:
java memory: 10 GB
language: en
data source H2
selected phases:
basic data
lucene
phrases
concepts
wikidata
semantic relatedness
this is the initial diagnostic:
* ALL DIAGNOSTIC TESTS SUCCEEDED! **
*************************************
Rough estimate of download size: 25620,0 MBs
This may be an over-estimate if some files have already been downloaded.
Time on dial-up (50kbs): 85400,0 minutes
Time on Broadband (1Mbs): 4270,0 minutes
Time on Broadband (10Mbs): 427,0 minutes
Time on Broadband (100Mbs): 42,7 minutes
stage download will download about 22080,0 about MBs
stage concepts will download about 660,0 about MBs
stage wikidata will download about 2880,0 about MBs
Completion time estimate: 1792,3 minutes (NOT including download time)
stage fetchlinks: 0,0 minutes
stage download: 0,0 minutes
stage dumploader: 137,9 minutes
stage redirects: 7,1 minutes
stage wikitext: 1004,4 minutes
stage lucene: 370,6 minutes
stage phrases: 77,6 minutes
stage concepts: 41,7 minutes
stage wikidata: 129,2 minutes
stage sr: 23,8 minutes
Disk space is okay. (need 152,780 GBs, have 172,938 GBs)
Warning: Available disk space may be INACCURATE if you have multiple drives.
stage fetchlinks: 1,2 MBs
stage download: 22080,0 MBs
stage dumploader: 31542,9 MBs
stage redirects: 1577,1 MBs
stage wikitext: 45000,0 MBs
stage lucene: 39428,6 MBs
stage phrases: 9000,0 MBs
stage concepts: 1577,1 MBs
stage wikidata: 6000,0 MBs
stage sr: 240,0 MBs
Amount of memory allocated for the JVM is okay
memory required: 8,0GB
memory allocated: 9,5GB
Connection to database succeeded. Active configuration:
username: "sa"
partitions: "default"
password: ""
connectionsPerPartition: 2
url: "jdbc:h2:./db/h2;LOG=0;CACHE_SIZE=65536;LOCK_MODE=0;UNDO_LOG=0;MAX_OPERATION_MEMORY=100000000"
driver: "org.h2.Driver"
thank you
best regards!