Problem in importing data using executable Wikibrain jar file

59 views
Skip to first unread message

Shreya

unread,
Jun 21, 2016, 6:20:39 AM6/21/16
to wikibrain
Hi all,
       I am working on the research project where I am using Wikibrain.

My work is to do on Full English articles.So I followed all the steps:

1.    Installing WikiBrain

2.    Installing PostgreSQL and PostGIS

3.     Configuration : I have created external override configuration file (i.e. override.conf) and made the specified changes that is required.

4.      While Importing the data I am running the executable jar file, with “spatial” box checked.

       Also created a new database(named “wiki”) in PostgreSQL and made changes in the configuration file corresponding to PostGIS settings.


Getting problem while connecting to spatial database: 


The output :

running:

org.wikibrain.Loader org.wikibrain.Loader -l en -s fetchlinks -s download -s dumploader -s redirects -s wikitext -s lucene -s phrases -s wikidata -s spatial -s sr -c customized.conf

18:20:27.740 [main] INFO  org.wikibrain.core.cmd.Env - Configured default logging at the Info Level

18:20:27.757 [main] INFO  org.wikibrain.core.cmd.Env - To customize log4j2 set the 'log4j.configurationFile' system property or set EnvBuilder.setReconfigureLogging to false.

18:20:44.618 [main] INFO  org.wikibrain.conf.Configurator - configurator installed 75 providers for 38 classes

18:20:44.629 [main] INFO  org.wikibrain.core.cmd.Env - using override configuration files [customized.conf]

18:20:44.633 [main] INFO  org.wikibrain.core.cmd.Env - using baseDir E:\wikibrain-withdeps-0.7.4\.

18:20:44.635 [main] INFO  org.wikibrain.core.cmd.Env - using max vm heapsize of 7918MB

18:20:44.642 [main] INFO  org.wikibrain.core.cmd.Env - using languages (EN)

18:20:44.644 [main] INFO  org.wikibrain.core.cmd.Env - using maxThreads 1

18:20:44.651 [main] INFO  org.wikibrain.core.cmd.Env - using tmpDir .\.tmp

18:20:48.591 [main] INFO  org.wikibrain.loader.pipeline.PipelineLoader - Beginning dry run

18:20:49.509 [main] INFO  org.wikibrain.spatial.dao.postgis.PostGISVersionChecker - checking for postgis extension

18:20:49.574 [main] INFO  org.wikibrain.loader.pipeline.PipelineLoader - Ended dry run

***********************************

** SOME DIAGNOSTIC TESTS FAILED! **

***********************************

DIAGNOSTICS THAT FAILED:

=======================

 

Connection to spatial database FAILED! Active configuration:

                schema: "public"

                passwd: ""

                dbtype: "postgis"

                host: "localhost"

                user: "toby"

                database: "wikibrain_spatial"

                "max connections": 19

                port: 5432


DIAGNOSTICS THAT SUCCEEDED:

===========================

Rough estimate of download size: 25680.0 MBs

                This may be an over-estimate if some files have already been downloaded.

                Time on dial-up (50kbs): 85600.0 minutes

                Time on Broadband (1Mbs): 4280.0 minutes

                Time on Broadband (10Mbs): 428.0 minutes

                Time on Broadband (100Mbs): 42.8 minutes

                stage download will download about 22080.0 about MBs

                stage concepts will download about 660.0 about MBs

                stage wikidata will download about 2880.0 about MBs

                stage spatial will download about 60.0 about MBs

 

Completion time estimate: 1746.8 minutes (NOT including download time)

                stage fetchlinks: 0.0 minutes

                stage download: 0.0 minutes

                stage dumploader: 139.2 minutes

                stage redirects: 2.8 minutes

                stage wikitext: 948.5 minutes

                stage lucene: 429.3 minutes

                stage phrases: 30.1 minutes

                stage concepts: 19.9 minutes

                stage wikidata: 146.8 minutes

                stage spatial: 8.1 minutes

                stage sr: 22.1 minutes

 

Disk space is okay. (need 167.429 GBs, have 499.362 GBs)

                Warning: Available disk space may be INACCURATE if you have multiple drives.

                stage fetchlinks: 1.2 MBs

                stage download: 22080.0 MBs

                stage dumploader: 31542.9 MBs

                stage redirects: 1577.1 MBs

                stage wikitext: 45000.0 MBs

                stage lucene: 39428.6 MBs

                stage phrases: 9000.0 MBs

                stage concepts: 1577.1 MBs

                stage wikidata: 6000.0 MBs

                stage spatial: 240.0 MBs

                stage sr: 15000.0 MBs

 

Amount of memory allocated for the JVM is okay

                memory required: 8.0GB

                memory allocated: 8.3GB

 

Connection to database succeeded. Active configuration:

                partitions: "default"

                password: "wiki"

                url: "jdbc:postgresql://localhost/wikidata"

                connectionsPerPartition: 2

                username: "wiki"

                driver: "org.postgresql.Driver"

 

Diagnostics failed. Aborting execution.

LOADING FAILED!


Can anyone spot what is the problem ?Where the changes should be made?


Thanks

111.png

Shilad Sen

unread,
Jun 21, 2016, 11:25:27 AM6/21/16
to wikibrain
Thanks for using WikiBrain, and sorry for the delay! I just released a new version that may fix the problem. Would you give it a try?

Shreya

unread,
Jun 30, 2016, 12:44:21 PM6/30/16
to wikibrain
Thanks Sir.
I imported the given project in IntelliJ it was giving error that different packages does not exists.
Then as required converted the .class file into java files and tried to load in IntelliJ, but many of the java files are giving error.
I can not find out the way.

Can anybody will help me out of this.What steps should I follow for importing the data using newly released version.
As I am new using this IDE IntelliJ ,cant find out the way to solve.
error.png
Reply all
Reply to author
Forward
0 new messages