UR and postgres

63 views

Skip to first unread message

Aaron Mangum

unread,

Jun 27, 2016, 3:02:55 PM6/27/16

to actionml-user

Hi, I'm trying to set up the universal recommender with postgres.

These are the relevant parts of my pio-env.sh:

POSTGRES_JDBC_DRIVER=$PIO_HOME/lib/postgresql-9.4.1208.jar

# ES_CONF_DIR: You must configure this if you have advanced configuration for

# your Elasticsearch setup.

# ES_CONF_DIR=/opt/elasticsearch

# Filesystem paths where PredictionIO uses as block storage.

PIO_FS_BASEDIR=$HOME/.pio_store

PIO_FS_ENGINESDIR=$PIO_FS_BASEDIR/engines

PIO_FS_TMPDIR=$PIO_FS_BASEDIR/tmp

# PredictionIO Storage Configuration

# This section controls programs that make use of PredictionIO's built-in

# storage facilities. Default values are shown below.

# For more information on storage configuration please refer to

# https://docs.prediction.io/system/anotherdatastore/

# Storage Repositories

# Default is to use PostgreSQL

PIO_STORAGE_REPOSITORIES_METADATA_NAME=pio_meta

PIO_STORAGE_REPOSITORIES_METADATA_SOURCE=ELASTICSEARCH

PIO_STORAGE_REPOSITORIES_EVENTDATA_NAME=pio_event

PIO_STORAGE_REPOSITORIES_EVENTDATA_SOURCE=PGSQL

PIO_STORAGE_REPOSITORIES_MODELDATA_NAME=pio_model

PIO_STORAGE_REPOSITORIES_MODELDATA_SOURCE=LOCALFS

# Storage Data Sources

# PostgreSQL Default Settings

# Please change "pio" to your database name in PIO_STORAGE_SOURCES_PGSQL_URL

# Please change PIO_STORAGE_SOURCES_PGSQL_USERNAME and

# PIO_STORAGE_SOURCES_PGSQL_PASSWORD accordingly

PIO_STORAGE_SOURCES_PGSQL_TYPE=jdbc

But, I get the following error when running pio train.

[WARN] [TaskSetManager] Lost task 0.0 in stage 6.0 (TID 21, localhost): java.sql.BatchUpdateException: Batch entry 0 INSERT INTO pio_event_1 (id,event,entityType,entityId,targetEntityType,targetEntityId,properties,eventTime,eventTimeZone,tags,prId,creationTime,creationTimeZone) VALUES ('7067bbb81ba84086aeb2d70a8c572f04','view','user','53f208fd302a12e8e5000017','item','gonsie',NULL,'2015-3-6 13:52:33.032000 +0:0:0','UTC',NULL,NULL,'2015-3-6 13:52:33.032000 +0:0:0','UTC') was aborted. Call getNextException to see the cause.

at org.postgresql.jdbc.BatchResultHandler.handleError(BatchResultHandler.java:136)

at org.postgresql.core.v3.QueryExecutorImpl$1.handleError(QueryExecutorImpl.java:419)

at org.postgresql.core.v3.QueryExecutorImpl$ErrorTrackingResultHandler.handleError(QueryExecutorImpl.java:308)

at org.postgresql.core.v3.QueryExecutorImpl.processResults(QueryExecutorImpl.java:2004)

at org.postgresql.core.v3.QueryExecutorImpl.flushIfDeadlockRisk(QueryExecutorImpl.java:1187)

at org.postgresql.core.v3.QueryExecutorImpl.sendQuery(QueryExecutorImpl.java:1212)

at org.postgresql.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:351)

at org.postgresql.jdbc.PgStatement.executeBatch(PgStatement.java:1019)

at org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils$.savePartition(JdbcUtils.scala:215)

at org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils$$anonfun$saveTable$1.apply(JdbcUtils.scala:277)

at org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils$$anonfun$saveTable$1.apply(JdbcUtils.scala:276)

at org.apache.spark.rdd.RDD$$anonfun$foreachPartition$1$$anonfun$apply$33.apply(RDD.scala:920)

at org.apache.spark.SparkContext$$anonfun$runJob$5.apply(SparkContext.scala:1858)

at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:66)

at org.apache.spark.scheduler.Task.run(Task.scala:89)

at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:214)

at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)

at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)

at java.lang.Thread.run(Thread.java:745)

[ERROR] [Executor] Exception in task 1.0 in stage 6.0 (TID 22)

[ERROR] [TaskSetManager] Task 0 in stage 6.0 failed 1 times; aborting job

[WARN] [TaskSetManager] Lost task 1.0 in stage 6.0 (TID 22, localhost): java.sql.BatchUpdateException: Batch entry 0 INSERT INTO pio_event_1 (id,event,entityType,entityId,targetEntityType,targetEntityId,properties,eventTime,eventTimeZone,tags,prId,creationTime,creationTimeZone) VALUES ('16ec8e93eca74b51b5c6fb13ef069f93','view','user','54e3ce1174756e4b0ef60c00','item','stackoverflow_1636586',NULL,'2015-3-6 17:5:31.709000 +0:0:0','UTC',NULL,NULL,'2015-3-6 17:5:31.709000 +0:0:0','UTC') was aborted. Call getNextException to see the cause.