=========================
q05 Step 2/3:
logistic regression with spark-mllib with direct metastore access
=========================
spark-submit --class io.bigdatabenchmark.v1.queries.q05.LogisticRegression /home/biadmin/TPCx-BB_v1.0.1/engines/hive/queries/Resources/bigbench-ml-spark.jar --fromHiveMetastore true -i bigbenchORC100s.q05_spark_sql_run_query_0_temp -o /user/biadmin/benchmarks/bigbench/queryResults/q05_spark_sql_run_query_0_result// --type LBFGS --step-size 1 --iterations 20 --lambda 0 --numClasses 2 --convergenceTol 1e-5 --numCorrections 10 --saveClassificationResult false --saveMetaInfo true --verbose false
Run LogisticRegression with options: Map('csvInputDelimiter -> ,, 'fromHiveMetastore -> true, 'verbose -> false, 'saveMetaInfo -> true, 'lambda -> 0, 'numCorrections -> 10, 'stepsize -> 1, 'convergenceTol -> 1e-5, 'iter -> 20, 'output -> /user/biadmin/benchmarks/bigbench/queryResults/q05_spark_sql_run_query_0_result//, 'type -> LBFGS, 'saveClassificationResult -> false, 'input -> bigbenchORC100s.q05_spark_sql_run_query_0_temp, 'numClasses -> 2)
16/03/14 14:41:29 INFO slf4j.Slf4jLogger: Slf4jLogger started
16/03/14 14:41:29 INFO Remoting: Starting remoting
16/03/14 14:41:29 INFO Remoting: Remoting started; listening on addresses :[akka.tcp://spark...@9.30.4.94:62546]
loading data from metastore table: "bigbenchORC100s.q05_spark_sql_run_query_0_temp" ...
16/03/14 14:41:31 INFO hive.metastore: Trying to connect to metastore with URI thrift://luwperf5.svl.ibm.com:9083
16/03/14 14:41:31 INFO hive.metastore: Connected to metastore.
16/03/14 14:41:31 INFO hive.metastore: Trying to connect to metastore with URI thrift://luwperf5.svl.ibm.com:9083
16/03/14 14:41:31 INFO hive.metastore: Connected to metastore.
[Stage 0:> (0 + 88) / 200]
[Stage 0:> (1 + 99) / 200]
[Stage 0:=========================================> (152 + 48) / 200]
[Stage 0:==============================================> (174 + 26) / 200]
average: [188.6796294836899]
Training Model
[Stage 3:> (1 + 0) / 200]
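
The progress bars in the log above can be hard to compare by eye. As a rough aid, here is a minimal Python sketch that extracts the task counts from them. It assumes the usual Spark console layout, where the numbers read (completed + running) / total tasks for the stage. The names `PROGRESS` and `parse_progress` are made up for this illustration and are not part of Spark or TPCx-BB.

```python
import re

# Matches Spark console progress bars such as "[Stage 0:=====> (152 + 48) / 200]".
# Assumption: the numbers mean (completed + running) / total tasks in the stage.
PROGRESS = re.compile(r"\[Stage (\d+):[=>\s]*\((\d+) \+ (\d+)\) / (\d+)\]")


def parse_progress(text):
    """Return one (stage, completed, running, total) tuple per progress bar."""
    return [tuple(int(g) for g in m.groups()) for m in PROGRESS.finditer(text)]


if __name__ == "__main__":
    # Sample bars copied from the log; "\r" stands in for the ^M carriage returns.
    log = (
        "[Stage 0:> (0 + 88) / 200]\r"
        "[Stage 0:=====> (174 + 26) / 200]\r"
        "[Stage 3:> (1 + 0) / 200]"
    )
    for stage, done, running, total in parse_progress(log):
        print(f"stage {stage}: {done}/{total} done, {running} running")
```

Read this way, the last line of the log would mean Stage 3 has 1 of 200 tasks completed and none running at the moment the output was captured.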