Unable to open H2O Flow in Databricks

93 views
Skip to first unread message

Ethan Zou

unread,
May 17, 2016, 6:39:51 PM5/17/16
to H2O Open Source Scalable Machine Learning - h2ostream
Hi,

I am using Databricks community edition and I have installed Sparking Water on my spark cluster. Based on online instructions, I would like to access H2O's web UI called H2O Flow. But I couldn't open Flow through command "openFlow". Can anyone please help? Attached please find my screenshot.

Thanks,
Ethan


Capture.JPG

Lauren DiPerna

unread,
May 17, 2016, 7:20:21 PM5/17/16
to Ethan Zou, H2O Open Source Scalable Machine Learning - h2ostream
Hi Ethan,

Were you able to open Flow by entering the url given in the output you attached (where it says "Open H2O Flow in browser: http://10.172.243.109:54321 ') into your browser?

cheers,

Lauren

--
You received this message because you are subscribed to the Google Groups "H2O Open Source Scalable Machine Learning - h2ostream" group.
To unsubscribe from this group and stop receiving emails from it, send an email to h2ostream+...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Ethan Zou

unread,
May 17, 2016, 7:23:07 PM5/17/16
to Lauren DiPerna, H2O Open Source Scalable Machine Learning - h2ostream

Michal Malohlava

unread,
May 18, 2016, 1:45:19 PM5/18/16
to h2os...@googlegroups.com
Hi Ethan,

there is a problem that Databricks instnaces are living in private network, so you need to expose H2O's instance port 54321 to outside world.
That means you need to somehow make instance which is hosting H2OContext (the same machine which is hosting Spark COntext) visible on public IP.

SO there are two solutions:
 1) One is described in Databricks documentation: assign elastic IP with a private instance which hosts SparkContext. Look for "Assign elastic IP" notebook in your Databricks account

OR

 2) Login into your AWS account, create an EC2 instance and make it colocated with created Databricks ec2 machines (you have to select the same private network), assign a public EIP with it, and run proxy on it which will be connected to a DBC instance which hosts SparkContext.

Please let us know if it works for you!
michal

Ethan Zou

unread,
May 20, 2016, 2:01:56 PM5/20/16
to mic...@h2oai.com, H2O Open Source Scalable Machine Learning - h2ostream
Hi Michal,

Thank you for your prompt reply. I tried the first solution but got an error at the very last step. Could you please take a look at my program? I understand it may be a question for Databricks, but they haven't replied to my questions so far. Any suggestion would be much appreciated. 

Followed the "AWS Configuration / Elastic IPAddress" from Databricks, I created an Elastic IP on my own AWS account, provisioned an IAMPolicy and attached to my IAM user. 
gathered ACCESS_KEY, SECRET_KEY, and EIP_ADDRESS. Then I follow the notebook step by step, until the last line. I got an error says Instance not found, but I do see the instance exist. Any suggestions?

Inline images 1

Inline images 3

Inline images 4

Thanks,
Ethan

--
You received this message because you are subscribed to a topic in the Google Groups "H2O Open Source Scalable Machine Learning - h2ostream" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/h2ostream/IqcAb9xZLFg/unsubscribe.
To unsubscribe from this group and all its topics, send an email to h2ostream+...@googlegroups.com.

rich...@gmail.com

unread,
Jul 7, 2016, 4:59:48 AM7/7/16
to H2O Open Source Scalable Machine Learning - h2ostream, mic...@h2oai.com
Hi

Same problem for me I've success to assign EIP to my databricks cluster as described in the guide but H2o context give the same private IP

I've check security group and it's open for the port 54321
Do you have any idea ?

Reply all
Reply to author
Forward
0 new messages