You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to mlflow-users
Hey Guys,
First of all, great job with this effort. It's certainly something a lot of people are waiting for (or have tried to create themselves).
I was wondering about the scalability of the tracking server. I see in the code there is an abstraction of for the tracking Store, which is currently a FileStore if I'm correct. What are the plans to support other stores for this (ElasticSearch, Kafka, S3, ...?)
Cheers,
D.
Matei Zaharia
unread,
Jun 23, 2018, 7:06:44 PM6/23/18
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to daan....@gmail.com, mlflow...@googlegroups.com
Hi Daan,
We do intend to add other ones. There are actually two elements here,
the metadata store and the artifact store (which can contain large
files uploaded by the job). For the metadata part we'll probably add a
database option, and for the artifacts we'll support cloud storage
systems.
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to mlflow-users
Hi,
Is there issues in github for these two stores? would be great to understand plans & participate
Thanks!
Matei Zaharia
unread,
Nov 8, 2018, 6:36:23 PM11/8/18
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Steve Casey, mlflow-users
We’ve already received pull requests for a few artifact store backends (Google Cloud Storage, Azure Storage, SFTP, and others). If you’d like to work on another one, or if you’d like to work on a database store for metadata, that would be awesome. There is an open pull request for a DynamoDB metadata store but we’d prefer to use something like SQLAlchemy that can work with a variety of backend databases if possible.
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Steve Casey, mlflow-users
BTW I’ll also add that the MLflow team at Databricks will probably implement this at some point if we don’t receive an external patch, but it might be a bit further down the line since we also have requests about the UI, model scoring, etc right now. In any case though we’re happy to provide feedback to anyone interested in it. The metadata store has a clearly separated API already and it shouldn’t be a huge amount of work to make a new one, though some care might need to be taken to make sure we can support database migrations, etc.