ArangoML Pipeline Cloud – Managed Machine Learning Metadata Service

January 30, 2020/General

Estimated reading time: 4 minutes

We all know how crucial training data for data scientists is to build quality machine learning models. But when productionizing Machine Learning, Metadata is equally important.

Consider for example:

Capture of Lineage Information (e.g., Which dataset influences which Model?)
Capture of Audit Information (e.g, A given model was trained two months ago with the following training/validation performance)
Reproducible Model Training
Model Serving Policy (e.g., Which model should be deployed in production based on training statistics)

If you would like to see a live demo of ArangoML Pipeline Cloud, join our Head of Engineering and Machine Learning, Jörg Schad, on February 13, 2020 – 10am PT/ 1pm ET/ 7pm CET for a live webinar.

This is the reason we built ArangoML Pipeline, a flexible Metadata store which can be used with your existing ML Pipeline. ArangoML Pipeline can be used as a simple extension of existing ML pipelines through simple python/HTTP APIs.

Check out this page for further details on the challenge of Metadata in Machine Learning and ArangoML Pipeline.

ArangoML Pipeline Cloud

Today we are happy to announce a first version of Managed ML Metadata. Now you can start using ArangoML Pipeline without having to even start a separate docker container.

Additionally, as a cloud-based service based on ArangoDB’s managed cloud service ArangoGraph, it can be up & running in just a few clicks and in the Free-to-Try tier even without a lengthy registration.

ArangoML Pipeline Cloud

If you already have an existing notebook for your Machine Learning project it is as simple as adding the ArangoML Pipeline configuration pointing to our Free-to-Try tier `arangoml.arangodb.cloud` and a dedicated environment (aka ArangoDB database with custom login credentials) will be generated for you and persisted in the config.

Connecting to Arangopipe Using ArangoGraph

ArangoML Pipeline Cloud – Manage Machine Learning Metadata

SLAs

ArangoML Pipeline Cloud currently comes with two different service levels:

Free-to-Try
The Free-to-Try tier allows for a no-hassle setup as it automatically configures your own environment based on a simple API call shown above and is ideas to test ArangoML Pipeline Cloud, but comes with no guarantees for your production data.
Production
If you are considering to use ArangoML Pipeline Cloud for production setup this is
- Own ArangoGraph cluster with all of ArangoGraph Enterprise features
- Regular Backup
- It comes with a free 14-day trial period and afterwards follows the ArangoGraph pricing model

Please reach out to arangoml@arangodb.cloud for sign-up and details.

How to get started

To show how easy it is to get started with ArangoML Pipeline Cloud in your existing ML pipeline we have a notebook with a modified TensorFlow Tutorial example with no setup or signup required!

If you are already using ArangoML Pipeline and just want to check how to migrate to ArangoML Pipeline Cloud we suggest to take a look at the minimal minimal example notebook.

While these notebook are mostly focused on the storing of metadata, we have a number of exciting notebooks with use-cases of how to further leverage and analyze metadata including for example datashift analysis.

Learn more:

Learn more by checking out our example notebook on Google Colab
Checkout the examples directory in our open source repository.
Find here a tutorial notebook to get started with ArangoML Pipeline
Learn more about using Arangopipe with common components of a machine learning stack like Tensorflow, hyperopt and pytorch
Learn more about ArangoML Pipeline: Visit the blog
To join a webinar for a live demo of how ArangoML Pipeline Cloud works: Register here

Continue Reading

InfoCamere investigated graph databases and chose ArangoDB

Performance analysis with pyArango: Part III Measuring possible capacity with usage Scenarios

Milestone 2 ArangoDB 3.3 – New Data Replication Engine and Hot Standby

Joerg Schad

Jörg Schad is our CTO. In a previous life, he worked on Machine Learning Infrastructure in health care, distributed systems at Mesosphere, implemented distributed and in-memory databases, and conducted research in the Hadoop and Cloud area. He’s a frequent speaker at meetups, international conferences, and lecture halls.

January 30, 2020,Joerg Schad

Download Now
ArangoDB Enterprise

ArangoML Pipeline Cloud – Managed Machine Learning Metadata Service

ArangoML Pipeline Cloud

SLAs

How to get started

Continue Reading

Joerg Schad

Leave a Comment Cancel Reply

Tags

Quick Links

Info

About Us

Stay In Touch

Download NowArangoDB Enterprise

ArangoML Pipeline Cloud – Managed Machine Learning Metadata Service

ArangoML Pipeline Cloud

SLAs

How to get started

Continue Reading

Joerg Schad

Leave a Comment Cancel Reply

Tags

Quick Links

Info

About Us

Stay In Touch

Download Now
ArangoDB Enterprise