ArangoML Pipeline Cloud – Managed Machine Learning Metadata Service

We all know how crucial training data is for data scientists building quality machine learning models. But when productionizing machine learning, metadata is equally important.

Consider, for example:

  • Capture of Lineage Information (e.g., Which dataset influences which Model?)
  • Capture of Audit Information (e.g., A given model was trained two months ago with the following training/validation performance)
  • Reproducible Model Training
  • Model Serving Policy (e.g., Which model should be deployed in production based on training statistics)
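To make these four kinds of metadata concrete, here is a minimal in-memory sketch. It does not use ArangoML Pipeline; the `ModelRun` record, its field names, and the sample values are all hypothetical and only illustrate what lineage, audit, reproducibility, and serving-policy information could look like.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class ModelRun:
    """One training run recorded as metadata (all field names are hypothetical)."""
    model_name: str
    dataset_name: str      # lineage: which dataset influenced this model
    trained_on: date       # audit: when the model was trained
    train_accuracy: float  # audit: training performance
    val_accuracy: float    # audit: validation performance
    random_seed: int       # reproducibility: part of what is needed to rerun training


def models_influenced_by(runs: List[ModelRun], dataset_name: str) -> List[str]:
    """Lineage query: names of models trained on the given dataset."""
    return sorted({r.model_name for r in runs if r.dataset_name == dataset_name})


def pick_model_to_serve(runs: List[ModelRun], min_val_accuracy: float) -> Optional[ModelRun]:
    """Serving policy: highest validation accuracy, if it clears a threshold."""
    eligible = [r for r in runs if r.val_accuracy >= min_val_accuracy]
    return max(eligible, key=lambda r: r.val_accuracy, default=None)


# Made-up sample data for illustration only.
runs = [
    ModelRun("churn-v1", "customers-2019-11", date(2019, 12, 1), 0.91, 0.84, 42),
    ModelRun("churn-v2", "customers-2019-12", date(2020, 1, 15), 0.93, 0.88, 42),
    ModelRun("upsell-v1", "customers-2019-12", date(2020, 1, 20), 0.89, 0.80, 7),
]

print(models_influenced_by(runs, "customers-2019-12"))
best = pick_model_to_serve(runs, min_val_accuracy=0.85)
print(best.model_name if best else "no model qualifies")
```

A dedicated metadata store serves the same purpose, but keeps these records outside the training code so that other pipelines and people can query them.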

If you would like to see a live demo of ArangoML Pipeline Cloud, join our Head of Engineering and Machine Learning, Jörg Schad, on February 13, 2020 – 10am PT / 1pm ET / 7pm CET for a live webinar.


The need for this kind of metadata is the reason we built ArangoML Pipeline, a flexible metadata store that can be used with your existing ML pipeline. It plugs into existing ML pipelines as an extension through simple Python/HTTP APIs.

Check out this page for further details on the challenge of Metadata in Machine Learning and ArangoML Pipeline.

ArangoML Pipeline Cloud

Today we are happy to announce a first version of Managed ML Metadata. Now you can start using ArangoML Pipeline without even having to start a separate Docker container.

Additionally, as a cloud service built on ArangoDB’s managed cloud service ArangoGraph, it can be up and running in just a few clicks, and in the Free-to-Try tier even without a lengthy registration.

If you already have a notebook for your machine learning project, getting started is as simple as adding the ArangoML Pipeline configuration pointing to our Free-to-Try tier `arangoml.arangodb.cloud`. A dedicated environment (that is, an ArangoDB database with custom login credentials) will be generated for you and persisted in the config.
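The "generate once, then persist in the config" behaviour can be sketched roughly as follows. This is not the ArangoML Pipeline client API: `provision_stub`, `load_or_create_config`, the JSON file name, and the config keys are hypothetical, and the stub invents credentials locally where a real client would receive them from the service. Only the host name is taken from the text above.

```python
import json
import secrets
import tempfile
from pathlib import Path

FREE_TIER_HOST = "arangoml.arangodb.cloud"  # Free-to-Try host named in the post


def provision_stub(host: str) -> dict:
    """Stand-in for the managed service (hypothetical).

    Returns made-up environment details; a real client would obtain the
    database name and credentials from the service instead.
    """
    suffix = secrets.token_hex(4)
    return {
        "host": host,
        "database": f"ml_db_{suffix}",
        "username": f"ml_user_{suffix}",
        "password": secrets.token_urlsafe(16),
    }


def load_or_create_config(path: Path, host: str = FREE_TIER_HOST) -> dict:
    """Reuse the persisted environment if present, otherwise create and save one."""
    if path.exists():
        return json.loads(path.read_text())
    config = provision_stub(host)
    path.write_text(json.dumps(config, indent=2))
    return config


config_path = Path(tempfile.mkdtemp()) / "arangoml_config.json"
first = load_or_create_config(config_path)
second = load_or_create_config(config_path)  # second call reads the saved file
print(first["host"], first == second)
```

Persisting the generated environment means that rerunning the notebook reconnects to the same database rather than creating a new one each time.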

SLAs

ArangoML Pipeline Cloud currently comes with two different service levels:

  • Free-to-Try
    The Free-to-Try tier allows for a no-hassle setup: it automatically configures your own environment based on a simple API call and is ideal for testing ArangoML Pipeline Cloud, but it comes with no guarantees for your production data.
  • Production
    If you are considering using ArangoML Pipeline Cloud for a production setup this is

Please reach out to arangoml@arangodb.cloud for sign-up and details.

How to get started

To show how easy it is to get started with ArangoML Pipeline Cloud in your existing ML pipeline, we have a notebook with a modified TensorFlow tutorial example, with no setup or signup required!

If you are already using ArangoML Pipeline and just want to see how to migrate to ArangoML Pipeline Cloud, we suggest taking a look at the minimal example notebook.

While these notebooks are mostly focused on storing metadata, we also have a number of notebooks with use cases showing how to further leverage and analyze metadata, including, for example, data shift analysis.


Joerg Schad

Jörg Schad is our CTO. In a previous life, he worked on Machine Learning Infrastructure in health care, distributed systems at Mesosphere, implemented distributed and in-memory databases, and conducted research in the Hadoop and Cloud area. He’s a frequent speaker at meetups, international conferences, and lecture halls.
