Frameworks for Machine Learning Model Management

2019-04-04T10:02:24+00:00

This blog post compares three tools developed to support reproducible machine learning model development: MLflow, developed by Databricks (the company behind Apache Spark); DVC, a software product of the London-based startup iterative.ai; and Sacred, an academic project developed by different researchers.

In my previous blog post "How to Manage Machine Learning Models" I explained the difficulties within the process of developing a good machine learning mod

Machine Learning Interpretability: Do You Know What Your Model Is Doing?

2019-04-02T13:36:57+00:00

Unlike usual performance metrics, fairness, safety and transparency in machine learning models are much harder if not impossible to quantify. Here are some techniques (and examples) to provide interpretability, to make decision systems understandable not only for their creators, but also for their customers and users.

Machine learning has great potential to improve data products and business processes. It is used to propose products and news articles that we might be interested i

Working efficiently with Jupyter Notebooks

2018-11-20T11:31:51+00:00

Having worked in data science for quite some years, I have seen good Jupyter notebooks but also a lot of ugly ones. Follow these best practices to work more efficiently with your notebooks and strike the perfect balance between text, code and visualisations.

If you have ever done something analytical or anything closely related to data science in Python, there is just no way you have not heard of IPython or Jupyter not

From Exploration to Production—Bridging the Deployment Gap for Deep Learning (Part 2)

2019-04-02T13:47:25+00:00

In this blog post on deep learning model exploration, translation, and deployment, we expand on the previous article with two additional approaches for model deployment: TensorFlow Serving and Docker, as well as a rather hobbyist approach in which we build a simple web application that serves our model.

This is the second part of a series of two blog posts on deep learning model exploration, translation, and deployment. Both involve many technologies like PyTorch, Ten

How to Manage Machine Learning Models

2018-12-18T17:06:14+00:00

In the past few months a slew of machine learning management platforms have emerged. In this article we have a look at ModelDB, which supports data scientists by keeping track of models, data sources and parameters. If you use scikit-learn or SparkML, it promises easy integration and offers additional visualisation tools.

Developing a good machine learning model is not straightforward, but rather an iterative process which involves many steps. Mostly Data Scientists start by building

From Exploration to Production — Bridging the Deployment Gap for Deep Learning

2018-10-01T14:49:13+00:00

This article introduces EMNIST. We develop and train models with PyTorch, translate them with the Open Neural Network eXchange format (ONNX), and serve them through GraphPipe. We orchestrate these technologies to solve the task of image classification using the more challenging and less popular EMNIST dataset.

This is the first part of a series of two blog posts on deep learning model exploration, translation, and deployment. Both involve many technologies like PyTorch, Tens

Time Series Forecasting with Machine Learning Models

2018-09-13T11:38:23+00:00

In this article we explain how time series forecasting tasks can be solved with machine learning models, starting with the problem modeling and ending with visualizing the results by embedding the models in a web app for demonstration purposes.

Recently, Machine Learning (ML) models have been widely discussed and successfully applied in time series forecasting tasks (Bontempi et al., 2012). In this blog arti

Multiplicative LSTM for sequence-based Recommenders

2019-04-02T18:00:51+00:00

Traditional user-item recommenders often neglect the dimension of time, finding for each user a latent representation based on the user’s historical item interactions without any notion of recency and sequence of interactions. Sequence-based recommenders such as Multiplicative LSTMs tackle this issue.

Recommender systems support the decision-making processes of customers with personalized suggestions. They are widely used and influence the daily life of almost ever

Managing isolated Environments with PySpark

2018-04-10T13:30:43+00:00

In this article we present a simple solution for managing isolated environments with PySpark that we have been using in production for more than a year.

With the sustained success of the Spark data processing platform, even data scientists with a strong focus on the Python ecosystem can no longer ignore it. Fortunately

Data Science in Production: Packaging, Versioning and Continuous Integration

2018-02-07T14:53:36+00:00

Here's what changes when your data science project grows from a proof of concept. How do you deploy your model, how can updates be rolled out, ...?

A common pattern in most data science projects I participated in is that it’s all fun and games until someone wants to put it into production. From that point in time

Real-time detection of anomalies in computer networks with methods of machine learning: Stop the (data)-thief!

2019-02-15T12:54:16+00:00

This blog post describes some basic concepts and shows a prototypical architecture for network anomaly detection in real-time.

This blog post shows some results and concepts of a master’s thesis here at inovex. It describes some basic concepts and shows a prototypical architecture for detecti
