Improving Image Retrieval with User Feedback

2021-02-08T12:05:31+00:00

A problem occurs when an image retrieval method delivers irrelevant results. This post shows how user interaction can be utilized to overcome this problem.

Content based image retrieval is a field in computer vision. The aim is to find the most similar images to a given input image, where the similarity refers to the sem

Improving Image Retrieval with User Feedback2021-02-08T12:05:31+00:00

End-to-End Image Captioning

2021-02-10T08:54:02+00:00

We had the unique opportunity to develop an image captioning system combining computer vision and NLP from a prototype model to a fully scalable data product with a team of five interdisciplinary students from the TUM Data Innovation Lab during a period of six months as part of an educational research experience.

tl;dr Data Science, Machine Learning Engineering, Software Engineering, and IT-Operations know-how is required to turn a prototypical machine-learning model into an end-

End-to-End Image Captioning2021-02-10T08:54:02+00:00

Machine Learning on the Edge for Parking Guidance Systems

2021-02-10T09:14:02+00:00

Machine Learning on the Edge becomes more and more important for Smart Cities. We investigate how Deep Learning models can be optimized and deployed on edge devices for smart parking guidance systems.

This blog post investigates how deep learning models can be optimized and deployed on edge devices for parking guidance systems. I will present two different approach

Machine Learning on the Edge for Parking Guidance Systems2021-02-10T09:14:02+00:00

Digitize your Receipts using Computer Vision

2021-02-10T09:16:04+00:00

In this article I describe the steps and approaches to image recognition for receipt digitalization using computer vision. This is the basic functionality behind apps such as Google Lens, Evernote, PaperScan and taggun.io.

“Would you like the receipt?”—It’s hard to say no to that. Not because you actually want it (you may even throw it in the trash before exiting the store), but because

Digitize your Receipts using Computer Vision2021-02-10T09:16:04+00:00

Text Spotting using semi-supervised Generative Adversarial Networks

2021-02-10T09:01:48+00:00

We built a text spotting (OCR) pipeline that out-performed Google Cloud Vision using semi-supervised Generative Adversarial Networks.

Despite all advances in machine learning due to the advent of deep learning, the latter has one major shortcoming: It requires a lot of data during the learning proce

Text Spotting using semi-supervised Generative Adversarial Networks2021-02-10T09:01:48+00:00