We’re on it! We will reach out to email@company.com to schedule your demo. So we can prepare for the call, please provide a little more information.
We’re committed to your privacy. Tamr uses the information you provide to contact you about our relevant content, products, and services. For more information, read our privacy policy.
Tamr Insights
Tamr Insights
AI-native MDM
SHARE
Updated
May 23, 2019
| Published

Tamr's Classification Engine Gets Smarter with Active Learning

Tamr Insights
Tamr Insights
AI-native MDM
Tamr's Classification Engine Gets Smarter with Active Learning

Tamr’s human-guided machine learning platform has an exciting new feature, active learning for categorization, that will increase the accuracy and efficiency of categorization projects by highlighting high impact entities to categorize. A categorization project solves the task of placing records into categories. It is a top-down organizational project designed to classify individual records into a collection of hierarchical categories, referred to as a taxonomy.

Human labeling and categorization is a cumbersome problem which requires a huge amount of time and compromises the accuracy of the output due to human error. A huge Tamr advantage with categorization is the ease of multi-user collaboration and the benefit that comes from humans training the machine learning models. As part of the workflow of a categorization project, users are able to collaborate and iterate on the categorization or taxonomy.

Since the human users are the subject matter experts when it comes to the input data, Tamr requires a percentage of the entities to be categorized by humans in order to teach Tamr how to categorize the remaining data or subsequent data added. Active learning eliminates the need for the user(s) to provide a balanced amount of training examples for each category in the dataset; it removes the uncertainty/iterations on how much training the model needs and which category needs it to accelerate model training.

The screenshot shown below shows the menu of possible filters you can apply on the categorized data within Tamr. Users can collaborate and accelerate model training by using a combination of filters.

image

Let’s focus on the filter names “high impact”. When this filter is selected, Tamr produces simple high-impact questions regarding whether or not certain records, that are representative of a large portion of the unified dataset records, are categorized appropriately. Reviewer(s) then give their feedback–driving accuracy and enhancing future automation. High impact entities are denoted with the lightning bolt symbol. A screenshot of what the UI looks like once the “high impact” filter is selected is shown below.

image

This active learning for categorization feature which highlights high impact entities is game-changing for categorization projects. This allows the user(s) to easily see what data needs their attention and automatically balance out the training by selecting strong representations of the data to categorize. The time savings and increased confidence in the machine learning model is invaluable.

Schedule a demo to learn more about Tamr’s products.

Get a free, no-obligation 30-minute demo of Tamr.

Discover how our AI-native MDM solution can help you master your data with ease!

Thank you! Your submission has been received!
For more information, please view our Privacy Policy.
Oops! Something went wrong while submitting the form.