Amundsen Alternatives – DataHub, Metacat, and Apache Atlas

September 14th, 2022
header image

Share this article


What are some alternatives to Amundsen?

Amundsen is a popular open-source data discovery platform and metadata engine built by the Lyft engineering team.

Introduced and open-sourced in 2019 for adoption outside of Lyft, the platform was initially built to improve the productivity of engineers, data scientists, and analysts at the ride-hailing app.

In addition to a high adoption at Lyft, Amundsen has an open-source community currently spanning ~100 contributors and 39 organizations.

While you are evaluating Amundsen in the process of building an open-source-based data catalog, you might want to consider these alternatives to Amundsen as well.


Is Open Source really free? Estimate the cost of deploying an open-source data catalog 👉 Download Free Calculator


3 open source Amundsen alternatives

  • LinkedIn’s DataHub
  • Netflix’s Metacat
  • Hortonworks’ Apache Atlas

A Guide to Building a Business Case for a Data Catalog

Download free ebook


DataHub

Built by LinkedIn, DataHub is an open-source metadata search and discovery tool.

Open-sourced in 2020, this tool is LinkedIn’s second attempt at solving cataloging and data discovery problems. Its first attempt, WhereHows, was in 2016.

Popular use cases of DataHub include:

  • Automating metadata ingestion from multiple data sources
  • Streamlining data discovery via searching and browsing data assets
  • Enhancing understanding of data with context

Further reading about DataHub as an Amundsen alternative:



Metacat

An open-source metadata management platform, Metacat powers data discovery and metadata interoperability at Netflix.

With a single access layer for data across a diverse mesh of data sources, Metacat simplifies data discovery, cataloging, processing, and management.

Metacat’s best-known capabilities include:

  • Easy data discovery
  • Data change notifications
  • One common abstraction layer
  • Provisions for the user-and business-defined metadata storage

Further reading about Metacat as an Amundsen alternative:


Apache Atlas

Apache Atlas is a popular open-source software used to build catalogs of data assets.

With an active community of committers from Hortonworks, Aetna, Merck, IBM, Target, and more leading companies, the Apache Atlas project expands year by year.

Apache Atlas has the following main capabilities:

  • Visualizing metadata lineage
  • Adding entities to metadata to streamline searches
  • Creating classifications for data

Further reading about Apache Atlas as an Amundsen alternative:


Evaluating open source data catalog tools

One of the crucial steps in enabling a collaborative and efficient data culture in your organization is deploying a data catalog software. Discovering the right one for your organization requires answering many different questions — all at once.

  • Will this platform support all our primary use cases?
  • Will it still work if our data stack changes?
  • Will our users feel comfortable using it?
  • Should we build it or buy it?
  • How do we justify the money we’re asking for?

We know thinking about all of that can be overwhelming. It helps to have a framework to objectively evaluate. Here’s how we could help:


The Ultimate Guide to Evaluating an Enterprise Data Catalog

Download free ebook


Share this article

resource image

Free Guide: Find the Right Data Catalog in 5 Simple Steps.

This step-by-step guide shows how to navigate existing data cataloging solutions in the market. Compare features and capabilities, create customized evaluation criteria, and execute hands-on Proof of Concepts (POCs) that help your business see value. Download now!

[Website env: production]