Amundsen Alternatives – DataHub, Metacat, and Apache Atlas

September 14th, 2022

header image for Amundsen Alternatives – DataHub, Metacat, and Apache Atlas

What are some alternatives to Amundsen?

Amundsen is a popular open-source data discovery platform and metadata engine built by the Lyft engineering team.

Introduced and open-sourced in 2019 for adoption outside of Lyft, the platform was initially built to improve the productivity of engineers, data scientists, and analysts at the ride-hailing app.

In addition to a high adoption at Lyft, Amundsen has an open source community currently spanning ~100 contributors and 39 organizations.

While you are evaluating Amundsen in the process of building an open-source based data catalog, you might want to consider these alternatives to Amundsen as well.

3 open source Amundsen alternatives


A Guide to Building a Business Case for a Data Catalog

Download free ebook


DataHub

Built by LinkedIn, DataHub is an open-source metadata search and discovery tool.

Open sourced in 2020, this tool is LinkedIn’s second attempt at solving cataloging and data discovery problems. Its first attempt, WhereHows, was in 2016.

Popular use cases of DataHub include:

  • Automating metadata ingestion from multiple data sources
  • Streamlining data discovery via searching and browsing data assets
  • Enhancing understanding of data with context

Further reading about DataHub as an Amundsen alternative:


[Download] → Forrester Wave™: Enterprise Data Catalog for DataOps, Q2 2022


Metacat

An open-source metadata management platform, Metacat powers data discovery and metadata interoperability at Netflix.

With a single access layer for data across a diverse mesh of data sources, Metacat simplifies data discovery, cataloging, processing, and management.

Metacat’s best-known capabilities include:

  • Easy data discovery
  • Data change notifications
  • One common abstraction layer
  • Provisions for the user-and business-defined metadata storage

Further reading about Metacat as an Amundsen alternative:


Apache Atlas

Apache Atlas is a popular open-source software used to build catalogs of data assets.

With an active community of committers from Hortonworks, Aetna, Merck, IBM, Target, and more leading companies, the Apache Atlas project expands year by year.

Apache Atlas has the following main capabilities:

  • Visualizing metadata lineage
  • Adding entities to metadata to streamline searches
  • Creating classifications for data

Further reading about Apache Atlas as an Amundsen alternative:


Evaluating open source data catalog tools

One of the crucial steps in enabling a collaborative and efficient data culture in your organization is deploying a data catalog software. Discovering the right one for your organization requires answering many different questions — all at once.

  • Will this platform support all our primary use cases?
  • Will it still work if our data stack changes?
  • Will our users feel comfortable using it?
  • Should we build it or buy it?
  • How do we justify the money we’re asking for?

We know thinking about all of that can be overwhelming. It helps to think through your options out loud. Here’s how we could help:


The Ultimate Guide to Evaluating an Enterprise Data Catalog

Download free ebook


Free Guide: Find the Right Data Catalog in 5 Simple Steps.

This step-by-step guide shows how to navigate existing data cataloging solutions in the market. Compare features and capabilities, create customized evaluation criteria, and execute hands-on Proof of Concepts (POCs) that help your business see value. Download now!