Data Fabric vs Data Mesh: What are the Key Differences?

Emily Winks profile picture
Data Governance Expert
Updated:03/02/2023
|
Published:03/24/2022
10 min read

Key takeaways

  • Understanding data fabric vs data mesh: what are the key differences? is key for modern data teams.

Quick Answer: What is Data Fabric vs Data Mesh?

Data fabric is a technology-centric architecture that centralizes data access through APIs and AI automation, while data mesh is an organizational approach that decentralizes data ownership to domain-specific teams. Data fabric solves data silos and replication challenges; data mesh focuses on agility, scalability, and domain expertise.

Key differences include:

  • Centralized vs decentralized data fabric centralizes data access, data mesh distributes it across domains
  • AI-driven automation data fabric relies on ML to eliminate human effort in data management
  • Domain ownership data mesh assigns data responsibility to business domain experts
  • Data access patterns fabric uses APIs and data stores, mesh copies data for specific use cases
  • Governance approach fabric automates governance centrally, mesh uses federated computational governance

Want to skip the manual work?

See Atlan in Action



## Data mesh vs data fabric - key differences

The main difference between a data fabric and a data mesh is that a data fabric allows everyone (within permission) easy access to data at the right time. but data mesh takes a decentralized approach by keeping separate domain-specific datasets.

The differences between Data mesh and data fabric are summarized using this table.



AspectData FabricData Mesh
ArchitectureData is centralized. Data made available through APIs. Aims to eliminate human effort with machine learning and AI.Data is stored within each domain of a company. Data is copied into specific datasets for specific use-cases. Less emphasis on AI, since work is handled by domain experts.
BenefitsSelf-service data consumption and collaboration. Automates governance, data protection, and security. Automates data integration and data engineering.Agility and scalability with fast access and accurate data delivery. Platform connectivity and data security. Robust data governance and end-to-end compliance.
Use CasesBusiness applications - challenges of data availability and reliance for business applications. Data discovery - what data is available and where. Machine learning - minimizes the data preparation phase when training ML models.Financial sector - fast fraud threat analysis without copying data to a central database. Sales and marketing - targeted campaigns based on user profiles. Machine learning - create virtual data warehouses as a basis for training ML models.

Table 1: Key Differences Between Data mesh and data fabric


Table of content

Permalink to “Table of content”
  1. Data mesh vs data fabric - key differences
  2. Is data fabric the same as data mesh?
  3. What is a Data Fabric?
  4. What is a Data Mesh?
  5. Data Mesh Vs. Data Fabric: Main Differences
  6. #1- Architecture
  7. #2- Benefits
  8. #3- Use cases
  9. Data Mesh Vs. Data Fabric: How to choose?
  10. Conclusion on data mesh vs data fabric
  11. Data fabric vs. Data mesh: Related reads

Is data fabric the same as data mesh?

Permalink to “Is data fabric the same as data mesh?”

Managing data strategy for a small organization is, well, manageable. The problem first occurs when an organization starts growing in size, and there are too many different data sources to keep track of.

Once you have the basics in place, you want to know how to optimize your data management architecture. That’s where two design concepts come into place - Data Fabric and Data Mesh.

Data Fabric vs Data Mesh. What’s the difference? Let’s look at a quote from James Serra, a Data Platform Architecture Lead at EY:

“A data fabric and a data mesh both provide an architecture to access data across multiple technologies and platforms, but a data fabric is technology-centric, while a data mesh focuses on organizational change.”


Data Mesh vs Data Fabric - Forrester + Atlan - Masterclass


Before we understand the key differences between data fabric and data mesh, let’s revisit our basic understanding of these two concepts.



What is a Data Fabric?

Permalink to “What is a Data Fabric?”

According to Gartner, data fabric is a design concept that serves as an integrated layer (fabric) of data and connecting processes. It is a composable, flexible, and scalable way to maximize the value of data in an organization. It’s not one tool or process, but rather an emerging design concept that gives a framework to think about how to stack existing tools, resources, and processes.

Data fabric connects data from multiple sources and prepares it in a way in which you - a business user or a data professional - can access and analyze it easily.

According to Forrester Wave, a data fabric can help enterprise users with the following:

  • Increase agility: As data fabric creates a semantic layer that accelerates the delivery of data and insights by automating key processes, it automatically increases agility while engaging business users and analysts in the data preparation process.
  • Minimize complexity: Data fabric aims to automate the processes of ingestion, curation, and integration of data sources to enable the analytics and insights that are critical for business success. It minimizes complexity by automating processes, workflows, and pipelines, generating code automatically, and streaming data to simplify deployment.

Data fabric solves the following pain points:

  1. Data silos: No data is transferred to the “central storage”, which means you completely avoid the problem of data hosting and privacy policies. As a result, you can spend more time analyzing the data.
  2. Replication: No need to maintain consistency and replication across all repositories.
  3. Latency: There’s no need to constantly move huge amounts of data to central storage. The amount of data grows over time, which often causes delays in the systems if we’re talking about centralized approaches.

Read more about data fabric here.

What is a Data Mesh?

Permalink to “What is a Data Mesh?”

One of the most recognizable Data Mesh definitions comes from Zhamak Dehghani:

The data mesh platform is an intentionally designed distributed data architecture, under centralized governance and standardization for interoperability, enabled by a shared and harmonized self-serve data infrastructure.

The data mesh paradigm stands for decentralized and domain-specific data ownership that is easily discoverable and ready for consumption for everyone in the organization.

Data mesh has a couple of key characteristics that distinguish it from other paradigms, as mentioned in the article Data Mesh Principles and Logical Architecture by Martin Flower:

  1. Data ownership: Data mesh stores data across different domains. This data is maintained and managed by domain experts.
  2. Data as a product: Each data domain is seen as a product, and the users are its customers.
  3. Self-serve data platform: A data mesh advocates setting up an ecosystem that supports creating, using, and maintaining data products without needing specialized knowledge or expertise in sophisticated tooling and technologies.
  4. Federated computational governance: Decentralized data products can lead to data silos. A federated approach to governance standardizes rules, definitions, and procedures related to data.

Data mesh moves away from the concept of storing, transforming, and processing analytical data centrally. Instead, it advocates that each business domain is responsible for hosting, preparing, and serving its data to its own domain and larger audience.

Several companies have publicly presented their data mesh strategy - here are just a few:

Read more about data mesh here.

Data Mesh Vs. Data Fabric: Main Differences

Permalink to “Data Mesh Vs. Data Fabric: Main Differences”


 Data FabricData Mesh
ArchitectureData is centralized. Data made available through APIs. Aims to eliminate human effort with machine learning and AI.Data is stored within each domain of a company. Data is copied into specific datasets for specific use-cases. Less emphasis on AI, since work is handled by domain experts.
BenefitsSelf-service data consumption and collaboration. Automates governance, data protection, and security. Automates data integration and data engineering.Agility and scalability with fast access and accurate data delivery. Platform connectivity and data security. Robust data governance and end-to-end compliance.
Use CasesBusiness applications - challenges of data availability and reliance for business applications. Data discovery - what data is available and where. Machine learning - minimizes the data preparation phase when training ML models.Financial sector - fast fraud threat analysis without copying data to a central database. Sales and marketing - targeted campaigns based on user profiles. Machine learning - create virtual data warehouses as a basis for training ML models.

#1- Architecture

Permalink to “#1- Architecture”

In data fabric, the data is centralized, whereas the data is stored within each domain of a company in a data mesh. Each node in the data mesh has local storage and computation power, but no single point of control.

But probably the key difference between the data mesh and data fabric is in how data is accessed. In data fabric, data is made available with objective-based APIs or data stores. On the other hand, in data mesh, data is copied into specific datasets for a specific use case, but under complete control of the business domain.

It’s also worth pointing out that a data fabric requires a central team that owns the critical functions for the fabric orchestration. It’s unlikely for that team to become a bottleneck, as much of their work is automated by artificial intelligence. On the other hand, there’s no need for a human team in data mesh. There’s less emphasis on AI since all of the work is handled by domain experts.

#2- Benefits

Permalink to “#2- Benefits”

Right off the bat, there are three data fabric benefits worth discussing:

  1. Self-service data consumption and collaboration: Enable pertinent users of data within organizations to find quality data quicker, and spend more time exploring the data.
  2. Automate governance, data protection, and security: AI-enhanced automation creates data governance rules and definitions by extracting content from regulatory documents automatically.
  3. Automate data integration and data engineering tasks: Data fabric optimizes and accelerates data delivery within the organization, and hence eliminates inefficient, repetitive, and manual data integration workflows.

The following are the benefits of data mesh:

  1. Agility and scalability with fast access and accurate data delivery: Data mesh improves time-to-market, scalability, and business domain agility. Businesses can access data from anywhere with SQL queries with much lower latency.
  2. Platform connectivity and data security: Data mesh attacks data where it lives, instead of requiring users to make a copy and route it through a public network to a data warehouse. Hence, there’s a low risk of a data breach or information loss.
  3. Robust data governance and end-to-end compliance: The decentralized data operations simplify compliance with global data governance guidelines.

#3- Use cases

Permalink to “#3- Use cases”

Use cases of data fabric:

  1. Business applications - Data fabric solves the challenges of business data availability, the unreliability of data storage formats and security, poor scalability, and reliance on underperforming legacy systems. Read more.
  2. Data discovery - This layer controls access to the right data. The data discovery layer unfolds what data is available and where.
  3. Machine learning - AI engineers can use machine learning models efficiently in a data fabric environment because the data preparation time is minimized. They also have access to secure data, which facilitates enhanced machine-learning processes. Learn more.

Data mesh use cases, according to Starburst:

  1. Financial sector - Data mesh allows international financial bodies to analyze data locally to identify fraud threads without replicating datasets and transporting them to a central database.
  2. Sales and marketing - These departments can get a 360-degree view of the consumer profile and behaviors from different systems. As a result, the departments can make more targeted campaigns which leads to an increase in the customer lifetime value and a decrease in churn.
  3. Machine Learning - Data mesh enables AI engineers to create virtual data warehouses from different sources on which machine learning models are trained without having to consolidate data in a central location.

Data mesh Vs. data fabric: how to choose?

Permalink to “Data mesh Vs. data fabric: how to choose?”

Data fabric and data mesh aren’t conflicting design approaches but instead solve different problems.

Data fabric is extremely useful if you want to manage the trouble of dealing with huge amounts of data stored in silos. This design approach provides a centralized data integration across multi-cloud, hybrid clouds, on-premise, and stand-alone hosted systems. All data in a data fabric is made available through APIs or data stores. It’s also worth mentioning that data fabric relies more on machine learning than data mesh.

On the other hand, data mesh focuses more on organizational changes. Data is managed via controlled datasets that are domain-specific, which implies a decentralized approach. All data in a data mesh is copied into a specific dataset for specific use cases but under the complete control of the business domain.

Conclusion on data mesh vs data fabric

Permalink to “Conclusion on data mesh vs data fabric”

To summarize, both data fabric and data mesh provide powerful solutions to make your organization data-driven and even data-led. Data fabric allows everyone (within permission) easy access to data at the right time. Data mesh takes a decentralized approach by keeping separate domain-specific datasets.

Choosing one over the other essentially boils down to the problem your organization is dealing with.

Share this article

signoff-panel-logo

Atlan is the next-generation platform for data and AI governance. It is a control plane that stitches together a business's disparate data infrastructure, cataloging and enriching data with business context and security.

 

Everyone's talking about the context layer. We're the first to build one, live. April 29, 11 AM ET · Save Your Spot →

Bridge the context gap.
Ship AI that works.

[Website env: production]