Updated December 04th, 2024

What is Data Governance? How Atlan Views and Implements It

Futureproof your data governance with Atlan's open and extensible platform. Ensure compliance and drive adoption.
Book a Demo →

Share this article

Data governance is a structured framework that defines how organizations manage, protect, and use their data. It establishes policies, roles, and processes to ensure data security, privacy, and compliance.

Effective data governance enhances data quality, promotes accountability, and supports decision-making by providing reliable and consistent data.
See How Atlan Simplifies Data Governance – Start Product Tour

It also aligns data management practices with regulatory requirements, minimizing risks and improving operational efficiency.

By implementing data governance frameworks, organizations can safeguard sensitive information, optimize data usage, and drive strategic initiatives with confidence.

Traditionally, data governance was synonymous with centralized control, rules, and policies — something conjured up by middle managers to add friction to data scientists’ lives.

Top 10 barriers to achieving data governance objectives

Top 10 barriers to achieving data governance objectives. Source: Gartner.

That’s why modern data governance must be community-led, centered on collaboration and data democratization, with privacy at its core. Everyone in an organization should be able to access, understand, and use the right data to unearth valuable business insights.

In this article, we’ll cover the principles behind such an approach to governance. We’ll follow that up with an implementation action plan and data governance tools that can help.


Table of Contents #

  1. What is data governance?
  2. Traditional vs. modern data governance: What’s the difference?
  3. Benefits of data governance
  4. Challenges of data governance
  5. Principles driving modern data governance
  6. How Atlan handles data governance
  7. What Atlan has to offer
  8. Data governance: Bottom line
  9. FAQs about Data Governance
  10. Data governance: Related reads

What is data governance? #

Gartner defines data governance as a way to “specify decision rights and accountability to ensure appropriate behavior as organizations seek to value, create, consume, and control their data, analytics, and information assets.”

Most data governance definitions focus on establishing organization-wide policies on data use. Traditionally, data governance was all about controlling data for compliance purposes. Employees associated these measures with bureaucracy and red tape.

Besides a top-down approach, traditional data governance models also involved appointing a data steward responsible for enforcing data governance policies and standards.

But with today’s ever-increasing data volumes, this isn’t scalable. We need a new approach to data governance.


Traditional vs. modern data governance: What’s the difference? #

The first step is to change employee’s perception of data governance. It’s not about control, red tape, and bottlenecks. Instead, data governance ensures your data is trustworthy, useful, and easily available.

That’s why we define data governance as a personalized, community-centered approach toward data enablement.

To implement governance at scale, everyone should take these principles to heart::

  1. The goal of data governance isn’t merely regulatory compliance
  2. The purpose of governance is to increase the value of data
  3. Everyone — not just data stewards or data governance steering committees — is responsible for data governance
Traditional approach to data governance Modern data governance
A centralized, top-down approach focused on controlling data access A decentralized, community-centered approach focused on data enablement
Requires enforcers — data stewards and steering committees Combines traditional stewardship with self-governing behaviors to distribute the work and ensure everyone is responsible and held accountable
Attempts to ensure data governance after the fact with post-hoc data testing and validation Shifts data governance left, incorporating tagging and validation early in the data lifecycle
Involves manual processes in granting approvals, tagging and classifying data, and more Automates as many processes as possible to eliminate human error and achieve data governance at scale

Tradtional and modern data governance: What is the difference?

Tradtional and modern data governance: What is the difference? Source: Gartner.

Now let’s look at the benefits of implementing data governance at scale.


Benefits of data governance #

Good governance can maximize the value of data assets and help your data teams work better. The core benefits of modern data governance are:

  • Effective metadata management leads to better data security and accuracy
  • Better productivity and faster data discovery by eliminating time spent on non-value-added tasks
  • Lower risks and costs due to poor data management and manual processes

Let’s explore each aspect further.

1. Effective metadata management #


Metadata management requires a reliable single source of truth and enterprise-scale visibility into all data assets.

Data governance ensures that data is gathered, organized, and used appropriately, without redundancies. The goal is to know what data you have, where it came from, how it has changed, and how you can use it.

It all starts with metadata. For instance, Dr. Martens streamlined data governance by creating a streamlined modern data stack, with Atlan serving as their single metadata layer and the single source of truth. Before this change, questions about the impact of data changes on downstream data sources would take significant time and human capital to answer. With a strong metadata management system in place, Dr. Martens could answer those questions in seconds.

Robust metadata management is the key to better data governance

Robust metadata management is the key to better data governance. Source: Atlan.

2. Better productivity and faster data discovery #


According to Mike Loukides from O’Reilly Media, Inc:

Data governance and data discovery go together. You can’t use your data if you can’t find it. You can’t use your data if you don’t even know what data you have.”

Without good governance, your teams end up spending 30% of their time on non-value-added tasks such as data sourcing, processing, cleanup, and manual reporting. This affects their productivity substantially.

On the other hand, easy access to the right data helps data teams discover opportunities and insights faster, without wasting any resources.

Take, Sands Capital, which talked to its users to discover what they needed for effective data discovery. Ultimately, the company realized it needed a single metadata management system to enable discovery and build trust in its data.

Using Atlan, users could discover the answers they needed to data questions without data on the data engineering team. This enhanced visibility cut down on fire drills and frantic Slack threads, saving the company both time and money.

Faster data discovery: Key building block of modern data governance

Faster data discovery: Key building block of modern data governance. Source: Atlan.

3. Lower risks and costs due to poor data management and manual processes #


According to McKinsey’s 2019 Global Data Transformation Survey, data governance was one of the three reasons for leading firms to have “eliminated millions in cost from their data ecosystems and enabled digital and analytics use cases.”

Moreover, the companies underinvesting in governance — think unsustainable, manual processes and temporary controls — also exposed themselves to regulatory risk.

In 2020, a Citibank employee made an error while manually adjusting a Revlon loan. This error forced payment of the $900 million loan in full instead of the usual monthly interest payment.

Citibank had to deal with creditors who refused to return the money. Worse, the Office of the Comptroller of the Currency (OCC) fined the bank $400 million for improper data governance measures. These included persistent issues in risk management and internal controls.

The OCC also demanded a “re-engineering of processes to… maximize straight-through processing and minimize manual inputting and adjustments.” (They mean the Revlon fiasco.)

Incidents like these are why many companies are replacing error-prone manual processes with automated data governance tools. Tide, a UK-based digital bank, sought to improve its GDPR right-to-erasure compliance for the 500,000 small business customers it manages. Using Atlan Playbooks to automate identifying, tagging, and securing data, the company reduced a 50-day manual process into mere hours of work.

The 2024 State of Data & AI Literacy report by DataCamp indicates that over 550 leaders in the US and UK believe that enhancing data governance is crucial for improving their teams’ data and AI skills.

Read more: To further explore the benefits of data governance, check out this article on how good governance can help build great data teams.


Challenges of data governance #

Even when done with the right mindset, implementing data governance effectively carries challenges. Below are some big-ticket issues your organization should address up-front during the planning and design processes of any successful data governance initiative.

  • Organizational buy-in. A comprehensive data governance program is a significant change that impacts all areas of the company. It requires up-front investment in processes, tools, and training. Without senior leadership buy-in and approval of major stakeholders, the initiative will likely be over before it gets started. Make sure to forge a strong case that explicitly details the business value the organization will gain from a comprehensive data governance initiative.
  • Lack of standardization. According to Gartner, poor-quality data costs companies $12.9 million annually. Data governance not only improves security and compliance, but it also improves overall data quality. But that’s impossible if every team reinvents its standards for data formats, data types, and data freshness.Devote time during the data governance definition process to creating standards for data quality and selecting solutions - such as a data catalog - that will enable teams to apply these standards easily to their data.
  • Lack of self-service data management tools. Stakeholders are more apt to view data governance negatively if every step in the process requires manual approvals and three weeks of work by the data engineering team. Ask what tools and processes your company can provide to stakeholders to enable working with data in an automated, on-demand fashion. This may include self-service reporting or even full self-service data domain management via a data mesh architecture.

Statista’s 2024 data on privacy and data governance AI risk relevance indicates that privacy and data governance are significant concerns for organizations globally. In Europe, nearly 60% of organizations identified these as primary risks associated with AI adoption, followed closely by Asia at approximately 55%.

Also, read → Governance and Issue Resolution


Want to make data governance a business priority? We can help you craft a plan that’s too good to ignore! 👉 Talk to us


Principles driving modern data governance #

Four principles are central to ensuring effective data governance for the modern data stack:

  1. Governance is for data and analytics
  2. Governance is personalized
  3. Governance is community-led
  4. Governance is a part of your daily workflow

1. Governance is for data and analytics #


As Gartner highlights in its definition, data governance must extend to data, analytics, and information assets.

It’s important to ensure data sharing is easy and well-managed. But it’s also vital to ensure the same applies to analytics. That’s because data assets go beyond raw data - they extend to dashboards, models, and analytics. Analytics also has real, measurable value, and, as such, must be similarly governed.

In their 2024 report titled “A data leader’s technical guide to scaling gen AI”, McKinsey emphasizes that effective data governance is pivotal for organizations aiming to leverage generative AI (gen AI) technologies. They note that 70% of top-performing companies have faced challenges integrating data into AI models, underscoring the necessity for robust data governance frameworks.

2. Governance is personalized #


As mentioned earlier, your governance policies should be built around your data team’s projects or use cases - not the other way around.

For instance, you may define policies that mirror your finance team’s projects and curate financial data assets under one roof. That will enable your finance team to access all public financial KPIs automatically.

Similarly, you can create persona-based policies by displaying custom metadata relevant to each user. This requires understanding the various data domains, projects, and user roles thoroughly and setting up rules that support such personalized experiences.

Atlan's Personas are a way to control access to users who belong to a group/domain

Atlan's "Personas" are a way to control access to users who belong to a group/domain. Source: Atlan.

3. Governance is community-led #


Across an organization, different teams will have different relationships with data. Even though product and sales teams refer to the same data sets, they use them differently.

The product team might use customer survey results to decide which product features to build. Meanwhile, the marketing team might use that data to decide which keywords to target.

If both teams are to access and use the data effectively, they must be involved in defining the policies around that data. That means how it gets categorized and what rules govern its usage. This requires a collaborative approach to governance where the processes, policies, and roles that comprise data governance are crowd-sourced.

Such an approach also ensures that your people truly adopt the governance structure you build.

Modern data governance tools must foster collaboration

Modern data governance tools must foster collaboration. Source: Atlan.

4. Governance is a part of your daily workflow #


If governance is to be integral to your organization’s daily workflow, it cannot be an extra step in your operations.

For instance, everyone should be able to know what data exists within the organization and look up business definitions, descriptions, classifications, and more. This ensures that while request processes exist for gaining access, nobody’s barred from seeing what data is available.

Let’s revisit the previous example of product and sales teams. Say someone from sales comes across a dataset on product usage that doesn’t have a description or an owner. They can make a request or offer a suggestion to the product team (i.e., the domain owner) via Slack. The product team can review the request and then decide to approve or reject it.

Effective data governance enables users to find and understand data right with in the tools they use everyday

Effective data governance enables users to find and understand data right with in the tools they use everyday . Source: Atlan.

Read more: Delve further into the principles of data governance by exploring this article.

Also, read → Data executives’ 2024 agenda includes governance, generative AI | Data Governance Approach That Enables Business Outcomes


How Atlan handles data governance #

Atlan is an active data governance platform that offers a scalable way to secure your data while ensuring data democratization. Atlan’s design, refined by hundreds of projects, gives data teams the governance tools they need to activate their metadata.

At Atlan, we don’t just provide tech. We help organizations build full data governance plans. This includes restructuring production pipelines, reorganizing teams, and training personnel.

With tech and training together, Atlan enables teams to handle more of their own data products, leaving core data teams more agile and productive. This distributed approach to data governance has helped organizations like Nasdaq, Autodesk, and Fox cut delivery times for data products, improve understanding of data context, and build trust in their data.


What Atlan has to offer #

As a modern data governance platform, Atlan offers:

  • Auto-tagging and classification
  • Role-based and policy-based access controls (i.e., RBAC and ABAC)
  • End-to-end column-level data lineage mapping
  • Auto-policy propagation via lineage hierarchy
  • Simple data sharing and collaboration
  • Rich context via READMEs, business glossary, descriptions, and more

Atlan enables implementing secure data governance at enterprise scale. Using Atlan, you can customize your data governance policies according to data team personas, projects, or domains, i.e., RBAC (role-based access controls) and ABAC (attribute-based access control). You can also build custom bots to auto-classify sensitive data and propagate policies automatically via column-level lineage mapping.

Additionally, Atlan’s open API architecture allows you to integrate it with popular modern data stack tools such as Databricks, Snowflake, and Amazon S3. With Atlan, you can set up an interoperable data ecosystem no matter how complex your data architecture.

Check out all the supported sources on Atlan.

The recently published Forrester Wave report compared all the major enterprise data catalogs and positioned Atlan as the market leader ahead of all others. The comparison was based on 24 different aspects of cataloging, broadly across the following three criteria:

  1. Automatic cataloging of the entire technology, data, and AI ecosystem
  2. Enabling the data ecosystem AI and automation first
  3. Prioritizing data democratization and self-service

These criteria made Atlan the ideal choice for a major audio content platform, where the data ecosystem was centered around Snowflake. The platform sought a “one-stop shop for governance and discovery,” and Atlan played a crucial role in ensuring their data was “understandable, reliable, high-quality, and discoverable.”

For another organization, Aliaxis, which also uses Snowflake as their core data platform, Atlan served as “a bridge” between various tools and technologies across the data ecosystem. With its organization-wide business glossary, Atlan became the go-to platform for finding, accessing, and using data. It also significantly reduced the time spent by data engineers and analysts on pipeline debugging and troubleshooting.

A key goal of Atlan is to help organizations maximize the use of their data for AI use cases. As generative AI capabilities have advanced in recent years, organizations can now do more with both structured and unstructured data—provided it is discoverable and trustworthy, or in other words, AI-ready.

Tide’s Story of GDPR Compliance: Embedding Privacy into Automated Processes #


  • Tide, a UK-based digital bank with nearly 500,000 small business customers, sought to improve their compliance with GDPR’s Right to Erasure, commonly known as the “Right to be forgotten”.
  • After adopting Atlan as their metadata platform, Tide’s data and legal teams collaborated to define personally identifiable information in order to propagate those definitions and tags across their data estate.
  • Tide used Atlan Playbooks (rule-based bulk automations) to automatically identify, tag, and secure personal data, turning a 50-day manual process into mere hours of work.

Book your personalized demo today to find out how Atlan can help your organization in establishing and scaling data governance programs.


Data governance: Bottom line #

As the modern data stack has evolved, the diversity of data, its consumers, and the technologies used have also expanded. We’re in an era where new data and analytics use cases crop up daily.

In this environment, building a rigid set of policies that define data and its use in consultation with a handful of middle managers won’t cut it. The reason most data governance programs fail is because of a one-dimensional approach focusing on control, restrictions, and bureaucratic processes.

That’s why the perception toward data governance must change. Data governance programs must be seen as a way to extract value from data. They should be approached holistically and involve everyone in the process.

Are you evaluating and looking to deploy best-in-class data access governance for your data ecosystem without compromising on data democratization? If so, give Atlan a spin.


FAQs about Data Governance #

1. What is data governance, and why is it important? #


Data governance is a set of processes, roles, and policies that ensure data is managed effectively, securely, and responsibly. It helps organizations maintain data accuracy, improve collaboration, and ensure compliance with regulations like GDPR and CCPA.

2. How do I implement data governance in my organization? #


Implementing data governance involves defining data policies, establishing roles and responsibilities, and using tools to monitor and enforce data practices. It’s crucial to foster collaboration and adopt a flexible, community-driven governance approach.

3. What are the key components of a data governance framework? #


A robust data governance framework includes data stewardship, data quality management, privacy and security policies, compliance monitoring, and data cataloging. These components work together to ensure consistent and secure data use.

4. How does data governance relate to compliance (e.g., GDPR, CCPA)? #


Data governance ensures compliance by enforcing policies that align with legal frameworks like GDPR and CCPA. It helps organizations manage consent, maintain data privacy, and respond effectively to regulatory audits.



Share this article

[Website env: production]