OvalEdge Blog - our knowledge about data catalog and data governance

What is Data Mesh? Principles & Architecture

Written by OvalEdge Team | Oct 31, 2022 6:19:08 PM

Until recently, data lakes were the hot thing in data architecture, but more and more companies are turning away from this approach in search of an alternative.

This is because organizations using this centralized approach have found that it’s inefficient due to the over reliance on dedicated data teams. Not only did this cause bottlenecks, but it also caused problems, because it’s impossible for a data team to know and understand every team's needs.

In fact, Gartner reported that 80% of organizations seeking to scale digital business between now and 2025 will fail because they don’t take a modern approach to data and analytics governance.

For many in the industry, the alternative of choice is Data Mesh, a new architecture that could well be the solution.

There are already some fantastic resources available on the topic, but we wanted to give you a summary, and explain where OvalEdge fits in.

If you want to know more about implementing Data Mesh, or want to see how OvalEdge can help, schedule a demo here.

What is Data Mesh?

As with any new concept or technology, the first thing we need to know is what Data Mesh actually is? It’s only a couple years old, so resources are scarce. But as it was created by Zhamak Dehghani at ThoughtWorks, it makes sense to kick things off with their definition:

Data Mesh is an analytical data architecture and operating model where data is treated as a product and owned by teams that most intimately know and consume the data.

This is a good high-level definition, and it introduces the core concept that data ownership should be decentralized.

To help achieve this, there are four principles of Data Mesh:

  1. Domain Ownership
  2. Data as a product
  3. Self-serve data platforms
  4. Federated computational governance

These principles form an architecture that can be applied across your business, helping avoid bottlenecks and creating shared ownership.

But before we delve further into these principles, it’s important to first talk about what Data Mesh isn’t.

  • It isn’t something that can be implemented by a single person
  • No single tool can generate or implement a ‘mesh’
  • It’s not a silver bullet that will magically fix all your data issues

Instead, it’s a strategic framework for getting value from data by changing how you share data, structure your teams, and carry out governance. Decentralization lies at the heart of it, and if implemented well, your data can go beyond BI reports to drive innovation and analytics at scale.

It’s called a “mesh” because teams can access data products created and owned by other teams, creating a connected network of insights across the organization.

What are the benefits of implementing Data Mesh?

Organizations are moving away from traditional data lakes because they create bottlenecks. Centralized data control can lead to slow decision-making and disjointed understanding of data needs.

Data Mesh eliminates these challenges by decentralizing architecture and empowering domain-level autonomy.

Key benefits include:

  • Reduced bottlenecks: Teams can operate independently without waiting on a central team.
  • Faster insights: Each team manages and accesses data directly relevant to their function.
  • Improved accountability: Data quality and governance are owned by domain experts.
  • Scalable collaboration: Teams can share, consume, and build on each other’s data products.

It’s worth mentioning that if your current architecture already works effectively, adopting Data Mesh won’t automatically yield better results. These benefits primarily apply to organizations struggling with rigid data lake systems or seeking to modernize their data strategy.

Data Mesh Principles and Architecture

As noted earlier, Data Mesh operates on four foundational principles. These are interdependent — following all four ensures maximum efficiency and scalability.

Domain ownership

This principle often represents the largest cultural shift. Instead of a centralized team managing all enterprise data, each domain team takes responsibility for their data lifecycle.

This decentralization reduces reliance on dedicated data engineers and eliminates friction between teams and the data they use daily.

For example:

  • If George from marketing needs web analytics, they no longer wait on IT.
  • If developer Maggie needs debugging data, she can directly access relevant datasets.

However, with this freedom comes greater responsibility — teams must ensure data quality, compliance, and reliability within their own domains.

Data as a product

In a Data Mesh, data is treated as a product, not just a byproduct of operations.

Each data domain team becomes both a producer and a custodian. Their role extends to:

  • Ensuring data is discoverable, understandable, and trustworthy.
  • Designing access mechanisms that serve internal “data consumers.”
  • Maintaining clear documentation and metadata for transparency.

This mindset shift enforces accountability and creates a consumer-friendly data ecosystem across the enterprise.

Self-serve data platforms

You might ask, “How do we enable all teams to build and manage their data products effectively?”

That’s where self-service data platforms come in.

A domain-agnostic platform team manages infrastructure and provides the tools, templates, and environments domain teams use to deploy their products.

This principle empowers teams to:

  • Build, test, and publish their own data products.
  • Access analytics capabilities without deep technical expertise.
  • Reduce dependency on IT or centralized data engineers.

It simplifies innovation while maintaining control, making domain autonomy sustainable and scalable.

Federated computational governance

Even in a decentralized setup, governance must be consistent. Federated governance ensures that standards and policies remain uniform across all domains.

This principle involves:

  • Forming a governance council of domain and platform product owners.
  • Defining global policies for interoperability, security, and compliance.
  • Enforcing data quality, access control, and metadata consistency.

It strikes the right balance between local autonomy and global consistency, ensuring seamless cross-domain collaboration.

Data Mesh Logical Architecture

A Data Mesh architecture enables domain teams to perform cross-domain data analysis using self-service tools. Each domain maintains operational and analytical data, builds models, and creates products consumable by others.

Teams collaborate to set global interoperability, documentation, and security standards through a federated governance framework.

An enabling team supports all domains by guiding them on modeling, platform usage, and product interoperability, ensuring smooth adoption of Data Mesh principles.

How do you create a Data Mesh using OvalEdge? 

As much as we’d love to say you can simply install the Data Mesh plugin, and OvalEdge will do the rest for you, it’s not quite that simple. As I mentioned previously, this is an architecture, and can’t be solved by one tool. But OvalEdge can play a crucial role in solving the puzzle. 

Here’s an example of how OvalEdge fits into your physical architecture:


Establish Federated Governance and Control it

Governance is at the heart of everything OvalEdge does, so establishing federated governance for your Data Mesh is as simple as possible. OvalEdge is a unified platform with Data Catalog, Access Management and various kinds of policy enforcement. 

Using OvalEdge, you can easily manage and control the following policies. 

  • Privacy policies, which are dictated by various regulations like GDPR and CCPA
  • Documentation policies, compliant with regulations like BCBS in the banking sector, and SOX for all public companies. 
  • Establish policies for confidential data.
  • Establish locationality policy, which is dictated by locationality laws. 
  • Establish policies for secret data
  • Standardization & documentation policies

Then once they’re established, you can monitor and control these policies within OvalEdge.

Build a self-service platform

Data Mesh doesn’t work unless everyone can find the data they need. OvalEdge makes it easy to build your own self-service platform. You can configure many data sources, and use it in a number of ways:

  • Data Catalog: which allows everyone to easily find and understand the data at scale. 
  • Data Access Management: this allows you to maintain security, while also allowing end-users to access data from other domains using an approval process. 
  • Business Glossary: this allows you to standardize your documentation and apply it across the data. 
  • Collaborate on data with the right context. 
  • Analyze data using various tools like Querysheet and other third-party tools.

Establish Domain-based Teams

As we’ve discussed, splitting people into domain teams is a key part of Data Mesh. Once you’ve decided what these teams are, you can simply organize them into teams within OvalEdge. We also provide various ways to organize teams, by roles, by teams, etc. 

This helps you assign people the right privileges, and make changes to the configuration in bulk. 

Divide Data into Domain

Once you have your data sources and governance set up, you can divide your data into domains that match your team structure.

Following traditional data governance, you can ensure everyone has access to the data they need, while remaining compliant in every market you’re in.

Build new Data Products

Now your teams can work to create their data products, and start creating value for each other.

Working together in OvalEdge, it’s easy to organize the data into meaningful products, and share them.

Share & Collaborate

Colleagues can then share and collaborate on their data products within OvalEdge. Whether that’s within their domain team, or across teams.

At this point, you have the makings of an effective Data Mesh, where teams own and maintain their own data products. And teams can easily use each other's products to work towards a shared goal.

Integrate

When used alongside a data storage provider like Snowflake, OvalEdge is the perfect platform for managing your decentralized data across your domain teams.

We manage your orchestration layer, including data governance and data analytics. Another key part of OvalEdge is the Data Catalog. This brings all your datasets together, allowing you to mesh them, create relationships, etc.

Now, I’m not going to sit here and insist that you need OvalEdge if you want to implement an effective Data Mesh architecture.

But what I can say is that we make it a whole lot easier!

Why Choose OvalEdge for Data Mesh Implementation?

While no single tool can “do” Data Mesh, OvalEdge streamlines every aspect of it from governance to access control to collaboration. The platform provides federated visibility, compliance automation, and data discovery, all critical enablers of a scalable Data Mesh.

To see how easily you can modernize your architecture:
👉 Request a Data Mesh Demo and experience how OvalEdge simplifies governance and enables autonomy.

Frequently Asked Questions

  1. What is Data Mesh in simple terms?
    Data Mesh is a decentralized approach to managing and analyzing data where each business domain owns and governs its own data, treating it as a product that other teams can use.
  2. What are the four key principles of Data Mesh?
    They are: Domain Ownership, Data as a Product, Self-Serve Data Platform, and Federated Computational Governance.

3. How does OvalEdge support Data Mesh implementation?
OvalEdge helps by providing data cataloging, access management, policy enforcement, and governance tools to build and manage federated data systems efficiently.

 

What you should do now

  1. Schedule a Demo to learn more about OvalEdge
  2. Increase your knowledge on everything related to Data Governance with our free WhitepapersWebinars and Academy
  3. If you know anyone who'd enjoy this content, share it with them via email, LinkedIn, Twitter or Facebook.