What we do

Resources

Cases

About us

Select Language

We help you climb the data maturity ladder

Scaling for success with data and AI

Trusted by:

Give your data team magical powers

Granting your data team supernatural capabilities for unrivaled insights and innovation

Data Strategy

A solid data strategy, architecture and infrastructure allow organizations to focus efforts on where it matters most i.e. value creation.

Gen AI

GenAI offers a transformative pathway for businesses, empowering them with unparalleled efficiency, insights, and competitive advantage.

Data Platforms

A data strategy translates the corporate and digital strategy for your data organization, while a data architecture study clarifies how to implement it.

Data Platforms

A data strategy translates the corporate and digital strategy for your data organization, while a data architecture study clarifies how to implement it.

Data Platforms

A data strategy translates the corporate and digital strategy for your data organization, while a data architecture study clarifies how to implement it.

Clients

138

data initiatives delivered

400+

people trained

365+

Conveyor projects

Learning/Knowledge base

Take a deeper dive

How we reduced our docker build time by 40%

How we reduced our docker build times by 40%

This post describes two ways to speed up building your Docker images: caching build info remotely, using the link option when copying files

The building blocks of successful Data Teams

During my 7 years of experience in the field of data, I have worked in multiple data teams of which some were successful but others…

How to organize a data team to get the most balue out of data

How to organize a data team to get the most value out of data

To state the obvious: a data team is there to bring value to the company. But is it this obvious? Haven’t companies too often created a…

Learning/Knowledge base

Take a deeper dive

Rethinking the data product workbench in the age of AI

Rethinking how AI empowers data teams to build and maintain better data products without replacing them.

Episode 12. RADAR by publiq: How AI is Reshaping Cultural Discovery, Without Compromising Privacy

Episode 12. RADAR by publiq: How AI is Reshaping Cultural Discovery, Without Compromising Privacy

How Publiq uses AI and LLMs in RADAR to enrich cultural data and deliver smarter, privacy-first event recommendations.

When writing SQL isn't enough: debugging PostgreSQL in production

When writing SQL isn't enough: debugging PostgreSQL in production

SQL alone won’t fix broken data. Debugging pipelines requires context, lineage, and collaborationnot just queries.

Episode 11. The Art of the Data Platform: Reducing Cognitive Load and Driving Adoption with Jelle De Vleminck

Episode 11. The Art of the Data Platform: Reducing Cognitive Load and Driving Adoption with Jelle De Vleminck

Lessons, mistakes & trends in building data platforms devs love, insights from Jelle De Vleminck for data leaders.

AI Innovation That Delivers: How to Align Strategy, Adoption, and Business Value with Joris Renkens

Episode 10. AI Innovation That Delivers: How to Align Strategy, Adoption, and Business Value with Joris Renkens

How can companies truly innovate with AI? Joris Renkens shares practical lessons on strategy, adoption, and building the right teams.

You Don’t Need the Latest Stack. You Need Better Questions. Episode with Rushil Daya & Kris Peeters

Episode 9. You Don’t Need the Latest Stack. You Need Better Questions. Episode with Rushil Daya & Kris Peeters

Learn to build high-performing data teams with agile practices, strong leadership, CI/CD, testing, mentoring, and collaboration.

Portable by design: Rethinking data platforms in the age of digital sovereignty

Portable by design: Rethinking data platforms in the age of digital sovereignty

Build a portable, EU-compliant data platform and avoid vendor lock-in—discover our cloud-neutral stack in this deep-dive blog.

Episode 8. What It Really Takes to Build a Data-Centric Organization, with Jonny Daenen & Kris Peeters

Episode 8. What It Really Takes to Build a Data-Centric Organization, with Jonny Daenen & Kris Peeters

What does it take to build truly data-centric organizations?This episode goes beyond tools and dashboards to uncover lasting value.

European Clouds vs. Hyperscalers: Can You Really Build a Sovereign Data Platform?

European Clouds vs. Hyperscalers: Can You Really Build a Sovereign Data Platform?

Episode 7: A Deep Dive into SQLMesh: Structured Query Validation and Safe Pipeline Testing

Episode 7. A Deep Dive into SQLMesh: Structured Query Validation and Safe Pipeline Testing with Michiel De Muynck

Michiel De Muynck explores SQLMesh: semantic layer, virtual data environments, and unit testing for SQL-based analytics.

Cloud Independence: Testing a European Cloud Provider Against the Giants

Cloud Independence: Testing a European Cloud Provider Against the Giants

Can a European cloud provider like Ionos replace AWS or Azure? We test it—and find surprising advantages in cost, control, and independence.

Data Mesh Live: How to make it successful in organisations - The Data Playbook podcast by Dataminded

Episode 6. Data Mesh Live: How to make it successful in organisations, with Jacek Majchrzak & Andrew Jones

Can Data Mesh solve your data bottlenecks? Join us to learn how Data Mesh tackles decentralization and scalability challenges.

Data Modeling: Why should I care? Unveiling the Sense and Nonsense with Jonas De Keuster

Episode 5. Data Modeling: Why should I care? Unveiling the Sense and Nonsense with Jonas De Keuster

Explore the sense & nonsense of data modeling with Jonas De Keuster (VaultSpeed) on automation, integration & best practices.

Episode 4. From On-Prem to Cloud (Again): How a Government Agency Made the Big Bang Work

Episode 4. From On-Prem to Cloud (Again): How a Government Agency Made the Big Bang Work

A government client moved back to the cloud. Standardization made a 14h Big Bang migration secure, seamless and practically invisible.

Stop loading bad quality data

Ingesting all data without quality checks leads to recurring issues. Prioritize data quality upfront to prevent downstream problems.

A 5-step approach to improve data platform experience

A 5-step approach to improve data platform experience

Boost data platform UX with a 5-step process:gather feedback, map user journeys, reduce friction, and continuously improve through iteration

Episode 3. Building Sustainable Data Products that actually get used

Episode 3. Building Sustainable Data Products that actually get used

Discover how sustainable data products drive real impact, lasting value, and better business decisions—without unnecessary complexity.

Episode 2. What Not to Build with AI: Avoiding the New Technical Debt in Data-Driven Organizations

Episode 2. What Not to Build with AI: Avoiding the New Technical Debt in Data-Driven Organizations

Why AI acceleration can backfire: lessons on digital sprawl, governance trade-offs, and building what truly matters in data-driven teams.

April 29th - Belgium dbt Meetup #10 (in-person)

What Is Data Product Thinking?

Data Product Thinking treats data as a product, empowering domain teams to own, improve, and scale trusted, user-focused data assets.

Why the ‘Private’ API Gateway of AWS Might Not Be as Secure as You Think

Why the ‘Private’ API Gateway of AWS Might Not Be as Secure as You Think

AWS Private API Gateways aren’t always private, misconfigs can expose access. Use resource policies to secure them properly.

The Data Engineer’s guide to optimizing Kubernetes

The Data Engineer’s guide to optimizing Kubernetes

Boost Kubernetes batch workload efficiency with smarter scheduling, autoscaling tweaks & spot instance handling.

Episode 1. Building Data Architecture from Scratch: Cloud, Data Mesh, and the Real-World Tradeoffs

Episode 1. Building Data Architecture from Scratch: Cloud, Data Mesh, and the Real-World Tradeoffs

Cloud vs on-prem, US tech risks, data mesh, team dynamics & the hidden costs behind “modern” data platforms—hard truths for data leaders.

Integrating MegaLinter to Automate Linting Across Multiple Codebases. A Technical Description.

Integrating MegaLinter to Automate Linting Across Multiple Codebases. A Technical Description.

Automate code quality with MegaLinter, SQLFluff, and custom checks in Azure DevOps CI. Supports multi-language linting and dbt integration.

Previous Edition: State of Data 2025

State of Data – An event for data leaders & engineers on AI, data platforms, MLOps & analytics. Missed event? Watch the recordings!

Are your AKS logging costs too high? Here’s how to reduce them

Are your AKS logging costs too high? Here’s how to reduce them

Cut Azure logging costs: reduce log volume, use Basic tables via the new ingestion API, and try a custom Fluentbit plugin with Go.

Source-Aligned Data Products: The Foundation of a Scalable Data Mesh

Source-Aligned Data Products: The Foundation of a Scalable Data Mesh

Source-Aligned Data Products ensure trusted, domain-owned data at the source—vital for scalable, governed Data Mesh success.

World Class Business Leaders Event in Frankfurt

World Class Business Leaders Event in Frankfurt

We share practical insights on how organisations can establish scalable, self-service data platforms to drive efficiency and ownership.

The State of Data Work in 2025: Insights From 32 In-Depth Conversations

The State of Data Work in 2025: Insights From 32 In-Depth Conversations

Insights from 32 data professionals reveal 2025 challenges: balancing AI innovation, governance, quality, cost, collaboration, and literacy.

Data Engineering Winter School

Monitoring thousands of Spark applications without losing your cool

Monitoring thousands of Spark applications without losing your cool

Monitor Spark apps at scale with CPU efficiency to cut costs. Use Dataflint for insights and track potential monthly savings.

Data Modelling In A Data Product World

Central DWHs hit scaling limits. Data products offer a modular, federated solution—flexible, reusable, and closer to business reality.

SAP CDC with Azure Data Factory

Build SAP CDC in Azure Data Factory with SAP views, but high IR costs. Kafka + Confluent offers a cheaper, scalable alternative.

THE DATA PRODUCT THINKING MEETUP - Dataminded Germany

THE DATA PRODUCT THINKING MEETUP - Dataminded Germany

Beyond the Buzzwords: Let’s Talk About the Real Challenges in Data

Beyond the Buzzwords: Let’s Talk About the Real Challenges in Data

Cut through data buzzwords join honest chats with data pros to uncover real challenges, knowledge gaps & clever wins.

Getting Data Done in Healthcare - Lessons from the Frontlines

From Good AI to Good Data Engineering.AI to

From Good AI to Good Data Engineering. Or how Responsible AI interplays with High Data Quality

Responsible AI depends on high-quality data engineering to ensure ethical, fair, and transparent AI systems.

A glimpse into the life of a data leader

A glimpse into the life of a data leader

Data leaders face pressure to balance AI hype with data landscape organization. Here’s how they stay focused, pragmatic, and strategic.

Dataminded Germany Meetup: THE DOCTOR'S DATA KNIGHT ⚔️🛡️

Beyond Medallion: How to Structure Data for Self-Service Data Teams

Beyond Medallion: How to Structure Data for Self-Service Data Teams

Medallion architecture limits self-service. Shift to data product thinking with input, private, and output data for agile, governed scaling.

Empowering Every User: How Data Product Portal Accelerates Collaboration

Empowering Every User: How Data Product Portal Accelerates Collaboration

How we democratized data access with Streamlit and Microsoft-powered automation

How we democratized data access with Streamlit and Microsoft-powered automation

Streamlit app + Power Automate = easy, self-serviced data access at scale, no YAML editing needed, just governance that actually works.

Unlocking the new Power of Advanced Analytics

Unlocking the new Power of Advanced Analytics

Advanced analytics powered by LLMs and strong data engineering enables smarter predictions, deeper insights, and AI you can trust.

Simplifying Data Management: Introducing the Data Product Portal

Join us for an exclusive webinar hosted by Kris Peeters and Wannes Rosiers as they unveil the Data Product Portal.

How To Conquer The Complexity Of The Modern Data Stack

How To Conquer The Complexity Of The Modern Data Stack

The more people on a team, the more communication lines. Same goes for tools in your data stack, complexity scales fast

Webinar: "Initial Reactions: Market Insights on the Data Product Portal"

Webinar: "Initial Reactions: Market Insights on the Data Product Portal"

The Data Product Portal Integrates With Your Preferred Data Platform

The Data Product Portal Integrates With Your Preferred Data Platform

Data Product Portal integrates with AWS to manage data products, access, and tooling—enabling scalable, self-service data platforms.

How To Reduce Pressure On Your Data Teams

How To Reduce Pressure On Your Data Teams

Data demand grows, pressuring small teams. Shift to focused data product teams and use portals to stay efficient and avoid data siloes.

Microsoft Fabric’s Migration Hurdles: My Experience

Microsoft Fabric’s Migration Hurdles: My Experience

Migrating to Microsoft Fabric?My experience shows it’s not ideal for modular platforms yet limited flexibility,IaC gaps & performance issues

Data Product Portal Integrations 2: Helm

Data Product Portal Integrations 2: Helm

Data Product Portal links governance, access & tools for self-service data on AWS. Supports Terraform & API integration for automation.

Data Stability with Python: How to Catch Even the Smallest Changes

Detect data changes efficiently by sorting and hashing DataFrames with Python—avoid re-running pipelines and reduce infrastructure costs.

Why You Should Build A User Interface To Your Data Platform

Why You Should Build A User Interface To Your Data Platform

Don’t give users a bag of tools—build a UI for your data platform to reduce complexity, boost adoption, and enable true self-service.

Data Product Portal Integrations 1: OIDC

Data Product Portal Integrations 1: OIDC

Integrate OIDC with the Data Product Portal for secure, user-specific access via SSO. Easy setup with AWS Cognito, Docker, or Helm.

The State of Data Products in 2024

Data Products are rising fast in 2024, focusing on user experience, collaboration, and governance—set to reach maturity within 2–3 years.

Clear signals: Enhancing communication within a data team

Clear signals: Enhancing communication within a data team

Clear team communication boosts data project success. Focus on root problems, structured discussions, and effective feedback to align better

Demystifying Device Flow

Implement OAuth 2.0 Device Flow with AWS Cognito & FastAPI to enable secure logins for headless devices like CLIs and smart TVs.

Introducing Data Product Portal: An open source tool for scaling your data products

The Data Product Portal is an open-source tool to build, manage & govern data products at scaleenabling clear access, lineage & self-service

The impact of product thinking for data

Data governance has been changing throughout the years, and the adoption of product thinking to data once again introduces change to data...

Short feedback cycles on AWS Lambda

Speed up AWS Lambda dev with a Makefile: build, deploy, test, and stream logs in one loop boost feedback cycles to just ~15 seconds.

Sustainable GenAI: Technology, Data and Gove

Sustainable GenAI: Technology, Data and Governance

Join us for an event focused on the crucial aspects of GenAI development and its implementation.

The Missing Piece to Data Democratization is More Actionable Than a Catalog

The Missing Piece to Data Democratization is More Actionable Than a Catalog

The Data Product Portal is the missing link for scaling data democratization, beyond catalogs, it unifies access, governance & tooling.

Prompt Engineering for a Better SQL Code Generation With LLMs Copy

Prompt Engineering for a Better SQL Code Generation With LLMs Copy

Boost SQL generation with LLMs using prompt engineering, schema context, user feedback & RAG for accurate, business-aware queries.

Data Engineering Summer School 2024

Learn data engineering from the pros!

Age of DataFrames 2: Polars Edition

In this publication, I showcase some Polars tricks and features.

Quack, Quack, Ka-Ching: Cut Costs by Querying Snowflake from Duck

Quack, Quack, Ka-Ching: Cut Costs by Querying Snowflake from DuckDB

How to leverage Snowflake’s support for interoperable open lakehouse technology — Iceberg — to save money.

The building blocks of successful Data Teams

The building blocks of successful Data Teams

5 key traits of successful data teams: ownership, business focus, software best practices, self-service, and company-wide strategy.

GenAI: Hands-On LLM Hackathon

An interactive, hands-on hackathon on building and deploying an entire LLM-based application from start to finish.

Conveyor x Luminus Event: "Fast lane to data value. Embracing platform and product thinking"

Conveyor x Luminus Event: "Fast lane to data value. Embracing platform and product thinking"

Querying Hierarchical Data with Postgres

Querying Hierarchical Data with Postgres

Query hierarchical data in Postgres using recursive CTEs. Navigate up/down trees, track depth, and aggregate—great for parent-child data.

Securely use Snowflake from VS Code in the browser

Securely use Snowflake from VS Code in the browser

Secure Snowflake SSO in browser-based VS Code using custom OAuth, CLI/API auth flow, and a dbt adapter for seamless cloud IDE integration.

The benefits of a data platform team

Build a dedicated data-platform team to manage ingest,storage & tools, freeing business data teams to focus on creating value from insights.

How to organize a data team to get the most value out of data

How to organize a data team to get the most value out of data

Data teams succeed by shifting from tech-only focus to value delivery—combine product thinking, business goals & cross-functional roles.

Why not to build your own data platform

A round-table discussion summary on imec’s approach to their data platform

Becoming Clout* certified

Hot takes about my experience with cloud certifications

You can use a supercomputer to send an email but should you?

You can use a supercomputer to send an email but should you?

Discover the next evolution in data processing with DuckDB and Polars

Two Lifecycle Policies Every S3 Bucket Should Have

Two Lifecycle Policies Every S3 Bucket Should Have

Abandoned multipart uploads and expired delete markers: what are they, and why you must care about them thanks to bad AWS defaults.

Decentralized vs Centralized: How to organize your data teams?

Decentralized vs Centralized: How to organize your data teams?

How we used GenAI to make sense of the government

How we used GenAI to make sense of the government

We built a RAG chatbot with AWS Bedrock and GPT4 to answer questions about the Flemish government.

My key takeaways after building a data engineering platform

My key takeaways after building a data engineering platform

Building a data platform taught me: deleting code is vital, poor design has long-term costs, and dependency updates are never-ending.

Leveraging Pydantic as a validation layer.

Leveraging Pydantic as a validation layer.

Ensuring clean and reliable input is crucial for building robust services.

State of Data 2024

7 Lessons Learned migrating dbt code from Snowflake to Trino

7 Lessons Learned migrating dbt code from Snowflake to Trino

Snowflake to Trino dbt migration: watch out for type casting, SQL functions, NULL order, and window function quirks.

Growing your data program with a use-case-driven approach

Growing your data program with a use-case-driven approach

Use-case-driven data programs balance planning & building, enabling fast value, reduced risk, and scalable transformation.

Everyone to the data dance floor: a story of trust

Everyone to the data dance floor: a story of trust

Data democratization is coming, but trust and governance are key. Start with pipeline observability: track runs, versions, and authors.

Quacking Queries in the Azure Cloud with DuckDB

Quacking Queries in the Azure Cloud with DuckDB

DuckDB on Azure: fsspec works for now, but native Azure extension is faster—especially with many small files. Full support is on the way.

Future-Proofing Your Data Engineering Career. Essential Skills for 2024 and Beyond

Future-Proofing Your Data Engineering Career. Essential Skills for 2024 and Beyond

Embark on a journey with our top-tier Data Engineers as they unveil the key skills that will shape the future of data engineering.

How we reduced our docker build times by 40%

How we reduced our docker build times by 40%

This post describes two ways to speed up building your Docker images: caching build info remotely, using the link option when copying files

Cross-DAG Dependencies in Apache Airflow: A Comprehensive Guide

Cross-DAG Dependencies in Apache Airflow: A Comprehensive Guide

Exploring four methods to effectively manage and scale your data workflow dependencies with Apache Airflow.

Upserting Data using Spark and Iceberg

Use Spark and Iceberg’s MERGE INTO syntax to efficiently store daily, incremental snapshots of a mutable source table.

How we reduced our docker build times by 40%

This post describes two ways to speed up building your Docker images: caching build info remotely, using the link option when copying files

The building blocks of successful Data Teams

During my 7 years of experience in the field of data, I have worked in multiple data teams of which some were successful but others…

How to organize a data team to get the most value out of data

To state the obvious: a data team is there to bring value to the company. But is it this obvious? Haven’t companies too often created a…

We empower businesses like yours

The best companies drive digital process adoption with Dataminded

ML scalability

Introducing cloud for doing ML training at scale

Cloud data platform

Scaling advanced analytics to continuously improve customer experience of media products

SAAS product

Setting up a SaaS product on AWS. Leveraging cloud scalability to meet client demand.

What our customers say

Join hundreds of all sizes and across all industries have made a big improvements with us.

“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Ryan Call
Creative Director @Placeholder
“Their team was able to identify the most effective keywords and create ads that spoke directly to our target audience.”
Jenny Wilson
CEO Founder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Dave Debryne
Creative Director @Placeholder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Ryan Call
Creative Director @Placeholder
“Their team was able to identify the most effective keywords and create ads that spoke directly to our target audience.”
Jenny Wilson
CEO Founder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Dave Debryne
Creative Director @Placeholder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Ryan Call
Creative Director @Placeholder
“Their team was able to identify the most effective keywords and create ads that spoke directly to our target audience.”
Jenny Wilson
CEO Founder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Dave Debryne
Creative Director @Placeholder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Ryan Call
Creative Director @Placeholder
“Their team was able to identify the most effective keywords and create ads that spoke directly to our target audience.”
Jenny Wilson
CEO Founder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Dave Debryne
Creative Director @Placeholder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Ryan Call
Creative Director @Placeholder
“Their team was able to identify the most effective keywords and create ads that spoke directly to our target audience.”
Jenny Wilson
CEO Founder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Dave Debryne
Creative Director @Placeholder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Ryan Call
Creative Director @Placeholder
“Their team was able to identify the most effective keywords and create ads that spoke directly to our target audience.”
Jenny Wilson
CEO Founder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Dave Debryne
Creative Director @Placeholder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Ryan Call
Creative Director @Placeholder
“Their team was able to identify the most effective keywords and create ads that spoke directly to our target audience.”
Jenny Wilson
CEO Founder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Dave Debryne
Creative Director @Placeholder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Ryan Call
Creative Director @Placeholder
“Their team was able to identify the most effective keywords and create ads that spoke directly to our target audience.”
Jenny Wilson
CEO Founder
“Success to resolve my issue quickly and efficiently, and their friendly and helpful made the experience even better.”
Dave Debryne
Creative Director @Placeholder