Welcome to JGP.ai
Jean-Georges "jgp" Perrin is a technology consultant focusing on building innovative and modern data platforms, president of AIDAUG, and author of Spark in Action, 2nd edition (Manning). [More]
Strategic & tactical content
News of the world
DDL Is Not a Data Contract
This week, I had a surprising comment on one of my posts. I wanted to address it because your data… Continue Reading
Making Pretty Documentation from Data Contracts
Great documentation shouldn’t be an afterthought — it should be a natural outcome of well-defined data contracts.This tutorial is the last of… Continue Reading
Controlling Schema Drift
In this mission, we’re diving deep into one of the most subtle threats to data reliability: schema drift. As your… Continue Reading
Playing with Data Products
In the previous tutorial, you built and modified a data contract. In this tutorial, you will see that building a… Continue Reading
Experimenting with Data Contracts
Are you among the curious who understand a data contract but fail to apprehend its power? Are you writing data… Continue Reading
So You Want to Work With Data Contracts and Data Products?
Let’s be honest: data contracts and data products sound like something between legal paperwork and something you’d find in a… Continue Reading
The I in IBM stands for Inspiration, and the M for Massachusetts
I am back from Think 2025, the annual strategy-oriented IBM conference. The Hynes Conference Center, located in the heart of… Continue Reading
Data Product vs. Data Contract: What’s the Difference?
Data contracts and data products are like inseparable cousins — always working together, always aligned, and always making sure things run smoothly.… Continue Reading
Is DeepSeek an Enormous Geopolitical Teasing?
If you missed the news, DeepSeek is a Chinese version of OpenAI. They made the headlines a couple of weeks… Continue Reading
Defining Data Products: A Community Effort
Redefining data products through community insights, bridging product thinking and engineering principles for the future of data.
Continue ReadingODCS Roadmap
I am privileged to announce the availability of Open Data Contract Standard (ODCS) v2.2.2 from Bitol, a common project of… Continue Reading
SLA in Data
When I talk to people about service levels, and specifically service level agreements (SLA), they focus on a system's availability,… Continue Reading
Seven Dimensions of Data Quality
In this reference article, I wanted to go back on data quality and list its seven dimensions, giving examples to… Continue Reading
What is going on in the Modern Data Engineering world?
How is 2024 going for you? Some people announced recently that the Modern Data Stack is dead, but is it… Continue Reading
Getting started with ODCS
Like every new technology or concept, Open Data Contract Standard (ODCS) may require a quick getting started page. In this… Continue Reading
Why the need for standardizing Data Contracts?
Why the Need for Standardizing Data Contracts?This article describes the genesis of Bitol, the open-source data contract standard and solutions… Continue Reading