Welcome to JGP.ai
Jean-Georges "jgp" Perrin is a technology consultant focusing on building innovative and modern data platforms, president of AIDAUG, and author of Spark in Action, 2nd edition (Manning). [More]
Strategic & tactical content
News of the world
DDL Is Not a Data Contract
This week, I had a surprising comment on one of my posts. I wanted to address it because your data… Continue Reading
Making Pretty Documentation from Data Contracts
Great documentation shouldn’t be an afterthought — it should be a natural outcome of well-defined data contracts.This tutorial is the last of… Continue Reading
Controlling Schema Drift
In this mission, we’re diving deep into one of the most subtle threats to data reliability: schema drift. As your… Continue Reading
Playing with Data Products
In the previous tutorial, you built and modified a data contract. In this tutorial, you will see that building a… Continue Reading
Experimenting with Data Contracts
Are you among the curious who understand a data contract but fail to apprehend its power? Are you writing data… Continue Reading
So You Want to Work With Data Contracts and Data Products?
Let’s be honest: data contracts and data products sound like something between legal paperwork and something you’d find in a… Continue Reading
The I in IBM stands for Inspiration, and the M for Massachusetts
I am back from Think 2025, the annual strategy-oriented IBM conference. The Hynes Conference Center, located in the heart of… Continue Reading
Data Product vs. Data Contract: What’s the Difference?
Data contracts and data products are like inseparable cousins — always working together, always aligned, and always making sure things run smoothly.… Continue Reading
Is DeepSeek an Enormous Geopolitical Teasing?
If you missed the news, DeepSeek is a Chinese version of OpenAI. They made the headlines a couple of weeks… Continue Reading
Defining Data Products: A Community Effort
Redefining data products through community insights, bridging product thinking and engineering principles for the future of data.
Continue ReadingODCS Roadmap
I am privileged to announce the availability of Open Data Contract Standard (ODCS) v2.2.2 from Bitol, a common project of… Continue Reading
SLA in Data
When I talk to people about service levels, and specifically service level agreements (SLA), they focus on a system's availability,… Continue Reading
Seven Dimensions of Data Quality
In this reference article, I wanted to go back on data quality and list its seven dimensions, giving examples to… Continue Reading