Data Mesh raises more questions, here are the answers

Second day of Q&A around Data Mesh with IBM’s Technical Group about “ten lessons learned from building a Data Mesh.”

Bringing vision to Apache Spark

Drsti (pronounced drishti) is an effortless data visualization that interfaces easily with Apache Spark

Get your own copy of Spark in Action 2e

Spark in Action, second edition is a favorite for the Big Bag Theory gang Spark in Action, second edition, has been out for about a month and was running a…

Awaited Apache Spark v3.0.0 is finally released

Apache Spark v3.0.0 hits the road, let’s celebrate! Apache Spark v3.0.0 has been released on June 18th, 2020, just before Spark + AI Summit 2020, which is being held virtually…

DataFriday: basic ETL ops with Apache Spark

In this episode, you will learn about doing a basic ETL (extract, transform, and load) operation using Apache Spark. You will load a basic CSV file with Apache Spark, make…

DataFriday: load a CSV file with Apache Spark

Starting today, I will host a weekly live show about data. You may join, attend “live,” and ask questions as I go through a data-oriented topic. For now, the topic…

Spark in Action’s Chapter Eleven on Working with SQL is in MEAP

A new chapter of Spark in Action, 2e, (formerly known as Spark with Java) is available. Chapter 11 is titled “Working with SQL”. In chapter 11, you will explore how…

(Almost) All you need to know about file ingestion in Apache Spark

As you may know, I start writing Apache Spark with Java (now renamed Spark in Action, 2nd edition). Usually, as the book develops, authors share a few excerpt of the book…

Eight very hot data trends for 2019

Read about eight very hot predictions for data management in 2019, in usages, shapes, governance, and people.

What is Apache Spark, the podcast

A couple of weeks ago, I chatted about Apache Spark with Tobias Macey on data engineering on more specifically Apache Spark. Tobias Macey runs the data engineering podcast, which you can directly…

Spark

Data Mesh raises more questions, here are the answers

Bringing vision to Apache Spark

Get your own copy of Spark in Action 2e

Awaited Apache Spark v3.0.0 is finally released

Like this:

DataFriday: basic ETL ops with Apache Spark

Like this:

DataFriday: load a CSV file with Apache Spark

Like this:

Spark in Action’s Chapter Eleven on Working with SQL is in MEAP

(Almost) All you need to know about file ingestion in Apache Spark

Eight very hot data trends for 2019

What is Apache Spark, the podcast

Let's be social

jgperrin.substack

/in/jgperrin

/jgperrin

Help share:

Help share:

Help share:

Help share:

Like this:

Help share:

Like this:

Help share:

Like this:

Help share:

Help share:

Help share:

Help share:

Let's be social

jgperrin.substack

/in/jgperrin

/jgperrin