How I built the perfect data science team

When I assembled my first data science team, the term was barely getting printed in the Harvard Business Review. I had no clue that I was building a team pioneering…

(Almost) All you need to know about file ingestion in Apache Spark

As you may know, I started writing Apache Spark with Java (now renamed Spark in Action, 2nd edition). Usually, as the book develops, authors share a few excerpts of the book…

Eight very hot data trends for 2019

Read about eight very hot predictions for data management in 2019, in usages, shapes, governance, and people.

What is Apache Spark, the podcast

A couple of weeks ago, I chatted with Tobias Macey about data engineering and, more specifically, Apache Spark. Tobias Macey runs the data engineering podcast, which you can directly…

Microsoft SQL Server 2019 gets a Spark

Yesterday, during Ignite 2018, Microsoft announced that they will integrate Apache Spark more tightly with SQL Server 2019. If you missed previous announcements around SQL Server, it now runs on…

Ingestion of data from databases into Apache Spark

Chapter 8 of Spark with Java is out and it covers ingestion, as did chapter 7. However, while chapter 7 focused on ingestion from files, chapter 8 focuses on…

Apache Spark with Java

Apache Spark has been a game changer for distributed data processing, thanks to an easy-to-understand API, a focus on simplicity, and an adoption of modern infrastructure. However, rumors…

Apache Spark Maturity on the Rise

Spark Summit Europe 2017 just concluded here in Dublin. More than 102 speakers, 1200 attendees, and an impressive Databricks team attended the 3-day-long celebration. Spark is reaching a new…

Spark is Making Big Data Easy at NCDevCon

NCDevCon is a yearly event in the Triangle, targeted at developers of all breeds, from front-end to back-end. Its origins lie in the ol’ days of Adobe ColdFusion, and thus…

A Deep-Dive Introduction to Spark for RDBMS Users

Earlier in the summer, I started a series of articles for IBM developerWorks. Those articles focus on Apache Spark from an RDBMS user perspective; of course, the database of choice…

Let's be social

jgperrin.substack

/in/jgperrin

/jgperrin
