Spark is Making Big Data Easy at NCDevCon

NCDevCon is a yearly event in the Triangle, targeted at developers of all breeds, from front-end to back-end. Its origins lie in the ol’ days of Adobe ColdFusion, and thus […]

Loading CSV in Spark

Loading CSV in Apache Spark has been a standard feature since version 2.0; previously, you needed a free plugin (provided by Databricks). Although it starts with a basic value proposition: Comma […]

A New Dimension for Apache Spark Clusters

Summer has been busy and it’s now behind us. I won’t annoy you with all the details of what happened, but I wanted to come back to a project I […]

A Deep-Dive Introduction to Spark for RDBMS Users

Earlier in the summer, I started a series of articles for IBM developerWorks. Those articles focus on Apache Spark from an RDBMS user's perspective; of course, the database of choice […]

Getting Ready for This Pint of Guinness

Next month, I’ll be heading to Dublin, the capital of Ireland. I have been to Ireland quite a few times – I was 3 the first time. However, this time, […]

Apache Spark 2.2 is Out

The Best Spark in Town: Yesterday, Apache Spark v2.2.0 was released. Excitement started a few months ago, reaching a “summit” during Spark Summit where a lot of the features […]

The Key to Machine Learning is Prepping the Right Data

Earlier this month, I was in San Francisco, CA, to attend Spark Summit 2017. I gave a talk on the phase before you can apply Machine Learning to data, using […]

A new Informix Book is Out for MacOS and Java

Why a book on Informix? Informix has been a passion for almost 20 years. Very often, a younger version of myself would say: “I don’t like databases, that’s why I […]

What are Spark Checkpoints on Dataframes?

Let’s understand what checkpoints can do for your Spark dataframes and go through a Java example of how we can use them. Checkpoint on Dataframe: In v2.1.0, Apache Spark introduced […]

Spark Recipes Updated

This week has seen the release of Apache Spark v2.0.0. As with every major release, you can expect some changes. My Java recipes for Apache Spark have been affected, but […]

Let's be social

@jgperrin

/jgperrin

/jgperrin
