Participating in my first podcast ever: of course, we talked Spark and data management

A couple of weeks ago, I chatted with Tobias Macey about data engineering and, more specifically, Apache Spark. Tobias runs the Data Engineering Podcast, which you can access directly on his website or through iTunes.

Together we covered various topics, such as what Spark is, or more precisely what Apache Spark is for me. We talked use cases, data scientists, data engineers (spoiler alert: we talked more about engineering than science), setting up a cluster, developing for Spark, and how it compares with other technologies like Flink, Kafka, or Storm. Naturally, we talked IBM and Informix. Apache Spark v2.4 is out, and we briefly went over the history of Spark.

We also talked a bit more about my background and my motivation for writing “Spark in Action, 2nd edition” with Manning. I did not think that would interest anyone, but I gladly shared why I am doing it.

Finally, I must admit, it was my first podcast. I have been on the radio and on TV, but this is my first podcast… If you are courageous enough to listen to all of it, you will find discount codes and even a raffle for a few free books. Enjoy.

Updates:

  • 2018-12-19: added a direct link to the podcast and embedded the player in the post.