Episode 102 – Roaring News

Big Data News at the end of the summer is not easy to find, but we did end up with three topics to discuss: from isolating GPUs in Hadoop 3.x to replicating big data (to the cloud) and quick tips from Adam’s blog. Continue reading “Episode 102 – Roaring News”

Episode 101 – Apache Pulsar update with Matteo and Sijie from Streamlio

Matteo and Sijie from Streamlio reached out to us and let us know they had an update on Apache Pulsar. It turned out they had a lot to talk about so we cut the interview in two parts and here is the first part where they introduce Apache Pulsar, go in depth on the correct deployment scaling of a stable Pulsar cluster and clarify Pulsars “at least once vs exactly once” strategy. Part two will go in more depth on what’s new. Stay tuned! Continue reading “Episode 101 – Apache Pulsar update with Matteo and Sijie from Streamlio”

Episode 100 – Celebrating our Centennial with the history of Hadoop

100 Big Data episodes! We made it, in no small part thanks to our audience: you are who keeps us going! In this episode we celebrate our centennial by going over the history of Hadoop releases, highlighting the most noteworthy events along the way. Join us down the twisty paths of our  memory lanes! Continue reading “Episode 100 – Celebrating our Centennial with the history of Hadoop”

Episode 99 – The State of Big Data at Codemotion Amsterdam

The Roaring Elephant podcast was a guest at the Codemotion conference in Amsterdam a little while ago. This episode contains the audio of the talk we did on the State of Big Data. Continue reading “Episode 99 – The State of Big Data at Codemotion Amsterdam”

Episode 98 – Roaring news

In this episode of Big Data Roaring News, Dave laments another announcement of Hadoop’s demise and exposes A.I. imposters. Jhon has articles comparing Ranger with Sentry and Apache Nifi reaching the ripe age of 1.7 with a Minifi charged practical demo to prove the point. Continue reading “Episode 98 – Roaring news”