In this episode, we take an online article by Chris Riccomini and give our view on the debate between a single big cluster and many smaller ones. If you are architecting a Hadoop cluster and are faced with this choice, this episode should give you a lot of information on the subject. Continue reading “Episode 54 – Hadoop sizing part 1: One big cluster, or many small ones”
In this episode of Roaring News, Dave brings up the newly released HDP 2.6.2, which incorporates IBM’s move from their proprietary IOP to HDP.
Jhon brings an update on the MLeap story for productionizing your Spark models. We finish off by discussing the newly released Apache Atlas version 0.8.1.
Over the summer, while your hosts enjoyed a well-earned vacation (well, we like to think we earned it), we could not stop being Big-Data Nerds, and in this episode we talk about the Hadoop opportunities we spotted. Continue reading “Episode 52 – Big data in travel”
In this news episode (our very first one), Dave is all-out on Artificial Intelligence and its use in naming “stuff”; for some subjects it apparently works very well, for other subjects not so much…
Jhon brings a blog post on deploying new Kerberos functionality and a tutorial on Kafka Connect for those who have not really looked at it. The ensuing discussion on NiFi vs Kafka is purely coincidental. Continue reading “Episode 51 – Roaring News”
This is the final part of our long interview with Alan Gates. In this part, Alan talks more about ODPi, Cloud First, Apache Flink and Apache Pig, and we finish off with a little bit of philosophy.
A big thank you to Alan for sharing his pearls of wisdom with us!