Episode 82 – DataWorks Summit Berlin 2018 Preview

Next week is DataWorks Summit Berlin week! Your two hosts will be in attendance and in this episode we go over the agenda and plan which sessions we want to attend and why. Peppered throughout we add further insights and experiences from previous years. Continue reading “Episode 82 – DataWorks Summit Berlin 2018 Preview”

Episode 81 – Roaring News

In this installment of Big Data News, we talk about the recent Facebook leak, how everybody is still doing it wrong (according to some at least) and installing Hadoop “the old-fashioned way”. Also briefly covered is Elastic’s X-Pack, now even more “open” than before, but still rather closed it would seem. Continue reading “Episode 81 – Roaring News”

Episode 80 – Big Data Tracking

Last June, Wolfie Christl published a 93 page report Corporate Surveillance in Everyday Life using big data tracking. Apart from the massive pdf that can be downloaded on the net, an extensive summary can be found on the Cracked Labs website.

In this episode we go over the content and give our views on the subject.

Continue reading “Episode 80 – Big Data Tracking”

Episode 79 – Roaring News

Another Big Data news episode! This time we consider the Big or small nodes conundrum based on an article that after close scrutiny doesn’t really seem to test the real issue. Other things that get covered are Linkedin’s Dynanometer, Cloudera’s full production architecture advise for a recommendation service and a really interesting visualization technique based on blobs. Continue reading “Episode 79 – Roaring News”

Episode 78 – Apache Trafodion transactional SQL for Hadoop (Part 2)

This episode, a group of people from Esgyn join us to talk about the Apache Trafodion transactional SQL for Hadoop database engine.

In this second part Rohit, Ken and Rao talk about the internal workings and best practices of Apache Trafodion. Continue reading “Episode 78 – Apache Trafodion transactional SQL for Hadoop (Part 2)”