Friday, June 22, 2012

Real Time Big Data Analytics at Hadoop Summit

I took the opportunity on June 12 to attend BigDataCamp. The real-time analytics session in the unconference portion was led by Michael Hummel, CEO of ParStream. It ran less than 30 minutes because the conference center was shutting down for the day.

My take from the short discussion:
* Hadoop is good for adding value to data, not for consuming data; it is not designed for "real time".

What is real time? It is predictability: a "decision time" that falls within the "allowed" time. That raises the question of who actually defines the allowed time.
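To make that definition concrete, here is a minimal sketch in Python. It is my own illustration, not something shown in the session: the function name `is_real_time`, the millisecond units, and the sample numbers are all assumptions. It treats a system as "real time" only if every observed decision time fits within the allowed time, which is one way to read "predictability".

```python
def is_real_time(decision_times_ms, allowed_ms):
    """Return True if every observed decision time fits within the allowed time.

    Hypothetical helper: 'real time' here means the worst case is bounded,
    not that the average is fast.
    """
    if not decision_times_ms:
        raise ValueError("need at least one observed decision time")
    # Predictability is about the slowest decision, so compare the maximum.
    return max(decision_times_ms) <= allowed_ms


# Made-up decision times in milliseconds, for illustration only.
observed = [120, 340, 95, 410]

print(is_real_time(observed, allowed_ms=500))  # True
print(is_real_time(observed, allowed_ms=300))  # False
```

The same measurements pass or fail depending on `allowed_ms`, which is the point: whoever sets the allowed time decides whether the system counts as real time.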

So is real-time big data analytics possible? It's not a fantasy, according to GigaOM in 2011 (article). Some vendors (e.g. IBM) choose not to use the words "real time" in their products and solutions because there are different definitions. Depending on how "real time" is defined, real-time analytics is entirely possible. Take a look at Jike Chong's notes on different fraud detection use cases from the February SF Hadoop User Group here and form your own assessment.
