Access and Feeds

Big Data: Challenges Facing Hadoop in the Enterprise

By Dick Weisinger

Hadoop has become the de facto tool for managing Big Data.  Hadoop provides a way to organize, search, share and analyze data that was collected from many different data sources.  And scalability from a few to dozens to thousands of servers is one of Hadoop’s most attractive features.

But will enterprises be able to fully take advantage of the benefits of Big Data analytics that Hadoop offers?

David Richards, CEO at WANdisco, took on that question and said: “imagine a world in which one bank has Hadoop and everyone else is using proprietary storage – it is not going to work.  One bank will have a massive competitive advantage because they can scale up storage at ten percent the cost of everybody else.  The market for big data has got nothing to do with anything other than this: the requirement for data storage is growing at a sixty percent compound annual growth rate and budgets are not. So you can’t use proprietary storage to solve that problem.  It isn’t a technologically-driven market place; it is an economically-driven market place.”

The problem is that Hadoop isn’t easy — at least not yet, although it is getting easier.  Enterprises are finding that taking advantage of Hadoop is challenging.  Tony Baer, principal analyst at Ovum, identified two reasons for this:

Distributed Architecture.  “For one thing, it’s a distributed architecture, not a database.  For anyone with database experience, distributed architectures have always been very difficult hurdle to deal with.”  Hadoop’s distributed architecture is different from traditional enterprise software, and it doesn’t come prepackaged with tools like self-tuning, security and lifecycle management.

Complexity.  A second factor is that it is a collection of projects rather than a single monolithic piece of software, and choosing the right pieces and learning them can be complex.  And more recently, with the introduction of tools like Spark, it is enabling real-time analytics.  “It [real-time capabilities] multiplies its value to organizations, but also multiplies the potential complexities you have to deal with.”

Digg This
Reddit This
Stumble Now!
Buzz This
Vote on DZone
Share on Facebook
Bookmark this on Delicious
Kick It on DotNetKicks.com
Shout it
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter
Google Buzz (aka. Google Reader)
One comment on “Big Data: Challenges Facing Hadoop in the Enterprise
  1. kesava says:

    In real time what type of problems we are facing in hadoop,hive,pig,sqoop. please share this information,i need that information.

Leave a Reply

Your email address will not be published. Required fields are marked *

*