The most popular and comprehensive Open Source ECM platform
One problem with straight-up Hadoop solutions has been the difficulty of performing ad-hoc queries across the entire data set. Cloudera now offers a solution that combines Apache Solr with Hadoop, a step that could greatly shrink the skill gap that exists in many businesses wanting to use Hadoop.
Mike Olson, Chief Strategy Officer and Chairman of the Board of Cloudera, said that ”we’re bringing the band back together with Cloudera Search. Based on 100% open source Apache Solr, a Lucene project and another Doug Cutting original, Cloudera Search is now fully integrated into our industry leading CDH big data platform. After a successful private beta, it’s the latest in a series of major innovations that we’ve brought to market designed to speed up and simplify an organization’s ability to get the most out of their data. We are further democratizing access to mission-critical information stored in Hadoop by ensuring those without programming expertise can gain insight, find patterns and derive true value from their information assets.”
Olson said that “the key benefit is that anybody can now use this platform. When Hadoop first appeared on the market, the knock against it by the existing analytics vendors was that you had to learn this new MapReduce thing and you have got to be a Java programmer. We have added SQL, but there are people who don’t know that language, either. People want to search for data they know exists in their cluster, but with a petabyte of data, there is not set of folders that makes sense any more. What we have learned from Google is that we just want to type terms into a search box.”
Charles Zedlewski, Vice President of Products at Cloudera, said that “There’s all sorts of data that never fit readily in a row or column. You could always store that data in Hadoop, but getting it out was exceptionally technically difficult… Tens of thousands of people know how to write MapReduce, millions of people can do SQL queries, but billions of people know how to use a search engine. This kind of data is now available for people who don’t know SQL, but who do know how to Google their data. It opens up data in a Hadoop cluster to a much wider audience of users.”