The past few years have seen an explosion of new technologies for storing, analyzing and displaying the enormous amount of data available to businesses today. Feature articles on big data, predictive analytics and data science, rarely heard about a few years ago, regularly appear in the mainstream business press. Studies from major think tanks and management consulting companies have shown again and again that companies can leverage technologies such as Hadoop, RDBMS, Hive, Pig, YARN, Tez, Spark, Sqoop and so on, to generate new, significant benefits from their existing data assets.
The pressure to “do something” related to big data is growing. But do what, exactly? Businesses are eager to acquire the latest and greatest technology, both to gain a competitive advantage and to demonstrate to their stockholders that they operate on the cutting edge of their industry. However, hidden below the excitement of publicized successes lies the reality that achieving these intended goals is neither automatic nor assured.
This was the start of a whitepaper that I wrote as a Senior Data Scientist for CSC, a Fortune 500 IT services firm. To read the entire six-page pdf, simply click here: whitepaper.