Access and Feeds

The Big Data Paradox: Why Structured Data Still Dominates IT Spending

By Dick Weisinger

Big data is a term that refers to the massive amounts of data that are generated, collected, and analyzed by various organizations and industries. Big data can be classified into two types: structured and unstructured. Structured data is data that has a predefined format and can be easily stored and queried in databases, such as numbers, dates, and text. Unstructured data is data that has no fixed format and can be more complex and diverse, such as images, videos, audio, social media posts, and sensor data.

One might expect that unstructured data would account for most of the IT spending on data management, given its rapid growth and potential value. However, according to a recent report by IDC, structured data still drives most of the spending on compute and storage infrastructure, with $34.7 billion in 2022. This is followed by AI lifecycle workloads, which include platforms mostly related to AI training tasks, with $4.4 billion. Unstructured databases are still one of the smallest consumers of compute and storage infrastructure, with $2.3 billion.

Why is this the case? There are several possible reasons:

  • Structured data is still the backbone of many business processes and applications, such as transactions, analytics, and reporting. It requires high performance, reliability, and security, which translate into higher costs.
  • Unstructured data is more challenging to store, manage, and analyze than structured data. It requires more advanced technologies and skills, such as cloud computing, machine learning, and natural language processing. These are not yet widely adopted or mature enough to handle the scale and complexity of unstructured data.
  • Unstructured data is often underutilized or ignored by many organizations. They may not have a clear strategy or vision for how to leverage unstructured data for business value. They may also lack the tools and processes to extract insights from unstructured data and integrate them with structured data.

These reasons suggest that there is a huge opportunity for organizations to tap into the potential of unstructured data and gain a competitive edge in the market. However, this also requires a shift in mindset and culture, as well as investments in technology and talent. Some of the steps that organizations can take to achieve this are:

  • Adopt a data-centric approach that treats data as a strategic asset and aligns it with business goals and outcomes.
  • Implement a data governance framework that defines the roles, responsibilities, policies, and standards for managing data quality, security, privacy, and compliance.
  • Deploy a data platform that enables seamless integration, storage, processing, and analysis of structured and unstructured data across different sources and locations.
  • Empower users with self-service tools that allow them to access, explore, visualize, and act on data without relying on IT or external vendors.
  • Develop a data culture that fosters collaboration, innovation, and learning from data across the organization.

Unstructured data is a gold mine of information that can transform businesses and industries. However, it also poses significant challenges and risks that need to be addressed. By adopting a holistic and proactive approach to unstructured data management, organizations can unlock its value and achieve better results.

Digg This
Reddit This
Stumble Now!
Buzz This
Vote on DZone
Share on Facebook
Bookmark this on Delicious
Kick It on DotNetKicks.com
Shout it
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter
Google Buzz (aka. Google Reader)

Leave a Reply

Your email address will not be published. Required fields are marked *

*