The problem with Big Data is not the Data
The real problem with Big Data isn't volume—it's knowing what you want to achieve and starting with clear business challenges, not technology.
Loading...
Preparing your content
6 posts in Big Data
The real problem with Big Data isn't volume—it's knowing what you want to achieve and starting with clear business challenges, not technology.
Key insights from IBM Research's webinar featuring Netflix and StubHub on implicit data collection, recommendation strategies, and the evolution from BI to Data Science.
Forget petabytes and Hadoop hype — true Big Data isn't about volume, it's about processing two orders of magnitude more data than you currently handle.
Updated ZipFileInputFormat framework for processing thousands of ZIP files in Hadoop with failure tolerance and comprehensive examples
[](http://www. flickr.
Custom utility classes to extract and parse ZIP file contents in Hadoop MapReduce jobs using ZipFileInputFormat and ZipFileRecordReader