Big data is a collection of data sets so large and complex that they become difficult to process using traditional database management tools. The difficulty can lie in data capture, storage, search, sharing, analysis, visualization, and so on.
Big data is commonly explained using the 4 V’s:
(i) Volume: The size of the data is very large, typically measured in terabytes and petabytes.
(ii) Velocity: Data streams into the enterprise continuously and should be used as it arrives in order to maximize its value to the business. Both personal and corporate data are growing exponentially.
(iii) Variety: Big data extends beyond structured data to include semi-structured and unstructured data of all kinds: text, audio, video, posts, log files, location sensor data, etc.
(iv) Variability: The format and structure of data change continuously and rapidly. Traditional databases are very slow to adapt to such change. For example, it took years for traditional RDBMSs to add an XML column type, native JSON column types arrived only years after JSON was in wide use (for example, in PostgreSQL 9.2 and MySQL 5.7), and many more new formats will appear in the future.
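The variety and variability points can be illustrated with a minimal Python sketch. It assumes a few hypothetical semi-structured log records and a fixed two-column schema; the record contents, the field names, and the helper names `split_record` and `unexpected_fields` are invented for illustration and do not come from any particular system. The sketch only shows that records which gain new fields over time do not fit a schema declared up front.

```python
import json

# Hypothetical semi-structured records; all field names are illustrative only.
RAW_RECORDS = [
    '{"user": "a", "action": "login"}',
    '{"user": "b", "action": "post", "text": "hello"}',
    '{"user": "c", "action": "checkin", "location": {"lat": 12.9, "lon": 77.6}}',
]

# Assumed rigid schema: the columns a traditional table would declare in advance.
FIXED_COLUMNS = ("user", "action")


def split_record(raw, columns=FIXED_COLUMNS):
    """Return (known, extra): fields that fit the fixed schema and fields that do not."""
    record = json.loads(raw)
    known = {name: record.get(name) for name in columns}
    extra = {name: value for name, value in record.items() if name not in columns}
    return known, extra


def unexpected_fields(raw_records, columns=FIXED_COLUMNS):
    """Collect the names of all fields for which the fixed schema has no column."""
    names = set()
    for raw in raw_records:
        _, extra = split_record(raw, columns)
        names.update(extra)
    return sorted(names)


if __name__ == "__main__":
    # Lists the field names that appeared in the data but not in the schema.
    print(unexpected_fields(RAW_RECORDS))
```

In a fixed-schema table, each field reported by `unexpected_fields` would require a schema change before it could be stored in its own column, which is the kind of slow adaptation the variability point describes.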