Big Data Essay

638 Words2 Pages

1.1 Big Data

Big data[1] is defined as massive amount of data , which is difficult to process, capture, manage and analyze by traditional software techniques in reasonable time . These data can be complex, vast, diverse, heterogeneous in nature. Sources[2] of these data can be online transactions, video, emails, audio, images, logs, posts, tweets, search queries, science data, health records, sensor data, social media interactions.
In order to describe Big data, following are the properties[4][1] :-

1) Volume : Now, enterprises are awash with data. Data is in exabytes or zettabytes , rather than in tera or peta bytes.
2) Variety: Big Data comes from various sources, so data is not of single format. Basically, data is divided as - Structured, Unstructured and Semi structured. Structured data is the data which resides within fixed field . Semi structured data is tabular, relational, categorical or meta -data. Unstructured data is text , messages, tweets or posts.

3) Velocity: Velocity in Big data describes the speed at which data is created, retrieved from various sources, stored and analyzed. Data is streaming in a very unprecedented motion and should be dealt in timely manner.
4) Variability: This basically deals with inconsistencies of data flow during periodic peaks. Data loads are challengeable job to maintain during increase in usage ,that causes peak loads due to specific event triggered.
5) Veracity: This basically deals with quality and provenance of received data. Data can be characterized as good, bad, undefined, ambiguous or inconsistent[2].
1.1.1 Challenges of Big Data

• Growth of Data: With the advent in Technology and Science, we are inundated with data. Unstructured data in enterprises...

... middle of paper ...

...lysis. Sensor data is basically data at rest and data in motion. So Huge, massive data needs to be analyzed for safety and efficiency purpose. [4]
4) Social Media : Big data is most in use for social media and maintaining customer relationship[10]. Analyzing customers reviews about product, help business organizations to understand their market reputation and competitors. Analyzing a large record of users accessing sites. Analyzing and calculating number of sites accessed by different users, gives an idea of which sites gets maximum hits ,helpful to upload advertisement having maximum hits by users.

5) Risk Analysis : Financials organizations needs to process large amount of data in order to calculate risks. There is large amount of data which is still underutilized, needs adequate amount of processing and integration to be done to analyze risks patterns .[4]

Open Document