Big Data Analysis

766 Words2 Pages

Summary: Along with the rapid development of Information Technology, “Big Data” is becoming more and more popular nowadays. Although, most of people are potentially getting touch with “Big Data” every day, they might still do not have any knowledge about it. On March 28 2014, Dr. Phil Chan, Dr. Ryan Stansifer and Dr. Debasis Mitra who are the professors and researchers of Florida Institute of Technology together brought us, “Big Data: A Closer Look”. In the presentation, they introduced the development and utility of “Big Data” in a variety of fields and some researches. First of all, Dr. Chan introduced the main idea of “Big Data” using four V’s. They are: Volume, Velocity, Variety and Veracity. First, “Big Data” is large Volume, millions, even billions of data. Second, in a single minute, there are millions of data are being processed in the world. Third, variety kinds of data are available like video, text etc. Last, the data need to be accurate. He addressed that those huge amounts of data not only need to be stored, but also need to be analyzed. The approaches of analyzing those data involve Data Mining, Machine Learning, Clustering and so on. After these processes, people could predict whether, recommend products to others like amazon, or organizing articles like Google news. Secondly, Dr. Stansifer expressed that “Big Data” is a computational thinking and most of the time it is invisible. He also illustrated his research about SPAM in E-mails. He mentioned that to understand what SPAM is, people need to analysis huge amount of E-mails. Enron Corpus which is a database containing more than 600,000 E-mails could be used as a dataset for SPAM E-mails pattern formation. [1] To analysis those E-mails, Dr. Stansifer intro... ... middle of paper ... ...e presentation. Also, in this presentation, Dr. Chan, Dr. Stansifer and Dr. Mitra have given some daily life examples like how Amazon recommends products to customers. All those examples would make people feel that “Big Data” is not that mysterious. However, for the structure of this presentation, the last part which was presented by Dr. Mitra is relatively longer. It could be a standalone presentation. In my opinion, only the core and “Big Data” related materials of last part need to be included in this presentation. So, the audience would not feel difficult about the biology part and still enjoy the whole presentation. In summary, Dr. Chan, Dr. Stansifer and Dr. Mitra have given us a very brilliant presentation. Everyone would enjoy this seminar and learn something from it. The only drawback is the length of this seminar is too long for a 50 minutes’ seminar.

More about Big Data Analysis

Open Document