What Is The Difference Between Data Mining And Data Analysis?

1048 Words3 Pages

Top 10 Data Analytics Interview Questions #1- What is the difference between Data Mining and Data Analysis? Data Mining Data Analysis A hypothesis is not required for Data Mining Data analysis begins with a hypothesis. Data Mining demands clean and well-documented data. Data analysis involves data cleaning. Results of data mining are not always easy to interpret. Data analysts interpret the results and present it to the stakeholders. Data mining algorithms automatically develop equations. Data analysts have to develop their own equations. #2- Mention what are the various steps in an analytics project? Data analytics deals with collecting, cleansing, transforming and modelling data to gain valuable insights and support better decision making …show more content…

#7- Explain K-mean Algorithm and Hierarchical Clustering Algorithm? K-Mean Algorithm - K mean is a famous partitioning method. In the K-mean algorithm, the clusters are spherical i.e. the data points in a cluster are centered on that cluster. Also, the variance of the clusters is similar i.e. each data point belongs to the closest cluster Hierarchical Clustering Algorithm - Hierarchical clustering algorithm combines and divides existing groups and creating a hierarchical structure for them to show the order in which groups are divided. #8- What is data cleansing? Mention few best practices that you need to follow while doing data cleansing? From a given dataset, it is extremely important to sort the information required for data analysis. Data cleaning is a crucial step wherein data is inspected to find any anomalies, remove repetitive and incorrect information, etc. Data cleansing does not involve removing any existing information from the database, it just enhances the data quality so that it can be used for analysis. Some of the best practices for data cleansing include …show more content…

• Simplex algorithm • Mathematical optimization #10- Explain what is imputation? List out different types of imputation techniques? Which imputation method is more favourable? During imputation, we have a tendency to replace missing information with substituted values. The kinds of imputation techniques involve are – • Single Imputation: Single imputation denotes that the missing value is replaced by a value. In this method, the sample size is retrieved. • Hot-deck imputation: A missing value is imputed from a randomly selected similar record by using punch card • Cold deck imputation: It works same as hot-deck imputation, but a little more advanced and chooses donors from other datasets • Mean imputation: It involves replacing missing value with the predicted values of other variables. • Regression imputation: It involves replacing missing value with the predicted values of a certain value depending on other variables. • Stochastic regression: It is same as regression imputation, however, it adds the common regression variance to the regression imputation • Multiple Imputation: Unlike single imputation, multiple imputations estimates the values multiple

More about What Is The Difference Between Data Mining And Data Analysis?

Open Document