Supervised and unsupervised data mining
a. Define and explain what supervised and unsupervised data mining are and provide an example for each.
b. Define and explain the terms data mining and big data and describe the relationship between the two. Provide a real-world example of how data mining is being used and for what purpose it is being used.
c. Explain what report authoring, report management, and report delivery are and the business purpose each serves within a typical business organization
Sample Answer
Unlocking the Power of Data: A Guide to Data Mining and Beyond
Here’s a breakdown of the concepts you asked about:
a. Supervised vs. Unsupervised Data Mining:
-
Supervised Data Mining: This involves training a model on a labeled dataset, meaning the data contains both input features and the desired output or target variable. The goal is to predict the output for new, unseen data based on the patterns learned from the labeled dataset.
-
Example: Training a model on historical customer data (age, income, purchase history) and their corresponding credit score to predict the creditworthiness of new applicants.
-
-
Unsupervised Data Mining: This deals with unlabeled data, where the goal is to discover hidden patterns, structures, and relationships in the data without a predefined target variable.