Q1
Use the following dataset to find the Euclidean, Manhattan, and Minkowski distances
between objects A and B. Assume the Minkowski order lambda (h) is 3.
Object / Attribute    Cost    Time    Weight    Incentive
Object A              0       3       4         5
Object B              7       6       3         -1
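
As a way to check hand calculations for Q1, the three distances can be treated as one formula: the Minkowski distance of order h is the h-th root of the sum of |a_i - b_i|^h, with h = 1 giving Manhattan and h = 2 giving Euclidean. The sketch below assumes this standard definition; the function name minkowski_distance and the variable names are illustrative and not part of the assignment.

```python
def minkowski_distance(a, b, h):
    """Minkowski distance of order h between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same number of attributes")
    # Sum the h-th powers of the absolute differences, then take the h-th root.
    return sum(abs(x - y) ** h for x, y in zip(a, b)) ** (1.0 / h)


# Attribute order: Cost, Time, Weight, Incentive (values from the table above).
object_a = [0, 3, 4, 5]
object_b = [7, 6, 3, -1]

manhattan = minkowski_distance(object_a, object_b, 1)    # h = 1
euclidean = minkowski_distance(object_a, object_b, 2)    # h = 2
minkowski_3 = minkowski_distance(object_a, object_b, 3)  # h = 3, as the question assumes

print(f"Manhattan:       {manhattan:.4f}")
print(f"Euclidean:       {euclidean:.4f}")
print(f"Minkowski (h=3): {minkowski_3:.4f}")
```

Using a single function for all three cases is a design choice made for brevity; computing each distance separately by hand should give the same values.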
Q2
What is data sampling in data mining? Explain four (4) types of data sampling
techniques.
Q3
Do you think improvements in database technology contributed to the surge in data
mining? Explain your answer in detail.
Q4
Suppose that the data for analysis includes the attribute age. The age values for the
data tuples are (in increasing order) 13, 14, 15, 16, 20, 20, 20, 20, 20, 23, 24, 24, 25,
26, 27, 28, 30, 30, 30, 30, 30, 32, 33, 35, 36, 37, 39, 44, 45.
(a) What is the mean of the data?
(b) What is the median?
(c) What is the mode of the data?
(d) Comment on the data’s modality (i.e., bimodal, trimodal, etc.).
(e) Show a histogram of the data (10 bars).
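
Parts (a) to (e) of Q4 can be cross-checked with a short script. The sketch below uses Python's standard statistics module for the mean, median, and mode(s), and assumes that "10 bars" means ten equal-width bins spanning the minimum to the maximum age; that binning choice and the helper name equal_width_counts are assumptions, not something the question specifies.

```python
import statistics

# Age values exactly as listed in Q4 (already in increasing order).
ages = [13, 14, 15, 16, 20, 20, 20, 20, 20, 23, 24, 24, 25,
        26, 27, 28, 30, 30, 30, 30, 30, 32, 33, 35, 36, 37, 39, 44, 45]

mean_age = statistics.mean(ages)
median_age = statistics.median(ages)
modes = statistics.multimode(ages)  # every value tied for the highest frequency


def equal_width_counts(values, bins):
    """Count values in `bins` equal-width intervals from min to max (assumed binning)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum value falls on the upper edge, so clamp it into the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return counts, lo, width


counts, lo, width = equal_width_counts(ages, 10)

print(f"mean = {mean_age:.2f}, median = {median_age}, modes = {modes}")
for i, c in enumerate(counts):
    left = lo + i * width
    print(f"{left:5.1f} to {left + width:5.1f} | {'#' * c}")
```

The number of values returned by multimode indicates the modality asked about in part (d). A different bin width or starting point would give a differently shaped histogram, so the bars here are one possible answer to part (e).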