First Semester
MIT - 101
MIT - 102
MIT - 103
MIT - 104
MIT - 105
MIT - 106
MIT - 107
MIT - 108
Second Semester
MIT - 201
MIT - 202
MIT - 203
MIT - 204
MIT - 205
MIT - 206
MIT - 207
MIT - 208
Third Semester
MIT - 301
MIT - 302
MIT - 303
MIT - 304
MIT - 305
MIT - 306
MIT - 308
Fourth Semester
|
Time: 3hrs Max. Marks: 75 Attempt any five questions. All questions carry equal marks
- Question.
- Briefly describe all phases in the KDD process. How does KDD differ from data mining?
- Given an initial population {<101010>, <001100>, <010101>, <000010>}, apply the genetic algorithm to find a better population. Suppose the fitness function is defined as the sum of bit values for each individual and that mutation always occurs by negating the second bit. The termination condition is that the average fitness value for the entire population must be greater than 4. Also, an individual is chosen for crossover only if its fitness is greater than 2.
- Question.
- Discuss briefly various data mining issues.
- Apply Bayesian classification on the following set of data: -
| Number |
Gender |
Height |
Class |
| 1 | F | 1.6 | Short |
| 2 | M | 2.0 | Tall |
| 3 | F | 1.9 | Medium |
| 4 | F | 1.88 | Medium |
| 5 | F | 1.7 | Short |
| 6 | M | 1.85 | Medium |
| 7 | F | 1.6 | Short |
| 8 | M | 1.7 | Short |
| 9 | M | 2.2 | Tall |
| 10 | M | 2.1 | Tall |
| 11 | F | 1.8 | Medium |
| 12 | M | 1.95 | Medium |
| 13 | F | 1.9 | Medium |
| 14 | F | 1.8 | Medium |
| 15 | F | 1.75 | Medium |
and classify the tuple < , M, 1.9>
- Question.
- Define decision tree and write a decision tree based classification algorithm.
- How the splitting attribute is selected in decision tree based algorithm. Show it for the training set given in 2(b).
- Question.
- Write the K-means clustering algorithm and apply the same for the following data to create three clusters: -
{2, 4, 10, 12, 3, 20, 30, 11, 25}
- A major problem with single link algorithm is that clusters consisting of long chains may be created. Describe and illustrate this concept.
- Question.
- Define the concept of association rule and describe Apriority algorithm.
- Apply apriority algorithm for the following transactions to generate association rules taking s = 20% and = 40% -
| Transaction | Items |
| t1 | Bread, Jam, Butter |
| t2 | Bread, Butter |
| t3 | Bread, Milk, Butter |
| t4 | Curd, Bread |
| t5 | Curd, Milk |
| t6 | Bread, Milk |
- Question.
- Describe Web mining with the help of suitable examples.
- Given the following sessions: -
{< A, B, A, C>, < C, B, D, F>, < A, B, A>}, Indicate the sequential patterns, forward sequences and maximal frequent sequences assuming a minimum support of 30%. Assume that each session occurs from a different user.
- Question.
- What do you understand from spatial data mining? Describe briefly some of its applications.
- Describe briefly various data structures used in spatial data mining.
- Question.
- What do you understand from temporal data mining? Describe briefly some of its applications.
- Plot the following time series values as well as the moving average found by replacing a given value with the average of it and ones preceding and following it: -
{5, 15, 7, 20, 13, 5, 8, 10, 12, 11, 9, 15}. For the first and last values, you are to use only the two values available to calculate the average.
|