CS 4/59995 Fall 2009 Introduction to Data Mining




Homework 1 (Due 9/16/09; no late submissions are accepted)


1. Discuss whether or not each of the following activities is a data mining task:

   a. Dividing customers according to their gender

   b. Dividing the customers of a company according to their profitability

   c. Computing the total sales of a company

   d. Sorting a student database based on student identification number

   e. Predicting the outcomes of tossing a pair of dice

   f. Monitoring the heart rate of a patient for abnormalities

2. Describe the differences between a database and a data warehouse. Which is more difficult to support?

3. Generate a data mining system architecture.

4. Defining term frequency as the number of occurrences of a term in a document, compute the TF-IDF score of each word in this text.

5. What is the difference between a false positive and a false drop?

6. Exercise 3 on page 45 of your textbook

7. Exercises 2-4 on page 19 of your textbook
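
For question 4, the shape of the computation can be sketched as below. This is a minimal sketch under stated assumptions, not a model answer. Term frequency is the raw count of a term in a document, as the question defines it. The assignment does not give an IDF formula, so the sketch assumes IDF = ln(N / df), where N is the number of documents and df is the number of documents containing the term; your textbook may use a different base or a smoothed variant. The function name `tf_idf`, the whitespace tokenization, and the toy corpus are all hypothetical and are not taken from the assignment.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: score} dict per document.

    TF is the raw count of a term in a document (the definition used in
    question 4). IDF is ASSUMED to be ln(N / df); the assignment does not
    specify it.
    """
    # Assumption: lowercase and split on whitespace; no stemming or stop words.
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: the number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))

    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)  # raw occurrence counts within this document
        scores.append(
            {term: count * math.log(n_docs / df[term]) for term, count in tf.items()}
        )
    return scores


# Hypothetical toy corpus, for illustration only.
corpus = [
    "data mining finds patterns in data",
    "a data warehouse stores data",
    "mining is hard work",
]
scores = tf_idf(corpus)

for doc, doc_scores in zip(corpus, scores):
    print(doc)
    for term, score in sorted(doc_scores.items()):
        print(f"  {term}: {score:.3f}")
```

Under this assumed IDF, a term that occurs in every document scores zero, and a term that is frequent in one document but rare across the collection scores highest.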