ISQS 6347 Data & Text Mining
Projects
Project 2
Project 2 takes the same format as that for project 1
with the following differences:
1) Project 2 may use the same dataset as used in Project
1, but the focuses must be differentiated from those in Project1. If the same dataset is used, you don’t need to
explain too much of the dataset if you have done this well in the project 1
report.
2) Using a new dataset is also a good choice. Topics of Text mining are encouraged.
1) You solve the problem with a comprehensive data
mining model beyond the issues addressed in the class, demonstrating that you
have self-taught and did creative work in using the tool.
2) You present very well the issues in the report based
on the data analysis outcomes, which leads to some significant findings that could
potentially be the contribution to the research literature.
3) You use a real dataset from some business and the
topic and the findings of the data mining results have important implications
to the specific business background.
4) You have solved some special technical problem in using
SAS Enterprise Miner. You need to explain how you solve the problem.
Group Presentations:
-------------------------------------------------------------------------------------------------------------
Project 1 (check the Example)
This project will allow
students to practice data mining methods and SAS EM skills learned from the
class. The project will be done on the individual basis. The following are the
suggested steps to fulfill the project:
Step 1: Identify a project
topic and determine the objectives of the data mining project. Find an
available dataset for the project. You can use one of the datasets you found
for homework 1.
Step 2: Study and understand
the dataset by exploring it. Pay attention at the quality of the data (any
missing value), the meaningful attributes (variables), attributes (variables)
distributions, and the types of variable values.
Step 3: Choose Decision
Tree, Clustering, OR Association
Analysis and use SAS Enterprise Miner to develop a data mining model upon
the dataset.
Step 4: Fine tune the model
and try to explain the outcomes of the data mining as much as possible with
regard to the project objectives
Step 5: Conceive a project
report based on the data mining analysis outcomes
The project report is the
final deliverable for the project. It includes the following sections:
Issues during project
accomplishment:
The project report is due on
March 30.