ISQS 6339 Lecture Notes

 

Instructor: Zhangxi Lin

 

------- + ------ + ------- + ------- + ------- + -------

 

Lecture 1 & 2, 03/21/2017, Tuesday

 

Topic: Introduction

1)       Basic BI Concepts

2)       BI trend

 

Review questions:

 

1)     What is BI?

2)     Why is BI getting hot?

 

Topic: Database vs. data warehousing

 

1)       Database vs. data warehousing

2)       Big data

 

Terminology: data, information, knowledge, business intelligence, big data, artificial intelligence, data ware housing

 

Reading:

1)     How Much Information? 2003” (http://www2.sims.berkeley.edu/research/projects/how-much-info-2003/execsum.htm)

2)     Find a BI application case from the web, and understand how it works.

3)     Find paper “CACM2011 Overview of BI.pdf” in the network drive under ~\Texts\Readings\. Read it carefully.

 

------- + ------ + ------- + ------- + ------- + -------

 

Lecture 3 & 4, 03/23/2017, Thursday

 

Topic: Anatomy of Business Intelligence

 

1)       DW concepts

2)       Data warehousing with Microsoft SQL Server 2008

 

Topic: Data warehousing

 

1)       Demonstration

2)       Concepts of data warehousing

3)       Architecture of data warehouse

4)       Data Integration

5)       Hadoop & Spark

6)       Exercise 0 – database review using IMW data

 

References:

1)     SQL Server Data Warehousing http://technet.microsoft.com/en-us/sqlserver/dd421879.aspx

2)     SQL Server Tutorial for Beginners http://www.youtube.com/watch?v=ZNObiptSMSI&list=PL08903FB7ACA1C2FB

 

------- + ------ + ------- + ------- + ------- + -------

Lecture 5 & 6, 03/28/2017, Tuesday

Topic: Data Modeling

1)     Quiz 1

2)     Dimensional modeling

3)     Unified dimensional model

4)     Data warehousing methodology

5)     Three phases of dimensional modeling

 

References:

 

1)     Inmon vs. Kimball - Book

2)     Inmon vs. Kimball - comments

3)     Kimball and Inmon DW Models

4)     Design of data warehouse

Topic: Creating data mart

 

1)     Exercise 1

2)     Fact tables

3)     Dimension tables

 

------- + ------ + ------- + ------- + ------- + -------

 

Lecture 7 & 8, 03/30/2017, Thursday

 

Topic: Data warehousing

 

1)     Quiz 1 review

2)     Exercise 2 - Create a data mart

3)     How to create data mart with SSAS - demo

 

Online References:

1)     Measures: http://en.wikipedia.org/wiki/Measure_%28data_warehouse%29

2)     Dimensions: http://en.wikipedia.org/wiki/Dimension_%28data_warehouse%29

3)     Slowly changing dimensions : http://en.wikipedia.org/wiki/Slowly_changing_dimension

4)     Surrogate keys : http://www.geekinterview.com/question_details/17591

5)     Aggregate: http://en.wikipedia.org/wiki/Aggregate_%28Data_Warehouse%29

6)     Many-to-many relationship in data warehousing: http://www.pythian.com/news/364/implementing-many-to-many-relationships-in-data-warehousing/

7)     Degenerate dimensions: http://en.wikipedia.org/wiki/Degenerate_dimension

 

Homework assignment 1 (optional):

 

Search the web to find the comprehensive explanations of the following terms. Give an expample for each of them.

1)     Junk dimensions

2)     Many-to-many or multivalued dimensions

3)     Degenerate dimensions

 

Homwork is due a week later after the class meeting. Send your answers by email isqs6347@gmail.com, with a subject title "ISQS 6339 homework 1 - <your name>". A late submission with a week after the deadline is also fine, but may not receive the timely feedback.

 

 

------- + ------ + ------- + ------- + ------- + -------

 

Lecture 9 & 10, 04/04/2017, Tuesday

 

 

Topic: ETL system development (1)

 

1)     Exercise 3 – Create data mart with Visual Studio

2)     Term project orientation

3)     An introduction to SSIS

4)     Control flow tasks

5)     Data flow items

 

Topic: ETL system development (2)

 

1)     ETL application debugging

2)     Exercise 4 – Populating Dimension Tables for Maximum Miniatures Manufacturing Data Mart (Guidelines)

 

 

------- + ------ + ------- + ------- + ------- + -------