Quiz 6 Review Outlines:

Use Tableau to explore the datasets used in EG-EX1 & 2. Test the following Tableau features:

1)      Dataset connection

2)      Dataset filtering

3)      Add a new column into a table

4)      Joining tables

5)      Data graphing

6)      Statistical analysis

Quiz 5 Review Outlines:

SAS Enterprise Guide Topics

1.       Import a text file

2.       Create one-way & two-way frequency reports

3.       Create queries

4.       Add new columns

5.       Data summarization

6.       Data graphing
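As a reminder of what topic 2 means, the difference between a one-way and a two-way frequency report can be sketched outside of SAS Enterprise Guide. This is only an illustration in plain Python, not the Enterprise Guide task itself; the `rows` data and the column meanings (region, product) are hypothetical.

```python
from collections import Counter

# Hypothetical rows: (region, product) pairs standing in for an imported text file.
rows = [
    ("East", "A"),
    ("East", "B"),
    ("West", "A"),
    ("East", "A"),
    ("West", "B"),
    ("West", "B"),
]

# One-way frequency: number of rows for each value of a single column.
one_way = Counter(region for region, _ in rows)

# Two-way frequency: number of rows for each combination of two columns.
two_way = Counter(rows)

for region, count in sorted(one_way.items()):
    print(region, count)
for (region, product), count in sorted(two_way.items()):
    print(region, product, count)
```

The two-way counts always add up to the same total as the one-way counts, which is a quick check when reading a cross-tabulation.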

Review materials and exercises:

Simply finish the assignments for Chapters 2-5 in EG-EX1 to EG-EX3.

Quiz 4 Review Outlines:

Questions

1.       What is the relationship between Hadoop and Spark?

2.       What are the main components of Hadoop-based data warehousing?

3.       What are the differences between HBase and Hive?

4.       What are the products used in Hadoop’s ETL?

5.       What are the main features of Hadoop ETL products?

6.       Will Spark replace Hadoop? Why?

7.       What is HQL? What is its relationship with SQL?

Answering the questions above may require some online research.

Review materials and exercises:

1.       Lecture slides (Hadoop&Spark, HDFS/Hive)

2.       Your exercises (BigData Ex 1 & 2)

3.       Readings about Hadoop, Spark, HDFS, Hive, HBase, NoSQL, etc.

----------------------

 

Quiz 3 Review Outlines:

Questions

1.       What is data integration?

2.       What are ETL topics?

3.       What are the major functions of SQL Server SSIS?

4.       What is referential integrity? How does it affect the process of ETL?

5.       What are the three types of measures?

6.       What are the aggregate functions in SSIS?

7.       What is granularity in data warehousing?

8.       Review the types of dimensions.
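For question 4, one way to see why referential integrity matters to ETL is load order: a fact row cannot be loaded before the dimension row it points to. The sketch below uses Python's built-in `sqlite3` rather than SQL Server or SSIS, and the `DimProduct` and `FactSales` tables are hypothetical names chosen only for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")
conn.execute("CREATE TABLE DimProduct (ProductKey INTEGER PRIMARY KEY, Name TEXT)")
conn.execute(
    "CREATE TABLE FactSales ("
    " SaleID INTEGER PRIMARY KEY,"
    " ProductKey INTEGER NOT NULL REFERENCES DimProduct(ProductKey),"
    " Amount REAL)"
)

# Loading the fact row first violates referential integrity and is rejected.
try:
    conn.execute("INSERT INTO FactSales VALUES (1, 100, 25.0)")
    loaded_before_dimension = True
except sqlite3.IntegrityError:
    loaded_before_dimension = False

# Loading the dimension row first lets the same fact row load cleanly.
conn.execute("INSERT INTO DimProduct VALUES (100, 'Widget')")
conn.execute("INSERT INTO FactSales VALUES (1, 100, 25.0)")
fact_rows = conn.execute("SELECT COUNT(*) FROM FactSales").fetchone()[0]
print(loaded_before_dimension, fact_rows)
```

The same idea applies to any ETL tool: dimension loads are usually sequenced before fact loads, or unmatched keys are handled explicitly.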

Review materials and exercises:

1.       Lecture slides (BI-06 & BI-08)

2.       Your exercises (4, 5 & 6)

----------------------

 

 

Quiz 2 Review Outlines:

Questions

1.       How do you design a dimensional model? Review the cases covered in class, then solve the following case.

 

The following three tables are in a course enrollment database.

STUDENT (StudentNumber, CustomerLastName, CustomerFirstName, Phone, IntStudentFlag, AthleteFlag)

COURSE (CourseNumber, Course, StartDate, Cost)

ENROLLMENT (StudentNumber, CourseNumber, PaidDate, PaidFlag, RetakeFlag)

1)      Identify the measures

2)      Design the attributes for an enrollment fact table:

FactEnrollment( ,  ,  , …). Highlight the primary key.

3)      Identify 3-4 dimension tables, and indicate their types

 

2.       What are two types of dimensional models?

3.       What is UDM?

4.       How do you plan a data mart project?

5.       What are the three types of fact tables? Provide examples.

6.       What are the 7 popular dimension tables? Provide examples for at least three of them.
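To experiment with the course enrollment case in question 1, the three source tables can be loaded into an in-memory database and queried. This is a practice sketch, not an answer key: the column types, the composite key on ENROLLMENT, and all sample rows are assumptions, and treating the enrollment count and course cost as candidate measures is only one possible reading of the case.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Source tables from the case; column types and keys are assumed for illustration.
conn.executescript("""
CREATE TABLE STUDENT (
    StudentNumber INTEGER PRIMARY KEY,
    CustomerLastName TEXT, CustomerFirstName TEXT, Phone TEXT,
    IntStudentFlag INTEGER, AthleteFlag INTEGER
);
CREATE TABLE COURSE (
    CourseNumber INTEGER PRIMARY KEY,
    Course TEXT, StartDate TEXT, Cost REAL
);
CREATE TABLE ENROLLMENT (
    StudentNumber INTEGER REFERENCES STUDENT(StudentNumber),
    CourseNumber INTEGER REFERENCES COURSE(CourseNumber),
    PaidDate TEXT, PaidFlag INTEGER, RetakeFlag INTEGER,
    PRIMARY KEY (StudentNumber, CourseNumber)
);
""")

# Hypothetical sample rows.
conn.executemany("INSERT INTO STUDENT VALUES (?, ?, ?, ?, ?, ?)", [
    (1, "Lee", "Ann", "555-0101", 1, 0),
    (2, "Diaz", "Ben", "555-0102", 0, 1),
])
conn.executemany("INSERT INTO COURSE VALUES (?, ?, ?, ?)", [
    (10, "Intro to BI", "2024-01-15", 500.0),
    (20, "Data Warehousing", "2024-02-01", 750.0),
])
conn.executemany("INSERT INTO ENROLLMENT VALUES (?, ?, ?, ?, ?)", [
    (1, 10, "2024-01-10", 1, 0),
    (1, 20, None, 0, 0),
    (2, 10, "2024-01-12", 1, 1),
])

# Summarize candidate measures (enrollment count, total course cost)
# by one student attribute, the way a fact table would be sliced by a dimension.
result = conn.execute("""
    SELECT s.IntStudentFlag, COUNT(*), SUM(c.Cost)
    FROM ENROLLMENT e
    JOIN STUDENT s ON s.StudentNumber = e.StudentNumber
    JOIN COURSE c ON c.CourseNumber = e.CourseNumber
    GROUP BY s.IntStudentFlag
    ORDER BY s.IntStudentFlag
""").fetchall()
print(result)
```

Swapping the grouping column (for example AthleteFlag or the course StartDate) is a quick way to test which attributes behave like dimensions and which numbers behave like measures.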

Review materials and exercises:

1.       Lecture slides

2.       Your exercises

----------------------

Quiz 1 Review Outlines:

Questions

1.       Concepts/Terms:

BI, Visualization, Data mart, Data warehouse, OLAP, OLTP, metadata, Hadoop, Map/Reduce, Operational data store, 4 Vs of big data

2.       Short-answer questions

Comparison: Database vs. data warehouse, data warehouse vs. data mart, SQL Server vs. Hadoop, data vs. information vs. knowledge, OLAP vs. OLTP

Properties/characteristics: data warehouse, OLAP

3.       Other fundamental concepts and knowledge about business intelligence

4.       Basic knowledge about SQL Server BIDS
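Among the terms in question 1, Map/Reduce is often easiest to remember through a word count. The sketch below imitates the map, shuffle, and reduce steps in a single Python process; it does not use Hadoop, and the function names and sample documents are hypothetical.

```python
from collections import defaultdict

def map_phase(document):
    # Map: emit a (word, 1) pair for every word in one document.
    return [(word.lower(), 1) for word in document.split()]

def shuffle(pairs):
    # Shuffle: group all emitted values by their key.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(groups):
    # Reduce: combine the values for each key into a single result.
    return {key: sum(values) for key, values in groups.items()}

# Hypothetical input documents.
documents = ["big data big insight", "data warehouse data mart"]
pairs = [pair for doc in documents for pair in map_phase(doc)]
word_counts = reduce_phase(shuffle(pairs))
print(word_counts)
```

In a real cluster the map and reduce steps run in parallel on different nodes, with the framework handling the shuffle between them.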

Review materials and exercises:

1.       Slides and videos in the slides

2.       Basic knowledge of using Citrix SQL Server