???????
Problem 4. Snowflake Schema (18 points) Suppose that a data warehouse for Big Ten universities consists of the four dimensions: student, course, semester, and instructor, and two measures: avg grade and count. Each dimension has a concept hierarchy, e.g., for student, the concept hierarchy is "student < major < college < university < all"; for course, the concept hierarchy is "course < department < college < university < all ", etc. At the lowest conceptual level, i.e., for a given student, course, semester, instructor combination, the avg grade measure stores the actual course grade of the student. At higher levels of the concept hierarchy for one or more dimensions, avg grade stores the average grade for the given combination. (1) (8 points) Assume that student has attributes S1?,S2?,S3,key?,S4?, where S3,key? has attributes S3,key?,S3,1?,S3,2?,S3,3?; course has attributes C1?,C2,key?,C3?, where C2,key? has attributes C2,key?,C2,1?; semester has attributes T1?,T2?, and instructor has attributes I1?,I2?. Draw a snowflake schema diagram for the data warehouse, where the dimension tables will be based on the attributes of the dimensions. (2) (5 points) Starting with the base cuboid [student, course, semester, instructor], what specific OLAP operations (e.g., roll-up, drill down, etc.) should you perform in order to list the average grade of Computer Science courses (i.e., department = "Computer Science" for dimension course) for each student. (3) (5 points) If each of the four dimensions in the data warehouse has five levels (including all), e.g., "student < major < status < university < all" for student, how many cuboids will the data cube contain (including the base and apex cuboids)? Clearly justify your answer. and we have the option of not having a dimension.