CMU 15-445 Lecture #24: Distributed OLAP Databases
目录
CMU 15-445 Database Systems
Lecture #24: Distributed OLAP Databases
Decision Support Systems
-
OLAP其他称呼,就是分析数据进行公司的决策分析的
-
两种“数据”架构
- Star Schema
- Snowflake Schema
-
Star Schema
- 对比
- 雪花型更省空间,而且避免了规范化的问题(前面星型的枚举人们可能用不同的词汇描述,比如一个人记low,另一个记bad)
- 雪花型的查询更复杂,跑的会更慢
Execution Models
-
PUSH QUERY TO DATA
-
PULL DATA TO QUERY
Query Planning
Distributed Join Algorithms
- 把可以JOIN的部分集中到一个节点JOIN
Cloud Systems
- Newer systems are starting to blur the lines between shared-nothing and shared-disk.
- Example: You can do simple filtering on Amazon S3 before copying data to compute nodes.