O`Reilly 的 《 Architecting Data Lakes Data Management Architectures for Advanced Business Use Cases 》,全面介绍了数据湖的构架、工作机理、构建与管理、规划、价值、展望等诸多方面的内容。
其目录如下:
1. Overview
What Is a Data Lake?
Data Management and Governance in the Data Lake
How to Deploy a Data Lake Management Platform
2. How Data Lakes Work
Four Basic Functions of a Data Lake
Management and Monitoring
3. Challenges and Complications
Challenges of Building a Data Lake
Challenges of Managing the Data Lake
Deriving Value from the Data Lake
4. Curating the Data Lake
Data Governance
Data Acquisition
Data Organization
Capturing Metadata
Data Preparation
Data Provisioning
Benefits of an Automated Approach
5. Deriving Value from the Data Lake
Self-Service
Controlling and Allowing Access
Using a Bottom-Up Approach to Data Governance to Rank Data Sets
Data Lakes in Different Industries
6. Looking Ahead
Ground-to-Cloud Deployment Options
Looking Beyond Hadoop: Logical Data Lakes
Federated Queries
Data Discovery Portals
In Conclusion
A Checklist for Success
完整内容,可以在此下载:http://www.oreilly.com/data/free/architecting-data-lakes.csp?intcmp=il-data-free-lp-lgen_free_reports_page
也可以随时Email:Hiweb@Outlook.com 沟通探讨。
原文地址:http://hiweb.blog.51cto.com/3355462/1911174