توضیحاتی در مورد کتاب Practical Implementation of a Data Lake: Translating Customer Expectations into Tangible Technical Goals
نام کتاب : Practical Implementation of a Data Lake: Translating Customer Expectations into Tangible Technical Goals
عنوان ترجمه شده به فارسی : اجرای عملی دریاچه داده: تبدیل انتظارات مشتری به اهداف فنی ملموس
سری :
نویسندگان : Nayanjyoti Paul
ناشر : Apress
سال نشر : 2023
تعداد صفحات : 219
ISBN (شابک) : 1484297342 , 9781484297346
زبان کتاب : English
فرمت کتاب : pdf
حجم کتاب : 5 مگابایت
بعد از تکمیل فرایند پرداخت لینک دانلود کتاب ارائه خواهد شد. درصورت ثبت نام و ورود به حساب کاربری خود قادر خواهید بود لیست کتاب های خریداری شده را مشاهده فرمایید.
فهرست مطالب :
Table of Contents
About the Author
About the Technical Reviewer
Preface
Introduction
Chapter 1: Understanding “the Ask”
Objective: Asking the Right Questions
The Recommendations
Decide on the Migration Path, Modernization Techniques, Enhancements, and the Cloud Vendor
Assess the Current Challenges
Understand Why Modernizing Data Platforms Is Hard
Determine the Top Five Issues to Solve
Determine What Is Available On-Premise vs. on the Cloud
Create the Meetings Needed Throughout the Project
Define Common Terms and Jargon
Key Takeaways
Chapter 2: Enabling the Security Model
Objective: Identifying the Security Considerations
The Recommendations
PII Columns: RBAC, ABAC Features
Central Access Control
Authentication and Authorization (SAML vs. PING, etc.)
Strategy for Data Obfuscation
GDPR and Other Data Privacy
Ownership of the Platform, Interaction with Other Stakeholders (CISO, Legal Teams, etc.)
Legal/Contractual Obligations on Getting/Connecting Data from a Third Party on the Cloud
Key Takeaways
Chapter 3: Enabling the Organizational Structure
Objective: Identifying the Organizational Structure and Role
The Recommendations
Example Template for the Project
Key Takeaways
Chapter 4: The Data Lake Setup
Objective: Detailed Design of the Data Lake
The Recommendations
Structuring the Different Zones in the Data Lake
Defining the Folder Structure of the Zones with a Hierarchy
Structuring Data from Relational Stores (Raw Zone)
Structuring Data from Relational Stores (Curated Zone)
Structuring Data from Relational Stores (Provisioned/Gold Zone)
Managing Data Sensitivity as Part of the Folder Structure Design
Setting the Encryption/Data Management Keys for Organizing Data
Quick FAQs on the Data-at-Rest and Data-in-Transit Encryption
Looking at Data Management Principles
Understanding Data Flows
Setting the Right Access Control for Each Zone
Understanding File Formats and Structures in Each Zone
Key Takeaways
Chapter 5: Production Playground
Objective: Production Playground
The Recommendations
What Is a Production Playground?
What Issues Will This Address?
What Is a Production Playground Not ?
What Does the Production Playground Consist Of?
Key Takeaways
Chapter 6: Production Operationalization
Objective: Production Operationalization
The Recommendations
Key Takeaways
Chapter 7: Miscellaneous
Objective: Advice to Follow
Recommendations
Managing a Central Framework Along with Project-Specific Extensions
Allowing Project Teams to Build “User-Defined Procedures” and Contribute to the Central Framework
Advantages and Disadvantages of a Single vs. Multi-account Strategy
Creating a New Organizational Unit AWS Account vs. Onboard Teams to a Central IT Managed AWS Account
Considerations for Integrating with Schedulers
Choosing a Data Warehouse Technology
Managing Autoscaling
Managing Disaster Recovery
AWS Accounts Used for Delivery
Data Platform Cost Controls
Common Anti-patterns to Avoid
One-Size-Fits-All
Ignoring Security
Data Sprawl
Poor Data Governance
Lack of Quality Controls
Poor Metadata Management
Wrong Tools
Avoid Over-Engineering
Poor Data Integration
Unstructured Data Overload
Key Takeaways
Index