The Scope of AWS
A data lake on AWS provides organizations with the ability to store data of any type (structured and unstructured) in a centralized repository. This engagement is a collaboration between Customer IT team and Data-FactZ engineers to understand the business and technology requirements for a Data Lake solution on AWS. Below is the scope of this engagement:
-
Identify Data Lake pilot use case (fits into the 4-week window).
-
Cost optimized collection, storage and serving of data from sources (max 2) on the identified use case.
-
Operations plan for orchestrating developed POC modules.
-
Enable visualizations/reporting on stored data using existing BI tools or open-source frameworks.
-
Provide a reference architecture and future roadmap for subsequent use-case implementations