Data Lake Implementation (Health Care)
Challenges
- Rapid growth in data volume and complexity
- Newly acquired business units and partners deliver diverse data inputs
- Need for a centralized repository that holds both structured and unstructured data at scale
- No templates or well-architected reference solutions in place
Industry: Health Care
UK Agency
Solution
- Data Lake solution feeding Data Warehouses and BI analytics in AWS
- Data pipeline pattern spanning source files, Amazon S3, AWS Glue, Amazon Redshift, Amazon QuickSight and Amazon Athena (querying data in S3)
- AWS security best practices for data security and HIPAA compliance
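
The first hop of the pipeline pattern above (source files landing in S3) can be sketched as follows. This is a minimal illustration and not the project's actual layout: the `raw` prefix, the `landing_key` helper and the `partner_a` source name are all assumptions. It uses Hive-style `key=value` partition folders, a convention that AWS Glue crawlers and Athena recognise.

```python
from datetime import date
from pathlib import PurePosixPath

# Hypothetical zone name; the case study does not describe its S3 layout.
RAW_PREFIX = "raw"


def landing_key(source_system: str, filename: str, received: date) -> str:
    """Build a Hive-style partitioned S3 object key for an incoming source file."""
    return str(
        PurePosixPath(RAW_PREFIX)
        / source_system
        / f"year={received.year:04d}"
        / f"month={received.month:02d}"
        / f"day={received.day:02d}"
        / filename
    )


# Example with a made-up source system and file name.
print(landing_key("partner_a", "claims.csv", date(2024, 1, 5)))
# prints: raw/partner_a/year=2024/month=01/day=05/claims.csv
```

Partitioning by arrival date is only one possible choice; it tends to suit daily batch loads, where downstream queries filter on a date range.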
Benefits
- Data is now segmented into value streams: operational, security, application, customer
- Different data types are now collected and analysed (structured, unstructured, semi-structured)
- Data storage is optimised
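
One way to picture the segmentation described above is a small routing function that files each object under its value stream and data type. This is a sketch under stated assumptions: the suffix-to-type mapping and the `route` helper are hypothetical, and only the four value stream names come from the case study.

```python
from pathlib import PurePosixPath

# Value streams named in the case study.
VALUE_STREAMS = {"operational", "security", "application", "customer"}

# Assumed mapping from file suffix to data type, for illustration only.
DATA_TYPE_BY_SUFFIX = {
    ".csv": "structured",
    ".parquet": "structured",
    ".json": "semi-structured",
    ".xml": "semi-structured",
}


def route(value_stream: str, filename: str) -> str:
    """Return a prefix path of the form <value stream>/<data type>/<filename>."""
    if value_stream not in VALUE_STREAMS:
        raise ValueError(f"unknown value stream: {value_stream}")
    suffix = PurePosixPath(filename).suffix.lower()
    # Anything without a known suffix is treated as unstructured.
    data_type = DATA_TYPE_BY_SUFFIX.get(suffix, "unstructured")
    return f"{value_stream}/{data_type}/{filename}"


print(route("security", "audit.json"))  # prints: security/semi-structured/audit.json
print(route("customer", "scan.pdf"))    # prints: customer/unstructured/scan.pdf
```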