
Redshift vs Redshift Spectrum
Amazon Redshift is a petabyte scalable columnar data warehouse that is very efficient in storing raw data and collecting data from various sources. Redshift su…
Read More »

Amazon Redshift is a petabyte scalable columnar data warehouse that is very efficient in storing raw data and collecting data from various sources. Redshift su…
Read More »
Data products are the end result of file or data movements to the cloud; ETL; processing; de-duplication; curation and storage in a consumable layer. There is …
Read More »
A typical Technology Stack for a Data Lake. S3 as the Golden Source. Snowflake as a corporate Data Share with SQL use cases. If AWS-S3 and Redshift are not pro…
Read More »
(ETL engine in the above could be AWS Glue) There are various ways to define performance and what that means. A simple way to be consistent with management is …
Read More »
Iceberg Cometh Open table formats, such as Apache Iceberg, enable scale-out data warehousing directly on a data lake. This architecture has become known as a d…
Read More »
Traditional Data Product Management Federated data management and data product builds and sharing has little to do with traditional data product management. Tr…
Read More »
Moving data from S3 to Snowflake to satisfy use cases around analysis, corporate reporting, or cross-domain information collaboration is best achieved through …
Read More »
If you are building a Data platform the first consideration should be around data quality, cleaning, traceability, ownership and reporting on quality. If the d…
Read More »
There are components to a Data Architecture, namely a Data Model, a Reference Architecture, and a Star Graph. A Data Architecture is the language and represent…
Read More »