How to design a cloud-based data warehousing solution using Amazon Redshift, the petabyte-scale data warehouse in AWS.
- Choosing Sort Keys: sorting enables efficient handling of range-restricted predicates.
- Distribution Styles
- Tutorial: Tuning Table Design. In this tutorial, you will learn how to optimize the design of your tables. You will start by creating tables based on the Star Schema Benchmark (SSB) schema without sort keys, distribution styles, and compression encodings.
- 10 Best Practices for Amazon Redshift Spectrum
- Amazon Redshift Best Practices for Designing Tables: table design choices also have a significant effect on storage requirements, which in turn affects query performance by reducing the number of I/O operations and minimizing the memory required to process queries.
- TICKIT Sample Database Schema
- Data Format Parameters for the COPY command
- Zero Administration Amazon Redshift Database Loader: with this AWS Lambda function, you simply push files into a variety of locations on Amazon S3 and have them automatically loaded into your Amazon Redshift clusters.
- Amazon Redshift Deep Dive: Tuning and Best Practices, AWS re:Invent 2015 (BDT401)
- Amazon Redshift Engineering’s Advanced Table Design Playbook: AWS Data Blog series from 2016 covers Distribution Styles and Distribution Keys, Compound and Interleaved Sort Keys, Compression Encodings and Table Data Durability.
- Querying the Catalog Tables – Amazon Redshift
- Top 8 Best Practices for High-Performance ETL Processing Using Amazon Redshift
- Amazon Redshift Query Plan
- Concurrency Scaling for Amazon Redshift: Peak Performance at All Times
- Resizing Clusters in Amazon Redshift
- Amazon Redshift Advisor Recommendations
- This GitHub repository provides a collection of scripts and utilities that will assist you in getting the best performance possible from Amazon Redshift.
- Amazon Redshift Parameter Groups can be used to configure database settings such as query timeout and datestyle.
- Analyzing Tables – Automatic analyze is enabled by default.
- VACUUM: re-sorts rows and reclaims space in either a specified table or all tables in the current database. Amazon Redshift automatically sorts data and runs VACUUM DELETE in the background.
- Performing a Deep Copy
- Run Mixed Workloads with Amazon Redshift Workload Management
- Working with Concurrency Scaling – automatically adds additional cluster capacity when you need it to process an increase in concurrent read queries.
- Implementing Automatic WLM
- What is a data lake?
- Migrating Your Traditional Data Warehouse to a Modern Data Lake, AWS re:Invent 2017 (ABD327)
- Building Data Lakes on AWS: AWS white paper.
- AWS Lake Formation makes it easy to set up a secure data lake in days.
- Environmental control with
- Automating Landing Zones
- Building AWS CloudFormation Templates
- Automated Deployments using AWS Perspective (design to code)
- Service Auto Scaling for Amazon Elastic Container Service
- EC2 Auto Scaling Groups with Multiple Instance Types & Purchase Options
- ELB and Auto Scaling
- Autoscaling and SQS
- Clustered Auto-scaling with Containers
- Auto scaling and Kubernetes
- CloudFormation Drift Detection
- First Steps for Modelling Relational Data in DynamoDB
- Amazon DynamoDB Deep Dive: Advanced Design Patterns (DAT403-R1), AWS re:Invent 2019
- Amazon DynamoDB On-Demand: No Capacity Planning and Pay-Per-Request Pricing.
- Amazon DynamoDB Transactions provide atomicity, consistency, isolation, and durability (ACID) across one or more tables within a single AWS account and region.
- Amazon Quantum Ledger Database (QLDB) is a fully managed ledger database that provides a transparent, immutable, and cryptographically verifiable transaction log owned by a central trusted authority.
- Amazon Timestream is a fast, scalable, fully managed time series database service for IoT and operational applications that makes it easy to store and analyse trillions of events per day at 1/10th the cost of relational databases.
- How Aurora Serverless Works
- Using the Data API for Aurora Serverless
- Using Database Activity Streams with Aurora PostgreSQL
- Amazon S3 Block Public Access: Another Layer of Protection for Your Accounts and Buckets
- Using Amazon S3 Block Public Access
- How Do I Block Public Access to S3 Buckets?
- AWS DataSync
- New – Amazon FSx for Windows File Server
- Running Containers on AWS using Amazon ECS and AWS Fargate: this guide is designed to help you get started with Docker containers on Amazon ECS and AWS Fargate quickly and easily.
- Amazon ECS Workshop for AWS Fargate: configure GitHub, CodePipeline, CodeBuild, VPC, ALB, ECS, Autoscaling, and log aggregation using a framework called Mu to launch microservices on Amazon Elastic Container Service.
- Amazon EKS Workshop: explore multiple ways to configure VPC, ALB, and EC2 Kubernetes workers, and Amazon Elastic Container Service for Kubernetes.
- Hands-on Workshops for Serverless Computing
- Best Practices for Organizational Units with AWS Organizations
- Trusted Advisor: further information.
- AWS Resource Access Manager: Cross-Account Resource Sharing
- Introducing AWS Global Accelerator and Elastic Fabric Adapter
- AWS License Manager
- AWS App Mesh: Monitor and Control Microservices
- AWS Cloud Map: Service discovery for cloud resources
- AWS Workshops: hands-on events designed to teach or introduce practical skills, techniques, or concepts which you can use to solve business problems.