Over the past 9 months, I’ve been working on a book to be published by O’Reilly Media. This past week, Data Pipelines Pocket Reference was…
In an ideal world, data engineers are presented with a way of ingesting data (the Extract and Load steps in ELT) from source systems that’s…
The death of the data warehouse, long prophesied, seems to have always been on the horizon yet never realized. Much like cold fusion power and fully…
Deep breath. In and out. I’ve been doing a lot of that in 2020, and I know I’m not alone. I hesitated to write a…
Workflow management platforms are what data engineers use to schedule and coordinate the steps in a data pipeline – an activity sometimes referred to as…
Loading data that’s been stored in an S3 bucket into a Snowflake data warehouse is an incredibly common task for a data engineer. In an…
Though the title of this post may sound obvious to some, it’s not how most organizations function. Many excellent software engineers are led to believe…
Data Engineers are a hot commodity in 2020, but it’s surprising how misunderstood they are. Are they a software engineer with a hyped-up job…
A while back, I wrote a post on why ELT is preferable to ETL with Amazon Redshift and other modern data warehouses such as Snowflake…
In the last few years, there’s been a noticeable shift at cutting-edge organizations in how data teams are structured. No longer is data engineering…