A while back, I wrote a post on why ELT is preferable to ETL with Amazon Redshift and other modern data warehouses such as Snowflake…
In the last few years, there’s been a noticeable shift at cutting edge organizations in how data teams are structured. No longer is data engineering…
When I read about the $30 Million dollar series C funding raised by Crunchbase this morning, it reminded me of a common struggle I see…
Whenever you visit a website, fill out a signup form, or request data from an API, you’re most likely making an HTTP POST or GET…
Cron is perhaps the most universally used scheduling tool in the data engineering community. It has been battle tested over decades on Linux and Unix…
Last week I had the pleasure of attending and speaking at the MinneBOS 2019 data science and analytics conference in Boston. The folks at MinneAnalytics…
In a previous post, I wrote about using the COPY command to load data from an S3 bucket into a Redshift table. In this post,…
Python is a programming language that’s popular with software engineers, data engineers and data scientists alike. The 2.x branch of Python (aka “Python 2”) was…
Considering building a data warehouse in Amazon Redshift? It’s a great option, even in an increasingly crowded market of cloud data warehouse platforms. If you’re…
If you don’t yet have a data science team, you may have moments of panic feeling that you’re falling behind. It’s true, data science and…