How to Build an ELT with Python
Extracting, Loading and Transforming Data- 21931Murphy ≡ DeepGuide
How to Set Up a Simple ETL Pipeline with AWS Lambda for Data Science
Introduction to ETL with AWS Lambda When it comes time to build an ETL pipeline, many options exist. You can use a tool like Astronomer or Prefect for Orchestration, but you will also need somewhere to run the compute. With this, you have a few options: V- 26696Murphy ≡ DeepGuide
Cooking with Snowflake
Snowflake optimisation The Snowflake community is rife with information dumps on how to optimize expensive queries. We know because we combed through a ton of them. What we present here are three tactical ways in which we’ve done this at Toplyne. To- 26759Murphy ≡ DeepGuide
Data Modeling Techniques For Data Warehouse
Data modeling is a process of creating a conceptual representation of the data and its relationships within an organization or system...- 29366Murphy ≡ DeepGuide
The Docker Compose of ETL: Meerschaum Compose
This article is about Meerschaum Compose, a tool for defining ETL pipelines in YAML and a plugin for the data engineering framework...- 29434Murphy ≡ DeepGuide
Building a Batch Data Pipeline with Athena and MySQL
An End-To-End Tutorial for Beginners- 23715Murphy ≡ DeepGuide
Automating ETL to SFTP Server Using Python and SQL
Learn how to automate a daily data transfer process on Windows, from PostgreSQL database to a remote server- 21511Murphy ≡ DeepGuide
Calculate the percentage of the total with RLS in place in Power BI
With RLS in place, some users can see only some aspects of the data set. But what happens when they must compare the result to the total?- 27855Murphy ≡ DeepGuide
Pandas for Data Engineers
Advanced techniques to process and load data efficiently- 26268Murphy ≡ DeepGuide
Database Data Transformation for Data Engineers
Advanced techniques for beginners- 25425Murphy ≡ DeepGuide
Deploy Long-Running ETL Pipelines to ECS with Fargate
To keep things simple and costs to a minimum- 26998Murphy ≡ DeepGuide
What Does it Take to Get into Data Engineering in 2024?
Career advice for aspiring data practitioners- 27165Murphy ≡ DeepGuide
3 Essential Questions to Address When Building an API-Involved Incremental Data Loading Script
This article explains both the conceptual framework and practical code implementation for syncing data from API endpoints to your database.- 28818Murphy ≡ DeepGuide
Efficient Testing of ETL Pipelines with Python
How to Instantly Detect Data Quality Issues and Identify their Causes- 26025Murphy ≡ DeepGuide
Why ETL-Zero? Understanding the shift in Data Integration
When I was preparing for the Salesforce Data Cloud certification, I came across the term Zero-ETL. The Data Cloud offers the possibility...- 28709Murphy ≡ DeepGuide
5 Simple Projects to Start Today: A Learning Roadmap for Data Engineering
Start with 5 practical projects to lay the foundation for your data engineering roadmap.- 22556Murphy ≡ DeepGuide
We look at an implementation of the HyperLogLog cardinality estimati
Using clustering algorithms such as K-means is one of the most popul
Level up Your Data Game by Mastering These 4 Skills
Learn how to create an object-oriented approach to compare and evalu
When I was a beginner using Kubernetes, my main concern was getting
Tutorial and theory on how to carry out forecasts with moving averag