Anatomy of Windows Functions
Theory and practice of an underappreciated SQL operation- 22471Murphy2025-03-22
MLOps – Data Validation with PyTest
Run deterministic and non-deterministic tests to validate your dataset- 22451Murphy2025-03-22
SQL Explained: Ranking Analytics
What they are and how you use them- 26677Murphy2025-03-22
How I Built an LLM-Based Game from Scratch
Part I: Game concepts and Causal Graphs for LLMs- 27793Murphy2025-03-22
ASA's Caution: Rethinking How We Use p-Values in Research
Understanding the ASA's statement to enhance your data science practices- 22317Murphy2025-03-22
The Math Behind KAN – Kolmogorov-Arnold Networks
A new alternative to the classic Multi-Layer Perceptron is out. Why is it more accurate and interpretable? Math and Code Deep Dive.- 21125Murphy2025-03-22
How to Pivot Tables in SQL
A comprehensive guide to creating pivot tables in SQL for enhanced data analysis- 20611Murphy2025-03-22
CUDA for AI – Intuitively and Exhaustively Explained
Parallelized AI from scratch in CUDA- 27867Murphy2025-03-22
Model Interpretability Using Credit Card Fraud Data
Why model interpretability is important- 28724Murphy2025-03-22
Simplifying the Python Code for Data Engineering Projects
Python tricks and techniques for data ingestion, validation, processing, and testing: a practical walkthrough- 26998Murphy2025-03-22
Mastering AI Department Reorganizations: Lessons from the Trenches
Do's and Dont's after five years of Data Science department reorgs- 26480Murphy2025-03-22
Welch's t-Test: The Reliable Way to Compare 2 Population Means with Unequal Variances
Discover why Welch's t-Test is the go-to method for accurate statistical comparison, even when variances differ.- 25306Murphy2025-03-22
How Many Pokemon Fit?
Finding the best Pokemon team by modeling and solving a knapsack problem with PokeAPI and PuLP optimization Python library- 22368Murphy2025-03-22
Mapping the Pokemon World: A Network Analysis of Habitat-Based Encounters
An introduction to Network Analysis in Python, along with a practical example using Pokemon data from the PokeApi- 26925Murphy2025-03-22
Time Series Regression and Cross-Validation: A Tidy Approach
Step by step guide to EDA, feature engineering, cross validation and model comparison with tidymodels, modeltime and timetk.- 30007Murphy2025-03-22
Understanding Buffer of Thoughts (BoT) – Reasoning with Large Language Models
New prompt engineering tool for complex reasoning, compared with Chain of thought (CoT) and Tree of Thought (ToT)- 22029Murphy2025-03-22
Deep Learning Illustrated, Part 4: Recurrent Neural Networks
An illustrated and intuitive guide on the inner workings of an RNN and the Softmax Activation Function- 21914Murphy2025-03-22
A Python Engineer's Introduction to 3D Gaussian Splatting (Part 2)
Understanding and coding how Gaussian's are used within 3D Gaussian Splatting- 26725Murphy2025-03-22
AI Agent Unit Testing in Langfuse
Creating a scalable testing solution for AI agents for operation by non-coders- 20455Murphy2025-03-22
My Easy Guide to Pre vs. Post Treatment Tests
A quick introduction to Before and After Tests with code.- 21187Murphy2025-03-22
Genius Cliques: Mapping out the Nobel Network
Combining Network Science, Data Visualization, and Wikipedia to uncover hidden connections between all the Nobel laureates.Data Science Expertise Comes in Many Shapes and Forms
Our weekly selection of must-read Editors' Picks and original features