Understanding Concept Drift: A Simple Guide
How changes in the distribution arise, and the impact of verification delay.- 27484Murphy2025-03-22
The German Tank Problem
Estimating your chances of winning the lottery with sampling- 27059Murphy2025-03-22
Set up a Pypi mirror in an AWS private environment with Terraform
Use AWS CodeArtifact as a fully secured solution for setting up a Pypi mirror in a private AWS environment.- 27027Murphy2025-03-22
How to Create an RFM Model in BigQuery
Learn what an RFM model is, how to create one, and how to segment on the results- 30048Murphy2025-03-22
How to Use SQLAlchemy to Make Database Requests Asynchronously
Learn to use SQLAlchemy asynchronously in different scenarios- 26015Murphy2025-03-22
How to Generate Instruction Datasets from Any Documents for LLM Fine-Tuning
Generate high-quality synthetic datasets economically using lightweight libraries- 28343Murphy2025-03-22
Visualizing household income from government sources – A guided walkthrough
Using python code which can be applied to data from your own dataset for generating your own charts and gifs- 20526Murphy2025-03-22
Setting Up PyTorch with GPU Support on EC2 without Preconfigured AMIs
A cost-effective approach- 25345Murphy2025-03-22
How I Became a Data Scientist Before I Joined LinkedIn
These hands-on projects were crucial for career transformations- 21037Murphy2025-03-22
Time Series Forecasting: Beyond the Basics
Tips and tricks to unlock the true potential of time series forecasting in the real-world- 22105Murphy2025-03-22
How I Learned To Code (No CS Degree, No Bootcamp)
A rundown of my coding journey and advice I wish I had- 22522Murphy2025-03-22
Robust Statistics for Data Scientists Part 2: Resilient Measures of Relationships Between Variables
From basic to advanced techniques for outlier-rich data analysis.- 22086Murphy2025-03-22
How to Improve LLMs with RAG
A beginner-friendly introduction w/ Python code- 27581Murphy2025-03-22
A Sharp and Solid Outline of 3D Grid Neighborhoods
How 2D grid-based algorithms can be brought into the 3D world- 27343Murphy2025-03-22
PyTorch and MLX for Apple Silicon
A side-by-side CNN implementation and comparison- 27996Murphy2025-03-22
Data Science for Sustainability- Green Inventory Management
Simulate the impact of store delivery frequency on the CO2 emissions of a fashion retailer- 24100Murphy2025-03-22
A Priority Based Scheduler for Amazon SageMaker Training Jobs
Optimizing the use of limited AI training accelerators - Part 2- 28550Murphy2025-03-22
5 Redshift SQL Functions You Need to Know
With code examples on how to use them- 27513Murphy2025-03-22
Revolutionize Web Browsing with AI
Unlock Interactive Online Experiences Using GPT-4V and Puppeteer- 29110Murphy2025-03-22
Experiment Tracking & Hyperparameter Tuning: Organize Your Trials with DVC
Learn how to avoid getting lost with all the experiments while tuning your model's hyperparameters- 23672Murphy2025-03-22
Genius Cliques: Mapping out the Nobel Network
Combining Network Science, Data Visualization, and Wikipedia to uncover hidden connections between all the Nobel laureates.Data Science Expertise Comes in Many Shapes and Forms
Our weekly selection of must-read Editors' Picks and original features