Computationally intensive tasks are everywhere now. We are using resource-intensive techniques such as LLMs and Generative AI a lot these days. Whoever uses precious resources would know how daunting it is to do the same task again, even though we know th- 26338Murphy2025-03-23
How do hazards and maximum likelihood estimates predict event rankings?- 21898Murphy2025-03-23
Get the most out of PyArrow support in pandas and Dask right now- 27924Murphy2025-03-23
This is Part 7 of the series Time Series for Climate Change. List of articles: Part 1: Forecasting Wind Power Part 2: Solar Irradiance Forecasting Part 3: Forecasting Large Ocean Waves Part 4: Forecasting Energy Demand Part 5: Forecasting Extreme Weather- 26253Murphy2025-03-23
Background – Spearheading AI Strategy at an Established Bank Prior to founding Data-Centric Solutions, I spearheaded the development, execution, and implementation of the AI strategy at an established bank. Although relatively green in delivering at- 22717Murphy2025-03-23
Mastering the producer-consumer pattern with asyncio through real-life examples- 24749Murphy2025-03-23
A (biased) history of open sourcing AI libraries and models I joined the Google Brain team in 2015 right as TensorFlow was open sourced. Contrary to popular belief, TensorFlow was not the secret sauce behind Google’s success at that point in time. O- 21149Murphy2025-03-23
Deploy BART on Amazon SageMaker Real-Time Inference- 29305Murphy2025-03-23
Why do we need queries, keys AND values?- 23272Murphy2025-03-23
The Realities of Freelancing in Data Science- 29195Murphy2025-03-23
How and why write effective prompts if you work with data- 29009Murphy2025-03-23
Mastering open-source language models: diving into Falcon-40B- 23085Murphy2025-03-23
Personal Tales Into Data Science In one of my previous articles, I detailed my experience of transitioning into machine learning from cognitive science and the imposter syndrome that taunted me. In that article, I mentioned: "An idea began to slowly- 26168Murphy2025-03-23
A guide to implementing Text2SQL to power data-driven organisations- 24979Murphy2025-03-23
Sure, data quality is everyones’ problem. But who owns the solution? Given the variations in approach and mixed success, we have a lot of natural experiments from which to learn. Some organizations will attempt to diffuse the responsibility widely a- 29359Murphy2025-03-23
Cleaning up survey responses using OpenAI's GPT Model. Full Code with Github link.- 24933Murphy2025-03-23
Overview Have you ever faced a scenario wherein you’ve tried to extract a crucial data point from a transactional system (such as an e-commerce system) using its API, only to discover that the necessary information was not accessible through the pro- 28227Murphy2025-03-23
Reinforcement learning (RL) can do amazing stuff. Most recently, ChatGPT is fine-tuned on human feedback with PPO, a variant of a class of reinforcement learning algorithm called Policy Gradient (PG). Understanding RL, especially policy gradient, could be- 29034Murphy2025-03-23
"It’s okay to write it this way as long as I understand it now, and the good thing is, it works! I manage to magically churns out a pretty good result with my model, what a good one to end off the day." No, I’m here to tell you that&- 29625Murphy2025-03-23
How to choose the best machine for your ML workload- 23274Murphy2025-03-23
Why is ChatGPT only trained up until 2021?
Learn how to rearrange your code to achieve significant speed improvements.