Distributed Llama 2 on CPUs
A toy example of bulk inference on commodity hardware using Python.- 22531Murphy ≡ DeepGuide
No Baseline? No Benchmarks? No Biggie! An Experimental Approach to Agile Chatbot Development
Lessons learned bringing LLM-based products to production- 24938Murphy ≡ DeepGuide
6 Common LLM Customization Strategies Briefly Explained
From Theory to practice: understanding RAG, agents, fine-tuning, and more- 28295Murphy ≡ DeepGuide
Enhancing RAG: Beyond Vanilla Approaches
Retrieval-Augmented Generation (RAG) is a powerful technique that enhances language models by incorporating external information retrieval mechanisms. While standard RAG implementations improve response relevance, they often struggle in complex retrieval- 28704Murphy ≡ DeepGuide
This Is How LLMs Break Down the Language
The science and art behind tokenization- 21503Murphy ≡ DeepGuide
We look at an implementation of the HyperLogLog cardinality estimati
Using clustering algorithms such as K-means is one of the most popul
Level up Your Data Game by Mastering These 4 Skills
Learn how to create an object-oriented approach to compare and evalu
When I was a beginner using Kubernetes, my main concern was getting
Tutorial and theory on how to carry out forecasts with moving averag