Scaling Monosemanticity: Anthropic's One Step Towards Interpretable & Manipulable LLMs
From prompt engineering to activation engineering for more controllable and safer LLMs- 21453Murphy ≡ DeepGuide
How LLMs Think
Research paper in pills: "Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet"- 24665Murphy ≡ DeepGuide
Take a Look Under the hood
Using Monosemanticity to understand the concepts a Large Language Model learned- 29462Murphy ≡ DeepGuide
Towards Monosemanticity: A Step Towards Understanding Large Language Models
Understanding the mechanistic interpretability research problem and reverse-engineering these large language models- 22267Murphy ≡ DeepGuide
We look at an implementation of the HyperLogLog cardinality estimati
Using clustering algorithms such as K-means is one of the most popul
Level up Your Data Game by Mastering These 4 Skills
Learn how to create an object-oriented approach to compare and evalu
When I was a beginner using Kubernetes, my main concern was getting
Tutorial and theory on how to carry out forecasts with moving averag