Big Data File Formats, Explained
Parquet vs ORC vs AVRO vs JSON. Which one to choose and how to use them?
Working with Hugging Face Datasets
Learn how to access the datasets on Hugging Face Hub and how you can load them remotely using DuckDB and the Datasets library
Faster DataFrame Serialization
Read and Write DataFrames Up to Ten Times Faster than Parquet with StaticFrame NPZ
Saving Pandas DataFrames Efficiently and Quickly – Parquet vs Feather vs ORC vs CSV
Speed, RAM, size and convenience. Which storage method is best?
Anatomy of a Parquet File
Parquet from scratch: A Python deep dive into a raw Parquet file
We look at an implementation of the HyperLogLog cardinality estimati
Using clustering algorithms such as K-means is one of the most popul
Level up Your Data Game by Mastering These 4 Skills
Learn how to create an object-oriented approach to compare and evalu
When I was a beginner using Kubernetes, my main concern was getting
Tutorial and theory on how to carry out forecasts with moving averag