AI-ML
2025
The State of Local and Affordable Inference in October 2025
·1379 words·7 mins
An overview of the current landscape of GPUs and AI compute for local inference as of October 2025 from Nvidia and AMD to Intel, Apple, and the cloud.
Testing DeepSeek-OCR: Vision Text Compression for LLMs
·466 words·3 mins
Notes from testing DeepSeek-OCR as a local vision-language model for OCR and text compression on a large archive of screenshots. Includes observations on model performance, visual-token compression, and multilingual results.
Gemini Pro 2.5 in October 2025: decent text, shaky coding, tricky tradeoffs
·903 words·5 mins
A brief look at Gemini Pro 2.5 compared with ChatGPT 5 and Claude Opus/Sonnet, plus notes on Gemini 2.5 variants, NotebookLM, and mobile privacy concerns.
NVIDIA DGX Spark: underwhelming and late to the Party
·890 words·5 mins
NVIDIA’s DGX Spark arrives late as an AI inference system whose performance is lagging behind. With low speed unified VRAM, immature software optimizations, and heavy competition from Apple, AMD, and Intel, the Spark exposes how little remains of NVIDIA’s CUDA moat.
LLM false metric generation
·654 words·4 mins
There is a lot of synthetic data that is being generated by LLMs, These include false metrics.
ChatGPT 5 Working Guide: Practical Tips for Better Results
·725 words·4 mins
Creating a practical, no nonsense guide for using ChatGPT-5 efficiently by manual model switching and prompt style to each task while avoiding wasted tokens and bad outputs.
The Problem With Proprietary LLM Providers: Removing Model Access without recourse
·414 words·2 mins
OpenAI’s removal of GPT-4o, o3, and other models after GPT-5’s launch breaks fundamental MLOps principles. Without model versioning and control, data science workflows become unreliable. Local LLMs offer a better alternative for maintaining consistency.
AI Capex between 2022-2025
·506 words·3 mins
Lets explore the data of the AI infrastructure investments from NVIDIA, AMD, and hyperscalers like GCP, AWS, and Azure.
AI milestones in 2025
·111 words·1 min
2025 brought us Claude 4, MCP, Agents, GPT-5, o3 reasoning. A timeline of artificial intelligence developments, breakthroughs, and milestones throughout 2025
OpenAI Shows Someone Else's Conversations
·993 words·5 mins
I sent a picture to ChatGPT for OCR and it gave me a response that leaked from somewhere.
