DISCLAIMER! The best AI coding tool is the one available to you, that gives you the best model and...
Sequoia Capital has recently published a blog post arguing that AGI has been achieved because...
Both GitHub Copilot and Cursor offer ways to define guardrails for agents in the form of Instructions...
Three months ago, I tested Kiro's Spec-Driven Development (SDD) workflow and walked away impressed...
Kiro is a yet another VSCode fork (just like Cursor or Windsurf) that integrates AI coding features....
LLMs are known to struggle with math. Not in those PhD level tasks from AIME eval, where the...
I've learned it the hard way... If you use the recently released Grok-3 Mini reasoning model (which...
Microsoft CEO Satya Nadella said that "as much as 30% of the company’s code is now written by...
This will be a quick post. I've ran the recent OpenAI models through LLM Chess eval: o4-mini and o3...
I have recently touched on how diffusion/transformer models come into new domains - specifically the...
Meta has brought the long-awaited Llama 4 models on Saturday, April 5. Llama 3 came out on April 26,...
In March we saw 2 major releases of image generation tools ((Google, OpenAI)) that are very much...
Recently, I stumbled upon a post about SGLang, an open-source LLM inference engine that boasts 2-5x...
OpenAI has recently presented its newest reasoning model - o3-mini. At Medium and High reasoning...
Last week, Chinese Big-Tech company Alibaba released its best model to date: Qwen-Max. It is a...
In late 2023, people complained about GPT-4 Turbo's laziness - often the model didn't complete tasks....
Deepseek R1 is out - available via Deepseek API or free Deepseek chat. If you are following LLM/Gen...
Hallucination is also a latent fear accompanying the copy-pasting of a long scroll of text from a...
OpenAI has teased the o3 model today—a further development of the "reasoning" model and a successor...
Phi-4 14B has been recently released. Benchmarks look promising, e.g. it beats GPT-4o in Math: I...
Today Google has presented a major update to its flagship SOTA model - Gemini 2.0. What caught my...
A company decided to rename its product XYZ to ABC. Assets, texts in the app, and web pages were...
UPD September 15, 2025: Reasoning models opened a new chapter in Chess performance, the most recent...
One of the most popular open-source frameworks of 2023, which explored the application of AI agents...
The new open model by NVidia, Nemotron 70B has recently been the hotspot of "this is wild", "this is...
This is the 3rd part of my investigations of local LLM inference speed. Here're the 1st and 2nd...
"When sensationalism wins over nuance, we lose our ability to think." is a great quote by Lex Fridman...
Yesterday OpenAI introduced "A new series of reasoning models for solving hard problems", which...
First Impressions I had low expectations from Continue.dev when I first installed it in...
Python 3.13 is due to be released in October, yet the first release candidate was published earlier...