Microsoft CEO Satya Nadella said that "as much as 30% of the company’s code is now written by...
This will be a quick post. I've ran the recent OpenAI models through LLM Chess eval: o4-mini and o3...
I have recently touched on how diffusion/transformer models come into new domains - specifically the...
Meta has brought the long-awaited Llama 4 models on Saturday, April 5. Llama 3 came out on April 26,...
In March we saw 2 major releases of image generation tools ((Google, OpenAI)) that are very much...
Recently, I stumbled upon a post about SGLang, an open-source LLM inference engine that boasts 2-5x...
OpenAI has recently presented its newest reasoning model - o3-mini. At Medium and High reasoning...
Last week, Chinese Big-Tech company Alibaba released its best model to date: Qwen-Max. It is a...
In late 2023, people complained about GPT-4 Turbo's laziness - often the model didn't complete tasks....
Deepseek R1 is out - available via Deepseek API or free Deepseek chat. If you are following LLM/Gen...
Hallucination is also a latent fear accompanying the copy-pasting of a long scroll of text from a...
OpenAI has teased the o3 model today—a further development of the "reasoning" model and a successor...
Phi-4 14B has been recently released. Benchmarks look promising, e.g. it beats GPT-4o in Math: I...
Today Google has presented a major update to its flagship SOTA model - Gemini 2.0. What caught my...
A company decided to rename its product XYZ to ABC. Assets, texts in the app, and web pages were...
UPD January 25, 2025: Deepseek R1 is another model that broke the ceiling of zero wins showing...
One of the most popular open-source frameworks of 2023, which explored the application of AI agents...
The new open model by NVidia, Nemotron 70B has recently been the hotspot of "this is wild", "this is...
This is the 3rd part of my investigations of local LLM inference speed. Here're the 1st and 2nd...
"When sensationalism wins over nuance, we lose our ability to think." is a great quote by Lex Fridman...
Yesterday OpenAI introduced "A new series of reasoning models for solving hard problems", which...
First Impressions I had low expectations from Continue.dev when I first installed it in...
Python 3.13 is due to be released in October, yet the first release candidate was published earlier...
This is the 2nd part of my investigations of local LLM inference speed. Here're the 1st and 3rd...
The recent release of Llama 3.1 was reminiscent of many releases this year. It underlined a trend...
Decoding by Contrasting Layers (DoLa) is a technique suggesting a different approach to calculating...
Fine Tuning LLM via Huggin Face TRL/Torch: ECC On: 2,22 epochs/day ECC Off: 2,33 epochs/day...
By default, MT-Bench uses OpenAI as a service provider with a gpt-4 model ID, which is a vanilla...
Nvidia's keynote at the recent Computex was full of bold marketing and messaging, bordering on...
At just a fraction of power, 4090 is capable of delivering almost full performance. While running...