Browse our collection of articles on various topics related to IT technologies. Dive in and explore something new!
Document extraction is still one of the slowest moving parts in automation architectures. Even with...
🎯 Key Takeaways (TL;DR) A single 1B multimodal architecture covers detection,...
🎯 Key Takeaways (TL;DR) Breakthrough Achievement: A model with only 0.9B parameters...
PaddleOCR-VL What it is PaddleOCR-VL is a vision-language model (VLM) built for document...
Modern displays are more than just LCD panels and touch sensors. Between the layers that make up your...
I built a tool for teaching LLMs how to extract structured data from documents by annotating, not...
Table of Contents: Real-Life Key Information Extraction (Part 2) Why Open-Source...
Hi everyone! I've been working on a document processing API suite that solves a few problems I...
What is IDP? IDP combines AI-powered tools like natural language processing and machine...
Quick Summary: 📝 PaddleOCR is a powerful and lightweight OCR toolkit designed to convert...
During the development of a personal project, I needed to build a script that converts Farsi...
Introduction Building OCR models for right-to-left (RTL) languages like Arabic, Urdu,...
Get ready to be amazed by the amazing progress in OCR technology! 🤯 In 2025, we won't just be reading...
DeepSeek-OCR: When a Picture Is Actually Worth 10 Fewer Tokens ...
🎯 Key Takeaways (TL;DR) Revolutionary Technology: DeepSeek OCR is not a traditional...
Published: October 26, 2025 Model Version: DeepSeek-OCR v1 (Oct 20, 2025) Last Verified: October 26,...
In today's digital workplace, document scanning and text recognition are vital capabilities for many...
Discover Mistral OCR, the most advanced document understanding API that revolutionizes text and media extraction from PDFs, images, and complex documents. Explore its features, benchmarks, and real-world use cases.
Using OCR for Data Extraction from Forms and Invoices While basic OCR converts document...
Ever wished your computer could understand the words on a picture or a scanned paper? Well, get ready...
Using RapidOCR implementation with custom OCR models. Introduction Docling’s RapidOCR...
Extracting text from a PDF is usually straightforward when it's in English and doesn't have embedded...
I recently led an engaging hands-on workshop at ABBYY DevCon 2025. The session, "Creating a Document...
Mistral AI has recently released a powerful OCR model — Mistral OCR — Their tagline for the model is...
Apple's vision framework provides APIs for performing computer vision tasks such as face detection,...
Quick Summary: 📝 Kreuzberg is a Python document intelligence framework designed to extract...
Building a web-based OCR (Optical Character Recognition) application has never been easier with...
Introducing VisionParser: Modern Document Processing for Developers Hello dev.to...
Hey devs! 👋 We're excited to introduce you to DocExtractor — an open, developer-friendly solution to...
Tired of juggling receipts, apps, and loyalty platforms from every store you visit? Me too. The real...