Articles by Tag #ocr

DeepSeek OCR in Automation Pipelines: Practical Engineering Insights and Integration Patterns

Document extraction is still one of the slowest moving parts in automation architectures. Even with...

Nov 18

2025 Complete Guide: How to Build End-to-End OCR with HunyuanOCR

🎯 Key Takeaways (TL;DR) A single 1B multimodal architecture covers detection,...

Nov 25

2025 Complete Guide: PaddleOCR-VL-0.9B — Baidu's Ultra-Lightweight Document Parsing Powerhouse

🎯 Key Takeaways (TL;DR) Breakthrough Achievement: A model with only 0.9B parameters...

Oct 17

Paddle OCR-VL & DeepSeek-OCR

PaddleOCR-VL What it is PaddleOCR-VL is a vision-language model (VLM) built for document...

Oct 21

Optical Clear Adhesive (OCA): Why It Matters in Modern Display Assembly

Modern displays are more than just LCD panels and touch sensors. Between the layers that make up your...

Oct 23

K-shot training with LLMs

I built a tool for teaching LLMs how to extract structured data from documents by annotating, not...

Sep 17

Practical Approaches to Key Information Extraction (Part 2)

Table of Contents: Real-Life Key Information Extraction (Part 2) Why Open-Source...

Jan 16

I am building a document api suite that gives you coordinates for every answer

Hi everyone! I've been working on a document processing API suite that solves a few problems I...

Oct 8

What is Intelligent Document Processing?

What is IDP? IDP combines AI-powered tools like natural language processing and machine...

Oct 29

Stop Typing That Image Text: PaddleOCR Makes AI-Powered Text Extraction Effortless

Quick Summary: 📝 PaddleOCR is a powerful and lightweight OCR toolkit designed to convert...

Nov 1

Farsi Image generator

During the development of a personal project, I needed to build a script that converts Farsi...

Sep 12

Generating Synthetic RTL OCR Data for Donut with SynthDoG-RTL

Introduction Building OCR models for right-to-left (RTL) languages like Arabic, Urdu,...

Sep 23

Say goodbye to tedious data entry! The future of OCR is here, and it’s smarter than ever!

Get ready to be amazed by the amazing progress in OCR technology! 🤯 In 2025, we won't just be reading...

Dec 16 '24

Complete Guide 2025: How DeepSeek OCR Reduces AI Costs by 20x Through "Visual Compression"

🎯 Key Takeaways (TL;DR) Revolutionary Technology: DeepSeek OCR is not a traditional...

Oct 22

DeepSeek-OCR: When a Picture Is Actually Worth 10 Fewer Tokens

Published: October 26, 2025 Model Version: DeepSeek-OCR v1 (Oct 20, 2025) Last Verified: October 26,...

Oct 26

How to Build a .NET Document Scanner with C# and Windows OCR API

In today's digital workplace, document scanning and text recognition are vital capabilities for many...

Aug 13

Mistral OCR: The Future of Document Understanding & AI-Powered OCR

Discover Mistral OCR, the most advanced document understanding API that revolutionizes text and media extraction from PDFs, images, and complex documents. Explore its features, benchmarks, and real-world use cases.

Mar 11

Using OCR for Data Extraction from Forms and Invoices

Using OCR for Data Extraction from Forms and Invoices While basic OCR converts document...

May 14

🔎 What is OCR? and How Can You Use It Without Any ML Experience?!

Ever wished your computer could understand the words on a picture or a scanned paper? Well, get ready...

Jul 21

Using Docling’s OCR features with RapidOCR

Using RapidOCR implementation with custom OCR models. Introduction Docling’s RapidOCR...

Apr 3

Unlocking Text from Embedded-Font PDFs: A pytesseract OCR Tutorial

Extracting text from a PDF is usually straightforward when it's in English and doesn't have embedded...

Dec 1 '24

DevCon 2025 Workshop: Creating a Document Processing MCP Server

I recently led an engaging hands-on workshop at ABBYY DevCon 2025. The session, "Creating a Document...

Aug 8

Powerful PDF+Image Parsing — Mistral OCR

Mistral AI has recently released a powerful OCR model — Mistral OCR — Their tagline for the model is...

Mar 12

Building an iOS ID Scanner with Face, Document, OCR and MRZ Detection

Apple's vision framework provides APIs for performing computer vision tasks such as face detection,...

Sep 12

Kreuzberg: The Python Document Intelligence Framework That Will Blow Your Mind!

Quick Summary: 📝 Kreuzberg is a Python document intelligence framework designed to extract...

Jul 27

How to Build a Free Web OCR App for Images and PDF Files

Building a web-based OCR (Optical Character Recognition) application has never been easier with...

Aug 8

Introducing VisionParser: A Modern OCR API for Receipt and Invoice Processing

Introducing VisionParser: Modern Document Processing for Developers Hello dev.to...

Feb 1

🚀 Automating Document Data Extraction with DocExtractor: Save Time, Build Smarter

Hey devs! 👋 We're excited to introduce you to DocExtractor — an open, developer-friendly solution to...

Apr 29

Building Purchase Tracker: The MVP That Eats Your Receipts (So You Don’t Have To)

Tired of juggling receipts, apps, and loyalty platforms from every store you visit? Me too. The real...

Aug 28