Linghua Jin

Linghua Jin @badmonster0

About: building CocoIndex - data infra for AI, ex-google tech lead ⭐https://github.com/cocoindex-io/cocoindex

Location:
San Francisco
Joined:
Feb 28, 2025

Linghua Jin
articles - 35 total

CocoIndex https://github.com/cocoindex-io/cocoindex is on Github Trending today! Smart incremental engine to build any index for AI. Support any custom logic, any target with standard interface like building blocks.

...

Learn More 0 0Oct 3

Best realtime context for coding agents https://cocoindex.io/blogs/index-code-base-for-rag/

A post by Linghua Jin

Learn More 0 1Aug 7

[Boost]

Multimodal Face Recognition Pipeline with CocoIndex: Real-Time...

Learn More 0 0Jul 29

Multimodal Face Recognition Pipeline with CocoIndex: Real-Time Image and Vector Search

CocoIndex supports multi-modal processing natively - it could process both text and image with the...

Learn More 7 0Jul 29

Context + Reliable LLM e.g., Claude code will be the future of coding agents. Checkout cocoindex that brings fresh context for reliable coding agents! https://github.com/cocoindex-io/cocoindex

Build Real-Time Codebase Indexing for AI...

Learn More 6 0Jul 13

Build Real-Time Codebase Indexing for AI Coding agents

In this blog, we will show you how to index a codebase for RAG with CocoIndex. CocoIndex provides...

Learn More 5 0Jul 13

Do you need academic research insights to back up your AI Agents?

Beyond Embeddings: Building...

Learn More 0 0Jul 11

Beyond Embeddings: Building Metadata-Rich Indexes from Academic PDFs

In this blog we will walk through a comprehensive example of indexing research papers with extracting...

Learn More 5 0Jul 11

https://github.com/cocoindex-io/cocoindex Super simple ETL to get data ready for AI agents. Write in Python, with Rust 🦀 performance . Appreciate a github star!

A post by Linghua Jin

Learn More 0 0Jul 4

[Boost]

Build Real-Time Knowledge Graphs from...

Learn More 5 0Jun 4

Build Real-Time Knowledge Graphs from Documents Using CocoIndex + Kuzu (with LLMs & Live Updates)

A blazing-fast, end-to-end open source pipeline for turning documents into queryable knowledge graphs...

Learn More 5 4Jun 4

Stream from S3 in Real-Time: CocoIndex Brings True Incremental Processing to the Cloud

🚀 CocoIndex https://github.com/cocoindex-io/cocoindex Now Supports Amazon S3 for Native, Real-Time...

Learn More 8 0May 30

How to build image search with semantic understanding

In this blog, we will build live image search and query it with natural language. For example, you...

Learn More 5 0May 25

In this walkthrough, we’ll show how to build a semantic image search engine powered by multimodal AI – searchable with natural language in real time, with data insight to understand what’s going on step by step.

How to build index with text embeddings ...

Learn More 5 0May 24

How to build index with text embeddings

In this blog, we will build index with text embeddings and query it with natural language. We try to...

Learn More 7 0May 24

🎉 CocoIndex hits 1,000 stars – powering real-time data for AI

We’ve been building CocoIndex, an ultra performant real-time data transformation framework for AI,...

Learn More 5 0May 13

product recommendation with LLM and knowledge graph

Real-Time Knowledge Graph for Product...

Learn More 0 0May 8

Real-Time Knowledge Graph for Product recommendation with LLM taxonomy extraction

In this blog, we will build a knowledge graph for product recommendations using taxonomy and...

Learn More 5 0May 8

[Boost]

Build Real-Time Knowledge Graph For Documents with LLM LJ...

Learn More 0 0May 1

Build Real-Time Knowledge Graph For Documents with LLM

CocoIndex makes it easy to build and maintain knowledge graphs with continuous source updates. In...

Learn More 5 0May 1

LLM to extract and auto generate knowledge graph - step by step, in ~100 lines of python

In this blog, we will use CocoIndex to extract relationships/ontologies using LLM and build a...

Learn More 2 0Apr 24

CocoIndex Changelog 2025-04-05

In the past 2 weeks, we added incremental processing with live update mode, evaluation utilities,...

Learn More 0 0Apr 8

Continuous update ETL on source updates, automatically

Today, we are excited to announce the support of continuous updates for long-running pipelines in...

Learn More 0 1Apr 8

How to do incremental processing for ETL - by examples

We could take a look at a few examples to understand what CocoIndex handles behind the scene for...

Learn More 13 0Apr 8

Incremental processing to keep low latency sync for your ETL

Incremental processing is one of the core values provided by CocoIndex. In CocoIndex, users declare...

Learn More 0 0Apr 7

Implement Contextual Retrieval for RAG

Anthropic has published a great article about Contextual Retrieval that suggests the combination of...

Learn More 0 0Apr 1

Efficiently process large files for RAG

When building data indexing pipelines, handling large files efficiently presents unique challenges....

Learn More 1 0Mar 30

Automate structured data extraction from PDF / Word by OpenAI and CocoIndex

In this blog, we will show you how to use OpenAI API to extract structured data from patient intake...

Learn More 2 0Mar 28

Is RAG Still Needed? Retrieval Beyond Vector Embeddings

The rise of Large Language Models (LLMs) has sparked an ongoing debate: do we still need...

Learn More 1 2Mar 27

Step by Step guide to ingest Google Drive for RAG

Overview In this blog, we will show you how to use CocoIndex to build text embeddings from...

Learn More 2 0Mar 26