ContextD – OCRs your screen activity and lets you use it with LLMs via a local API
Mostly vibe-coded in two days; I only contributed ~5 core insights, the rest is all opencode (trying it for the first time; it was OK).
- API Platform
- LLM
- Open Source
✨ AI Summary
ContextD is a tool that performs Optical Character Recognition (OCR) on your screen activity and makes the extracted text available to Large Language Models (LLMs) via a local API.
Best For
- Developers integrating LLMs with desktop applications
- Users who want to automate tasks based on screen content
- Researchers analyzing on-screen data
Why It Matters
It enables LLMs to process and act upon information directly from your screen.
Key Features
- Performs Optical Character Recognition (OCR) on screen activity.
- Integrates with Large Language Models (LLMs).
- Provides a local API for LLM interaction.
- Captures and processes on-screen text.
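The listing does not document the local API's actual endpoints or response format, so the sketch below is a hypothetical client: it assumes an HTTP endpoint (`http://localhost:8080/context`) returning JSON entries of OCR'd screen text, and shows how that text could be assembled into an LLM prompt. The URL, response shape, and field names are all assumptions, not ContextD's documented interface.

```python
# Hypothetical client sketch for a ContextD-style local API.
# The endpoint URL and JSON shape below are assumptions for illustration.
import json
import urllib.request

CONTEXTD_URL = "http://localhost:8080/context"  # assumed local API endpoint


def fetch_recent_text(url: str = CONTEXTD_URL) -> list[str]:
    """Fetch recently OCR'd screen text from the (assumed) local API.

    Assumed response shape: {"entries": [{"text": "..."}, ...]}
    """
    with urllib.request.urlopen(url) as resp:
        payload = json.load(resp)
    return [entry["text"] for entry in payload.get("entries", [])]


def build_prompt(entries: list[str], question: str) -> str:
    """Assemble captured screen text into a single prompt for an LLM."""
    context = "\n---\n".join(entries)
    return f"Screen context:\n{context}\n\nQuestion: {question}"
```

A caller would pass `build_prompt(fetch_recent_text(), "What error is on screen?")` to whatever LLM client they already use; the separation keeps the (assumed) transport details apart from the prompt assembly.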
Use Cases
- A software developer can use ContextD to automatically capture code snippets and error messages from their screen, then feed them to an LLM for explanations or candidate fixes, speeding up debugging.
- A content creator can use ContextD to extract text from images or videos displayed on their screen, then have an LLM rephrase or summarize it for social media posts or blog articles.
- A student can capture lecture slides or textbook pages and feed the OCR'd text to an LLM to generate study notes, flashcards, or answers to comprehension questions.