Cole McIntosh

Software engineer building agents and data extraction on top of LLMs.

Full-stack engineer with a focus on production AI systems. Currently running Mellow AI and maintaining OpenExtract, an open-source framework for pulling structured data from anything.

StackPythonTypeScriptRustGoLangChainPydantic AIOpenAIAnthropicPostgresRedisAWSDockerFastAPIReact
01

Currently

Building
Mellow AI

Independent AI engineering and consulting. If you need LLM work done, there's a good chance I've already built something close to it.

Maintaining
OpenExtract

Open-source data extraction framework. Structured data from PDFs, images, web pages — anything.

Writing
Notes & essays

Occasional writing on AI engineering, agents, and what's actually working in production.

02

Selected work

pdfmd

Open source · Rust

A fast, dependency-light PDF-to-Markdown converter written in Rust. Walks the object graph, decodes fonts, and interprets content-stream operators directly — roughly 2,200 pages per second on a single machine.

~2,200 pages/sec·zero-runtime-deps·github →

OpenExtract

Open source · Framework

An open-source framework for extracting structured data from unstructured sources. Built to make production data pipelines less painful.

Used in production at Mellow AI·docs →·github →

Nudge

Agent · Personal

An apartment delinquency agent. Built after one too many unnecessary leasing-office emails — now it handles them for me.

Running daily·github →

LangChain Salesforce

Integration · OSS

A connector that brings Salesforce into the LangChain ecosystem. Lets agents reason over CRM data without bespoke glue code.

Chain-of-Thought Reranking

Research · Retrieval

A reranking approach that asks the model to reason through candidates before scoring. Better retrieval quality at the cost of a few extra tokens.

03

Experience

2025 — Now
Founder & Principal Engineer
Building production LLM systems for clients — agents, RAG, structured extraction, evals. Maintainer of OpenExtract.
Mellow AI
2023 — 2025
AI & Data Engineer
Shipped LLM-driven document-extraction pipelines processing thousands of resident notices daily across the delinquency workflow. Owned model selection, prompt evaluation, and end-to-end Python services in production.
Pay Ready
2022 — 2023
Scaled Tooling Analyst
Built internal automation and operational dashboards for the ghost-kitchen network, replacing manual workflows for ops teams across hundreds of facilities.
CloudKitchens
now Atoms
2017 — 2022
Business Analyst
Built reporting infrastructure and SQL/Python data pipelines powering daily decisions for property-management customers nationwide.
RealPage
04

Open-source contributions

Merged work on projects I rely on in production.

langchain-ai / langchainLLM application framework
BerriAI / litellmUnified LLM API gateway
huggingface / smolagentsLightweight agent framework
pydantic / pydantic-aiType-safe agent framework
05

Get in touch

If you have an AI or LLM problem worth solving, I'd like to hear about it.

cole@staymellow.ai →