Eleventh Solutions Guangzhou, CN

Ignazio De Santis.

Most AI systems fail somewhere between a working demo and a working system. Retrieval that drifts, agents that don't recover, evaluation that catches nothing. I work in that gap. Backend systems, retrieval, agents, evaluation.

View projects → Eleventh.dev ↗ GitHub

Available for contract work [email protected]

telemetrylast 60s

p50 284ms

tput 47/s

err 0.3%

cost $3.94/1k

Eleventh eleventh.dev ↗

Code github.com/IgnazioDS ↗

Writing substack ↗

Status Available for contract work

Stack

ai / backendPython · FastAPI · LangGraph · Pydantic

retrievalpgvector · PostgreSQL · Redis · Celery

modelsAnthropic API · OpenAI API · Ollama

infrastructureDocker · Docker Compose · GitHub Actions · Nginx

frontend / desktopReact · TypeScript · Next.js · Vite · Tauri · Rust

Engineer with
range.

Software Engineer by background. Backend systems, integrations, and infrastructure for four years before AI ate the world. The current focus is retrieval, agents, and evaluation: the layer between a working prototype and a system that holds up in production.

The adjacent interests stayed. Data engineering, computer vision, distributed systems, the occasional Rust desktop app, and the CS fundamentals I keep coming back to. The breadth is on GitHub.

Work runs through Eleventh Solutions or as direct contract.

Arc eleventh · 2024 →

Then

Software Engineer

Backend systems, integrations, infrastructure

Now

Software Engineer & Founder

RAG · Agents · Evaluation · Backend Systems · Eleventh Solutions

How I work.

Every engagement follows a structured execution pipeline: from system design to production monitoring. Each phase has clear deliverables you receive at the end of it.

01 Days 1–3

Discovery.

Week one is for understanding the system, not coding. Constraints, data shape, and success metrics get named and quantified before anything is built.

System spec + architecture sketch
Evaluation criteria with measurable bars
Risk register + mitigation plan

spec.md discovery · v0.1

ingest:    pdf, docx, html
chunking:  recursive, 512 tokens
embedding: bge-large-en-v1.5
storage:   pgvector + jsonb meta
retrieval: top-k=8 + cross-encoder rerank
latency:   p50 ≤ 350ms · p99 ≤ 1200ms
recall:    ≥ 0.92 on labelled eval set
budget:    $0.04 / query · monthly cap

02 Week 1–2

Build.

End-to-end first, optimisation second. A skeleton system running on real data produces sharper questions than any whiteboard ever will.

End-to-end system skeleton
Pluggable model + retrieval components
Internal demo on real data

api/query.py FastAPI · async

from fastapi import FastAPI, Depends
from .rag import RagPipeline

app = FastAPI()
rag = RagPipeline.from_config("./config.yml")

@app.post("/query")
async def query(q: QueryIn) -> QueryOut:
    ctx = await rag.retrieve(q.text, k=8)
    return await rag.generate(q.text, ctx)

03 Week 2–3

Evaluate.

Vibes are not a metric. Every change is scored against a versioned eval set, and regressions are caught before they ship, not after a customer reports them.

Eval harness, repeatable + versioned
Baseline scores + failure analysis
Regression tests for known edge cases

eval-results.json v0.4 baseline · n=420

recall@8

0.94 ↑ 0.03

faithfulness

0.91 ↑ 0.02

answer relevance

0.89 ↑ 0.05

context precision

0.76 ↓ 0.04

latency p99

1.18s ↓ 0.22

04 Week 3

Deploy.

Production means: behind auth, behind rate limits, with a runbook for the day it breaks. Anything less is a demo, not a deployment.

Live API behind auth + rate limits
Runbook for incidents + rollbacks
Public stats endpoint for telemetry

curl /query prod · 200 OK

$ curl -X POST https://api.example.com/query \
  -H "Authorization: Bearer $TOKEN" \
  -d '{"text": "summarise q3 risk register"}'

{
  "answer": "The Q3 register flagged 4 ...",
  "sources": ["risk-2025-q3.pdf#p4", ...],
  "latency_ms": 247,
  "tokens": 1842,
  "trace_id": "7f3a1c..."
}

05 Ongoing

Monitor.

Shipping isn't finishing. Live cost, latency, and eval-against-prod metrics close the loop so the system improves with usage instead of degrading silently.

Cost + latency dashboards
Eval-against-prod regression checks
Iteration loop + change log

prod-metrics last 24h · sparkline

latency p50 284ms

throughput 47/s

err rate 0.3%

cost / 1k q $3.94

Experience.

2024–Present Active

Software Engineer & Founder

Eleventh Solutions · MS-CS, University of Colorado Boulder

→

Building AI systems through Eleventh Solutions (eleventh.dev). Retrieval, agents, evaluation, and the backend infrastructure underneath. Current work includes NexusRAG (a multi-provider RAG platform), SentinelID (on-device biometric CV), and the durable execution layer for agent workflows. Code on GitHub.

PythonFastAPILangGraph pgvectorRustDockerTauriReact

2020–2024 Founder

Software Engineer & Founder

Independent · Backend systems, integrations, infrastructure

→

Independent backend work for clients across multiple industries. API integrations, data pipelines, internal tooling, infrastructure. The four years where the engineering habits got formed before they got pointed at AI.

PythonFastAPIPostgreSQL DockerAPI IntegrationBackend Systems

Pre-2020: Multi-industry operator across Italy, UK, Ireland, USA, Australia, and China. Pattern recognition, cross-cultural communication, execution under constraint. Not the engineering story, but it informs how I scope.

Stack.

Interface

Desktop · Web · API consumer

ReactNext.jsTypeScriptViteTauri

Orchestration

Agents · workflows · evals

FastAPILangGraphPydanticCeleryRedis

Models

LLMs · embeddings · rerankers

AnthropicOpenAIOllamaBGE

Storage & Infra

Vector · relational · runtime

pgvectorPostgreSQLDockerNginxAlembicGH Actions

"Most of the work is the part that isn't the model."

001

Rigor

Applied to the parts that don't show: retrieval evaluation, failure paths, runbook discipline. The visible parts inherit it.

002

Documentation

Written for the engineer who inherits the system, not for the audit. Every repo has a real README, and every system has a record of why it was built the way it was.

003

Coherence

The architecture, the deployment, and the writeup describe the same system. No gap between what was built and what was said about it.

Projects.

Flagship · Private 01

Orion AI System

Single-node, operator-supervised AI freelance agent powered by the Anthropic Claude API. End-to-end execution pipeline (Scout, Qualify, Propose, Execute, Deliver, Follow-up) across three tracks: enterprise, engineering, and webdev. Closed-loop intelligence layer covering prompt-drift detection, an active-learning queue, Monte Carlo deal predictions, Bayesian pricing calibration, and per-LLM-call cost attribution.

Operator-supervised execution Closed-loop intelligence Multi-track autonomy

Python 3.12FastAPISQLiteAnthropic SDKReactStripe

Discuss the system → Private repo

architecture · closed-loop

Flagship · Live 02

NexusRAG

Multi-provider RAG platform with pluggable backends and multi-modal output (text and synthesized speech). LangGraph orchestration, pgvector semantic search, FastAPI backend, Docker multi-service deployment. Document intelligence with structured retrieval and full observability.

LangGraph orchestration Pluggable backends Multi-modal output

PythonLangGraphpgvectorFastAPIDockerPostgreSQL

Live · nexusrag-lyart.vercel.app ↗ Source →

architecture

Computer Vision · Edge Systems 03

SentinelID

Passkey-style biometric authentication with anti-spoofing and liveness detection. Tauri + Rust desktop shell, Python CV backend, Next.js dashboard. On-device computer vision: no cloud round-trip, no raw face data leaving the device by default.

RustTauriPythonOpenCVNext.jsDocker

GitHub →

architecture

07–09 / Selected projects

04 Agent Runbook Orchestrator Durable Execution · Agents PythonFastAPIPostgreSQLCelery Active → 05 Long-Form Content Intelligence Engine Content Intelligence · SaaS FastAPICelerypgvectorRedis Active → 06 EvalOps Workbench AI Evaluation · Dev Tools PythonFastAPISQLiteReact Active → 07 Repo RAG Debugger AI Debugging · RAG PythonLangChainpgvectorFastAPI Active → 08 Data Quality Watchtower Data Quality · Monitoring PythonpandasPostgreSQLDocker Active → 09 Revenue Signal Copilot Sales Intelligence · Signals PythonFastAPIPostgreSQLRedis Active →

Adjacent.

Work outside the AI focus. Some predates the current work, some is ongoing. The through-line is engineering discipline applied to problems that aren't about language models.

A1 MS-CS Playbook CS Fundamentals · Coursework PythonPyTorchNumPy Public → A2 API Sync Pipeline Data Engineering · ETL PythonRESTSQLite Public → A3 MingWay Language Learning · PWA Next.jsTypeScriptDrizzlePlaywright Live → A4 Churn Cohort Analysis Data Analysis · Analytics Pythonpandasmatplotlib Public →

Start a project.

Available for contract work and ongoing engagements. Retrieval, agents, evaluation, backend systems. Describe what you're building below.

Name

Project title

Ignazio De Santis.

Engineer withrange.

How I work.

Discovery.

Build.

Evaluate.

Deploy.

Monitor.

Experience.

Software Engineer & Founder

Software Engineer & Founder

Stack.

Projects.

Orion AI System

NexusRAG

SentinelID

Adjacent.

Engineer with
range.