LLM/RAG pipelines, voice AI, multi-agent systems, and computer vision — built for real load, real edge cases, and real business pressure. Not demos. Not prototypes. Systems that run.
Production retrieval pipelines with hybrid search, re-ranking, and evaluation. Built to stay accurate at scale.
End-to-end voice assistants — speech-to-text, LLM reasoning, text-to-speech — optimised for sub-3-second latency.
AI workflows orchestrated with LangChain and LlamaIndex, with production-grade observability via LangSmith.
Real-time object detection on edge hardware. YOLO-based systems deployed across multi-camera setups in all conditions.
Previously built the AI layer for a cybersecurity platform (Munit.io), led engineering teams at EsMedia and Wargaming, and founded GrainTrack. Now running an independent AI engineering practice serving clients across healthcare, SaaS, and surveillance.
We work with teams that have high standards — where architecture matters and shipping doesn't mean cutting corners.
Start a conversation