Our Generative AI products are only as good as the data behind them. This role owns that data layer from end to end: the pipelines that bring data in, the transformations that shape it, and the way it reaches retrieval systems, agents, and analytics. The work runs on AWS, and the aim is a single governed source that every consumer can rely on.
We want someone who has already built data pipelines for AI systems, not only for reporting. Preparing data for an LLM or an agent brings its own work around chunking, embeddings, indexing, and keeping content current, and you have done it before. The team is small and spans several languages, so you will own your pipelines and help set the standards the rest of us follow.
WHAT YOU WILL DO