All work
CompanyNewsCorp (USA) RoleLead Consultant Engineer PeriodMay 2023 — January 2024 TypeConsulting · Generative AI

Generative AI for the Wall Street Journal

Production Python/Go APIs for GenAI use cases at WSJ — LLM evaluation, vector retrieval, session management, and a pioneering screen-reader experiment to make articles accessible.

LLMs Falcon LLaMA Pinecone FastAPI DynamoDB Accessibility
4
Major LLMs evaluated for production
3
Vector stores benchmarked
1st
WSJ screen-reader experiment for accessibility

The brief

NewsCorp wanted to move generative AI from research demos into production capabilities for the Wall Street Journal — document understanding, retrieval-augmented generation, and session-aware AI interactions that could meet real production SLAs.

What I built

LLM evaluation and selection

Retrieval systems for production

Session-aware AI services

Accessibility experimentation

"The fun part of LLM work in production isn't the model. It's the dozen unglamorous services around it that decide whether the model ever gets to do anything useful."
Next case study

GenAI for Morgan & Morgan at Andela