MSc dissertation (Distinction). Python app using LLMs to extract, classify and calculate emissions data from unstructured documents.
My MSc dissertation at Nottingham Trent University, graded Distinction. A Python system that takes unstructured documents — invoices, supplier reports, utility statements — and returns a structured emissions breakdown at the other end.
The split matters. LLMs are the right tool for “read a weird document and normalise it”; they are the wrong tool for “multiply two numbers.” Keeping those jobs separate is most of what made the system reliable.
Emissions calculation is usually a manual, spreadsheet-driven process. Document variety is the bottleneck, not the math. Putting an LLM at the extraction layer turns a multi-hour job into a pipeline step — and gives you citations back to the source document, which auditors need.