Understanding PDF Font Encoding
November 2, 2024
A deep dive into how fonts work in PDF files, and why getting text extraction right is harder than it looks.
Statistician, hobby programmer and occasional blogger
I am a lecturer at the statistics department of the University of Leeds. This is my private homepage; there is also a separate work homepage.
I write code, publish research, and occasionally share things I'm thinking about.
October 15, 2024
Reflections on writing a custom HTTP/HTTPS server with Let's Encrypt integration for this site.
September 28, 2024
Some thoughts on teaching computational statistics and the tools students actually use.
A Go library for reading and writing PDF files, with full support for fonts, encryption, and modern PDF features.
Fast parameter estimation for diffusion models in cognitive psychology and neuroscience.
Custom HTTP/HTTPS server powering this site, with automatic TLS certificates and request logging.
Collection of mathematical utilities and visualization tools built for teaching and research.