Ben Guzovsky - Content Policy, Risk, and Applied AI

I'm currently Head of Risk & Policy at safetykit.com. See what I'm working on here, or contact me at benjamin [dot] guzovsky [at] gmail [dot] com!

LLMs Suck at Variance — Nov 2025

LLMs seem pretty smart, but they suck at seemingly basic tasks. Why?

The End of the Train-Test Split — Nov 2025

The train-test split does not work for classification tasks at the frontier of LLM capability.

Documentation for Prompts — Nov 2025

Prompts look like documentation, but they're deceptively complex. How can we document the choices that went into them?

"Capablanca Defeats Marshall's Attack" — Dec 2023

An exploration of historical time through three lenses and one chess player.

"Learning, Leveraged by Students" — Jan 2023

I spent high school staring out the window and waiting for lunch time. I felt like there was nothing I could change about school. This report is the culmination of my nationwide research on what students can change.

"Doorstops: Potential for more trusting schools" — Sept 2022

I am traveling across the country researching how trust is built, working to empower student voices.

"Pakistan Shows the Limits of China’s Economic Power" — Jun 2022

Pakistan should be the flagship success of China’s economic aid and investment programs abroad. Instead, it casts doubts on the viability of a Chinese economic world order.

"Weighted Internationalism in League of Nations Hiring Practices" — Jan 2022

"Nothing escapes his keen analytical gaze... He has immersed himself in an incredible volume of seemingly-dry historical sources and not only wrestled them into a clear and well-structured piece of writing, but also condensed them into a pithy but substantial interpretive formula... articulated with flair."
– Natasha Wheatley, Princeton History Professor

"Bot Poetry: Engineering Randomness" — Nov 2020

I created a haiku-generating algorithm and argued that the future of great writing isn't human, computer, or computer-trying-to-emulate-human, it's human-computer collaboration.

Further Writing

One-pager on asymetric impacts of AI on marketplaces like Fiverr and UpWork
A post on scrappy search engine optimization
On things that don't scale

Content Policy, Risk, and Applied AI

Writing!

I ship full-stack, interactive interfaces fast.