About

Hi, I’m Guanni (Jenny) Qu, an EA-aligned builder and researcher focused on adversarial evaluation and AI control. I like turning red-team ideas into reproducible, auditable evaluations that measurably reduce risk in deployment.

Now

Why (EA)

I’m motivated by Effective Altruism: I look for work where the expected impact is high and legible. For near-term safety, I prioritize control evaluations that are black-box, capability-grounded, and iteratively improvable; for the long run, I care about research that makes alignment and governance decisions evaluable rather than speculative.

Research directions

Open question I’m investigating

What is the half-life of jailbreak attack strength under iterative guardrail/model updates, and can an automated, “evergreen” generator predict or extend it?

Selected things I ship

Contact

Email: [email protected]. I’m excited to collaborate on eval engineering, control protocols, and artifact-driven safety research.