Theo Farrell

I founded DAISI in my second year of university after being inspired by a three-person reading group at the university’s Effective Altruism society. Supported by the Pathfinder fellowship, the group helps to funnel top-university talent into AI Safety.

Organised recruitment and increased our weekly attendance from 5 to 20 participants.
Facilitated BlueDot discussion groups to upskill members in AI alignment and policy knowledge.
Led technical upskilling workshops in deep learning and NLP based on ARENA.
Secured grant funding for group organising.
Pathfinder Fellow since Dec 2024 - received training and mentorship for AI Safety organising after a competitive application process.

Theo Farrell

Research

Published papers I played a key role in creating:

Sparse Autoencoders Can Learn Graded Latents for Relational Composition

Order by Scale: Relative-Magnitude Relational Composition in Attention-Only Transformers

Other published papers I’ve helped with:

Challenges of Evaluating LLM Safety for User Welfare

Reviewing

Talks

AI Safety North East

Field-building

Durham AI Safety Initiative

Founding and scaling an AI Safety university group

Projects

Bipedal Walking with RL

Challenges of Evaluating LLM Safety for User Welfare

Cryptography Coursework

Data Cleaning and Analysis

Data Compression Coursework

Durham AI Safety Initiative Website

EPQ Calculator

Ghost Marks in the Machine

Hands2Text

Human Features for Style Transfer

Image Processing

KNN vs Logistic Regression

Metaheuristics and TSP

Natural Computing Algorithms

Order by Scale

Programmable Matter Lab

Stance Classification

Let's chat!

Research

Published papers I played a key role in creating:

Sparse Autoencoders Can Learn Graded Latents for Relational Composition

Order by Scale: Relative-Magnitude Relational Composition in Attention-Only Transformers

Other published papers I’ve helped with:

Challenges of Evaluating LLM Safety for User Welfare

Reviewing

Talks

AI Safety North East

Field-building

Durham AI Safety Initiative

Founding and scaling an AI Safety university group

Projects

Role

Category

Languages

Tools

Focus

Sort by

Bipedal Walking with RL

Challenges of Evaluating LLM Safety for User Welfare

Cryptography Coursework

Data Cleaning and Analysis

Data Compression Coursework

Durham AI Safety Initiative Website

EPQ Calculator

Ghost Marks in the Machine

Hands2Text

Human Features for Style Transfer

Image Processing

KNN vs Logistic Regression

Metaheuristics and TSP

Natural Computing Algorithms

Order by Scale

Programmable Matter Lab

Stance Classification

Let's chat!