The Black Spatula Project

WhatsApp - Discord - GitHub


Background

The Black Spatula Project is an open initiative to investigate how well large language models (LLMs) can identify errors in scientific papers. We seek to answer the following questions: How many errors can LLMs detect? How serious are those errors? Which model, prompt, or pipeline performs best? And ultimately, how can we use AI to improve scientific integrity?

The project was inspired by a scientific paper containing a simple math error, one that even an AI reviewer could have caught, which led many people to toss all of their black plastic kitchen implements. To learn more about the story of this project, please check out the initial post and latest update by Steve Newman, the initiator of the project.

If you’d like to get involved, join our WhatsApp group (currently very active, for high-level discussion) and/or our Discord (more focused on technical work). To contribute, check out the Ongoing Tasks section below.

Worksheets

Other

Ongoing Tasks

Research (#prompt-and-model)

Pipeline (#data-ingestion)

Website

Resources & Ideas

Potential Paper Sources

Potential Error Types

Implementation Ideas