Proteins Can Now "Talk": Meet BioReason-Pro — The World's First AI Reasoning Model That Thinks Like a Biologist

Eddie Avil
52 minutes ago
5 min read

Proteins Can Now "Talk." And AI Is Finally Listening.

For decades, one of biology's most stubborn bottlenecks has been hiding in plain sight. We know the sequences. We just don't know what they mean.

There are now over 250 million protein sequences catalogued in UniProt — but fewer than 0.1% carry experimental functional annotations. ResearchGate The sequencing revolution gave us an ocean of data. But we've been nearly blind to what most of it actually does.

Until now.

This week, researchers at the Arc Institute, in collaboration with teams from Stanford, UC Berkeley, UCSF, ETH Zürich, EPFL, the University of Toronto, and Cohere, unveiled BioReason-Pro — the first multimodal reasoning large language model for protein function prediction that integrates protein embeddings with biological context to generate structured reasoning traces. ResearchGate

In plain language: it's an AI that doesn't just label proteins. It thinks about them — the way a world-class biologist would.

The Problem With How Biology AI Has Worked — Until Now

Most existing computational tools approach protein function the same way a student might approach a multiple-choice exam: given a sequence, pick the most likely label. It works, but it misses something fundamental about how biology actually operates.

A protein's function emerges from the interplay of sequence, structure, evolutionary context, and decades of accumulated ontological knowledge — yet most AI models in biology still operate in their own individual domains. Chalmers tekniska högskola

Real biologists don't work that way. They synthesize evidence from protein domains, 3D structures, interaction partners, organism context, and the broader literature before committing to a functional hypothesis. BioReason-Pro is the first AI system built to mirror that integrative process from the ground up.

How BioReason-Pro Works: Reasoning, Not Just Predicting

BioReason-Pro combines ESM3 protein embeddings, a Gene Ontology graph encoder, and biological context to generate structured reasoning traces and functional annotations. UPMC Rather than producing a single output label, the model walks through its logic step-by-step — from molecular evidence, through domain analysis, to a structured functional hypothesis covering molecular function, biological process, cellular localization, and candidate interaction partners.

A critical engine inside the system is GO-GPT — an autoregressive transformer that treats GO annotation as a sequence generation task conditioned on protein representations, capturing hierarchical and cross-aspect dependencies of GO terms. UPMC

BioReason-Pro was trained via supervised fine-tuning on synthetic reasoning traces generated by GPT-5 for over 130,000 proteins, and further optimised through reinforcement learning. ResearchGate The result is a model that doesn't just pattern-match — it reasons under biological constraints, just as a trained scientist would.

The Results: Numbers That Should Stop You in Your Tracks

The benchmarks are striking:

🏆 73.6% weighted Fmax on GO term prediction — surpassing all prior accessible CAFA5 baselines, the field's gold standard competition
🔬 Strong performance on low-homology proteins — exactly where classical sequence-similarity methods fail most badly
🧑‍🔬 79% expert preference rate — in blinded evaluation, human protein experts preferred BioReason-Pro annotations over curated UniProt annotations in 79% of cases ResearchGate, with an average LLM judge score of 8/10 on functional summaries

Perhaps most remarkably, BioReason-Pro de novo predicted experimentally confirmed binding partners, with per-residue attention localising to the exact contact residues resolved in cryo-EM structures of those complexes UPMC — meaning the model identified, without being told, the precise molecular regions that matter. That's not prediction. That's understanding.

Why This Is a Paradigm Shift — Not Just a Better Benchmark

The significance here goes beyond a leaderboard jump. For the past several years, progress in biology AI has been driven by better encoders, larger protein language models like ESM, and stronger structure predictors like AlphaFold. Those advances were transformative. But they all share a common limitation: they produce answers, not explanations.

The reasoning traces BioReason-Pro generates are a new kind of output: hypotheses with supporting evidence, proposed mechanisms, and testable interaction partners. Chalmers tekniska högskola For a scientist, that distinction is everything. An annotation you can interrogate, challenge, and build on is infinitely more valuable than a black-box label — no matter how accurate.

This is the shift from AI as oracle to AI as research collaborator.

The Architecture Behind the Breakthrough

The model integrates data from 133,492 proteins across 3,135 organisms, curated from UniProt with experimental GO annotations, InterPro domains, STRING protein-protein interactions, and PDB protein structures. UPMC It was evaluated on a strict temporal split — training data through November 2022, test data from March 2023 to February 2024 — ensuring the benchmarks reflect genuine generalisation, not memorisation.

The base model is built on Qwen3-4B, giving the system its chain-of-thought reasoning capabilities, layered with the biological multimodal inputs that let it move from sequence to function in a way no previous system has achieved.

Fully Open. Fully Accessible. Right Now.

In an era where major AI breakthroughs are increasingly locked behind paywalls and proprietary APIs, BioReason-Pro is making a different bet. The team has released everything:

📄 Preprint paper (bioRxiv)
💻 Full codebase (GitHub)
🧬 Model weights and training data
🌐 Live web application at bioreason.net — with predictions available for over 240,000 proteins including the entire Human Protein Atlas

Any researcher, anywhere in the world, can use it today.

What This Means for Drug Discovery, Disease Research & the Future of Biology

The downstream implications are hard to overstate. The vast majority of proteins in the human body — and across all of life — remain functionally uncharacterised. Every dark corner of the proteome is a potential drug target, a disease mechanism, a biological process we don't yet understand.

BioReason-Pro demonstrates that AI systems can reason about protein function at expert level, opening a path toward scalable functional characterisation of the millions of uncharacterised proteins across all domains of life. UPMC

For drug discovery, that means faster target identification. For rare disease research, it means shining light on proteins that would never attract enough experimental funding to be characterised by hand. For basic science, it means the interpretive bottleneck that has shadowed the genomic revolution may finally be lifting.

Biology has always been an integrative reasoning problem. For the first time, AI is built to match that.

💬 The Bottom Line

BioReason-Pro isn't just a better protein classifier. It's a new kind of scientific instrument — one that reads molecular evidence, constructs a biological argument, and delivers a reasoned conclusion that human experts find more useful than the best manually curated database entries in existence.

The proteins have started talking. AI is finally fluent enough to listen.

🔗 Try BioReason-Pro: bioreason.net 📄 Read the preprint: bioRxiv 2026.03.19 💻 Access the code: GitHub — BioReason-Pro

ABOUT THE RESEARCH TEAM: BioReason-Pro was developed by researchers at the Arc Institute, Stanford University, UC Berkeley, UCSF, University of Toronto, ETH Zürich, EPFL, Cohere, and Xaira Therapeutics, led by Adibvafa (Adib) Fallahpour (NVIDIA) and Hani Goodarzi (Arc Institute).