arXiv:2605.04908: Gosset with Curated Pharma Index Outperforms Frontier LLMs by 3.2x
Gosset is a specialized AI platform with curated pharmaceutical data that returned 3.2 times more verified drugs per query compared to four frontier systems, achieving 100% precision and full recall across ten niche oncology and immunology targets.
This article was generated using artificial intelligence from primary sources.
A new study published on arXiv shows that a specialized AI platform with a curated pharmaceutical database significantly outperforms general frontier LLMs at drug discovery for niche therapeutic targets. The research compares a system called Gosset against four leading models on asset discovery tasks in oncology and immunology.
How was the experiment designed?
All five systems received identical natural-language queries and the same JSON output schema. Evaluation covered ten specialized targets where most pipeline activity occurs in preclinical and Asian-developed assets — areas typically underrepresented in general web search results. The frontier controls were Claude Opus 4.7, GPT 5.5, Gemini 3.1 Pro and Perplexity sonar-pro.
What were the key results?
Gosset returned 3.2 times more verified drugs per query than the best frontier system. It achieved 100% precision and full recall measured against the union of all verified drugs returned by any system in the test. In other words, the curated index not only found more hits but simultaneously avoided false positives in areas where general models frequently hallucinate drug names or misassign them to targets.
What does this mean for frontier models?
The authors emphasize that the results do not suggest replacing general-purpose models but extending them. Since Gosset is exposed as an MCP server tool, frontier models can include it in their toolchain and thereby “close most of the recall gap” by replacing generic web search with a specialized curated database. This positions curated vertical indexes as a complementary rather than competing category to large models.
Frequently Asked Questions
- What is Gosset?
- Gosset is a specialized AI platform with curated and annotated pharmaceutical data focused on asset discovery — finding drugs in development for specific therapeutic targets.
- Which systems was Gosset compared against?
- The study compares it against four frontier LLM systems: Claude Opus 4.7, GPT 5.5, Gemini 3.1 Pro and Perplexity sonar-pro, using identical queries and output JSON schema.
- What does MCP server mean in this context?
- MCP (Model Context Protocol) is an open standard that allows LLMs to call external tools. Gosset is available as an MCP server, so frontier models can integrate it as a data source instead of generic web search.
Sources
Related news
arXiv:2605.03195: Terminus-4B — 4 billion parameters for terminal execution matches Claude Opus and GPT-5.3-Codex on SWE-Bench Pro with ~30% fewer main agent tokens
Google: Gemini API Gets Multimodal File Search for Images and Breaking Change in Interactions API
arXiv:2605.03871: EvoLM — language models that improve themselves without external supervision