arXiv:2604.21508 BioMiner: multimodal AI extracts protein-ligand bioactivity from literature, 5.59× faster than manual work
On April 23, 2026, Jiaxian Yan and colleagues published BioMiner, a multimodal AI system for automated extraction of protein-ligand bioactivity data from scientific literature. The system processes text, tables and molecular structures. It achieves an F1 score of 0.32 on the new BioVista benchmark (16,457 entries from 500 publications), and in a demonstration application it extracts 82,262 data points from 11,683 papers.
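For readers unfamiliar with the metric, the following is a minimal sketch of how an F1 score can be computed for an extraction task. It assumes exact-match, set-based scoring over records; the summary above does not specify the matching criterion BioVista uses, so this is an illustration only. The function name `f1_score` and the example records are hypothetical.

```python
def f1_score(predicted, gold):
    """Set-based F1: harmonic mean of precision and recall.

    Assumption: a predicted record counts as correct only if it matches
    a gold record exactly. The benchmark's real criterion may differ.
    """
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical (protein, ligand, measurement, value) records, not real data.
gold = {
    ("ProteinA", "Ligand1", "IC50", "12 nM"),
    ("ProteinA", "Ligand2", "IC50", "340 nM"),
    ("ProteinB", "Ligand1", "Ki", "5 nM"),
    ("ProteinB", "Ligand3", "Kd", "80 nM"),
}
predicted = {
    ("ProteinA", "Ligand1", "IC50", "12 nM"),   # correct
    ("ProteinB", "Ligand1", "Ki", "5 nM"),      # correct
    ("ProteinB", "Ligand3", "Kd", "8 nM"),      # wrong value, counts as a miss
}

# Precision is 2/3 and recall is 2/4, so F1 is 4/7, about 0.571.
print(round(f1_score(predicted, gold), 3))
```

Under this strict definition, a record with one wrong field (protein, ligand, measurement type or value) counts as both a false positive and a missed gold entry, which is one reason end-to-end extraction scores tend to be low.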