arXiv:2605.22502: Compiling agentic workflows into LLM weights achieves near-frontier quality at 100× lower cost
Researchers demonstrated that complex agentic workflows can be encoded directly into the weights of a smaller fine-tuned model instead of external orchestration such as LangChain or LangGraph. The approach achieves near-frontier quality at 100× lower inference cost across three real-world scenarios: travel booking, Zoom support, and insurance, with workflows of 14 to 55 nodes.