What does TREX automate?

The complete LLM fine-tuning lifecycle: requirements analysis, literature and data search, training strategy formulation, data preparation, and results evaluation.

How does TREX use the search tree?

It models the experimental process as a search tree where each node represents a training configuration, enabling efficient planning, reuse of previous results, and drawing conclusions from iterative experiments.

ArXiv: TREX — Two AI Agents Automate the Entire LLM Fine-Tuning Process

The Problem: Fine-Tuning Requires Too Much Human Effort

Fine-tuning large language models — the process of adapting a pre-trained model to a specific task — currently requires significant human expertise. A researcher must analyze requirements, search the relevant literature, prepare data, select hyperparameters, run experiments, and evaluate results. Each of these steps involves a series of decisions that rely on experience and intuition.

Researchers Zerun Ma, Guoqiang Wang, and Xinchen Xie propose TREX — a system that automates this entire process using two coordinated AI agents.

How Does TREX Work?

The system is built on two modules. The Researcher takes on the tasks of requirements analysis, literature and data source search, and training strategy formulation. The Executor implements concrete experiments — from preparing data recipes to running training and evaluating results.

The key innovation is modeling the experimental process as a search tree. Each node in the tree represents a specific training configuration, and branches lead to variations. The system can efficiently plan exploration paths, reuse results from previous experiments, and draw conclusions from iterative attempts — rather than starting each experiment from scratch.

Results on the FT-Bench Benchmark

For evaluation, the researchers developed FT-Bench — a benchmark with 10 real-world tasks covering a range from optimizing foundational capabilities to improving domain-specific performance. Results show that the TREX agent “consistently optimizes model performance on target tasks.”

For teams that regularly fine-tune models, TREX promises a significant reduction in time and experimentation costs — by automating the routine steps currently performed by expensive ML engineers.

ArXiv: TREX — Two AI Agents Automate the Entire LLM Fine-Tuning Process

The Problem: Fine-Tuning Requires Too Much Human Effort

How Does TREX Work?

Results on the FT-Bench Benchmark

Sources

Related news