SoftSkill: Skill in 32 Tokens, +42.1 on LiveMath

SoftSkill is a method described in the paper arXiv:2606.20333 that converts skill documents, such as Markdown SKILL.md files, into compact continuous latent objects that guide model behavior without modifying the base model. Instead of hundreds or thousands of instruction tokens, SoftSkill uses 32 virtual tokens and achieves an improvement of 42.1 percentage points on the LiveMath task compared to operating without the skill.

The paper arXiv:2606.20333 presents SoftSkill, a method that converts skill documents into compact latent control objects. A skill document is a textual instruction, for example a Markdown SKILL.md file, that describes to the model how to perform a particular task. Such instructions can occupy hundreds or thousands of context tokens, consuming the context window and slowing execution.

How Behavioral Compression Works

SoftSkill applies behavioral compression — it compresses natural-language instructions into 32 virtual (latent) tokens that guide model behavior, all without modifying the base model’s weights. Unlike fine-tuning, which adapts the model itself, SoftSkill acts only through added control tokens, so skills can be switched on and off as needed without retraining.

Results by the Numbers

On the LiveMath task, SoftSkill delivers +42.1 percentage points compared to operating without a skill and +12.5 points compared to competing methods. On SearchQA it achieves +8.3 points over baseline prompting and +5.2 points over the SkillOpt method. The comparison shows that the compressed latent representation not only saves tokens but also outperforms the original textual instructions in accuracy.

What This Changes

Fewer tokens means lower cost and faster execution, especially for agents juggling multiple skills simultaneously. Since the base model remains untouched, SoftSkill is compatible with existing models and offers a practical path for modularly adding capabilities. The approach fits into the broader trend of replacing long system prompts with compact, learned control signals.

Frequently Asked Questions

What does SoftSkill do?

SoftSkill converts skill documents into 32 continuous latent tokens that guide model behavior without altering its weights.

How much does it improve results?

On the LiveMath task, SoftSkill delivers +42.1 percentage points compared to prompting without a skill and +12.5 points over competing methods; on SearchQA it achieves +8.3 points over baseline prompting.

Is the base model changed?

No, the base model remains unchanged; SoftSkill acts only through latent control tokens.

arXiv:2606.20333: SoftSkill Compresses Skill Documents into 32 Latent Tokens and Boosts LiveMath by 42.1 Points

How Behavioral Compression Works

Results by the Numbers

What This Changes

Frequently Asked Questions

Sources

Related news