AWS: EU AI Act FLOPs Meter for SageMaker fine-tuning

The Fine-Tuning FLOPs Meter toolkit is a new AWS SageMaker AI extension released on May 12, 2026, that automatically tracks the compute thresholds of the European AI Act (3.3×10²² FLOPs, 3.3×10²⁴ for systemic risk) during LLM fine-tuning. It is activated with a single flag compute_flops=true in the recipe YAML and automatically generates audit documentation to S3 and DynamoDB.

Amazon Web Services released the Fine-Tuning FLOPs Meter on May 12, 2026 — an open toolkit for Amazon SageMaker AI that automates tracking of European AI Act compute thresholds during fine-tuning of large language models. The tool is activated with a single configuration flag in the recipe YAML and records all necessary metrics for audit documentation.

Which thresholds does the EU AI Act apply?

The toolkit implements three thresholds through the determine_compliance_threshold() function. The default 3.3×10²² FLOPs applies when pretraining compute is unknown or below 10²³ FLOPs. The relative threshold of 30% of actual pretraining compute activates when pretraining is documented above 10²³ FLOPs. The Systemic Risk threshold of 3.3×10²⁴ FLOPs applies to models with pretraining compute above 10²⁵ FLOPs — the highest obligation category in the Act.

What does exceeding the threshold mean for an organization?

An organization that exceeds its applicable threshold transitions from “downstream user” to “GPAI model provider” status. The new classification creates additional obligations: detailed disclosure of architecture and training process, a public list of data sources used, and proof of compliance with EU copyright regulations. AWS emphasizes that a status change is the biggest compliance shift a fine-tuning organization can undergo.

How does the Meter integrate with SageMaker?

Activation is a single YAML flag: compute_flops: true in the SageMaker training recipe. The toolkit operates as a standard Hugging Face TrainerCallback, meaning it integrates into an existing SageMaker Training pipeline without architectural changes. Results are automatically uploaded to Amazon S3 and stored in DynamoDB as a permanent compliance record.

The output flops_meter.json contains architecture-based and hardware-based FLOPs calculations, the applied threshold type and compliance status, model parameters, training duration, GPU specifications, and training job identifiers for audit trail linkage. AWS recommends Flops_architecture as the primary compliance metric because it “accurately reflects your actual training configuration.”

The tool is part of AWS’s broader decision to raise regulatory obligations from the level of consulting advice to the level of cloud platform — turning compliance from a manual project into a feature toggle.

Frequently Asked Questions

Which EU AI Act thresholds does this toolkit track?

The toolkit automatically selects the default 3.3×10²² FLOPs threshold when pretraining compute is unknown or below 10²³, the relative threshold of 30% of pretraining compute when documented, and the systemic risk threshold 3.3×10²⁴ FLOPs for models with pretraining compute ≥ 10²⁵.

What obligations arise from exceeding the threshold?

An organization that exceeds its applicable threshold transitions from 'downstream user' to 'GPAI model provider' status; obligations include detailed architectural and training disclosure, a public list of data sources, and proof of compliance with EU copyright law.

AWS: Fine-Tuning FLOPs Meter for SageMaker automates EU AI Act compliance threshold tracking

Which thresholds does the EU AI Act apply?

What does exceeding the threshold mean for an organization?

How does the Meter integrate with SageMaker?

Frequently Asked Questions

Sources

Related news