Amazon SageMaker AI now supports serverless model customization and reinforcement fine-tuning for 12 additional open-weight models, enabling you to fine-tune and evaluate them without provisioning or managing infrastructure. The newly supported models are: gpt-oss-120b, Qwen2.5 72B Instruct, DeepSeek-R1-Distill-Llama-70B, Qwen3 14B, DeepSeek-R1-Distill-Qwen-14B, Qwen2.5 14B Instruct, DeepSeek-R1-Distill-Llama-8B, DeepSeek-R1-Distill-Qwen-7B, Qwen3 4B, Meta Llama 3.2 3B Instruct, Qwen3 1.7B, and DeepSeek-R1-Distill-Qwen-1.5B. With this expansion, you can customize these models using supervised fine-tuning (SFT), direct preference optimization (DPO), and reinforcement fine-tuning (RFT) techniques including RLVR and RLAIF, and only pay for what you use.
Reinforcement fine-tuning enables you to align models to complex, domain-specific reasoning tasks where techniques such as traditional SFT alone fall short. With RLVR, you can improve model accuracy on verifiable tasks such as code generation, math, and structured extraction by providing reward signals based on correctness. RLAIF uses AI-generated feedback to steer model behavior toward your quality and safety preferences. These techniques are available on previously supported and newly added models, with no cluster setup, capacity planning, or distributed training expertise required.
These models and fine-tuning techniques are available in US East (N. Virginia), US West (Oregon), Asia Pacific (Tokyo), and EU (Ireland). To get started, see the Amazon SageMaker AI model customization product page and visit the Amazon SageMaker AI pricing page (Model Customization tab) to see the full list of models, techniques, and prices.
Categories: general:products/amazon-sagemaker,general:products/amazon-machine-learning,marketing:marchitecture/artificial-intelligence
Source: Amazon Web Services
Latest Posts
- Amazon Connect Customer now supports embedding Cases and Customer Profiles in custom agent applications

- Collect Diagnostics change to Get Diagnostics for Outlook Mobile and Mac [MC1308855]
![Collect Diagnostics change to Get Diagnostics for Outlook Mobile and Mac [MC1308855] 3 pexels megan forbes 347998 963436](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- Action required: Upgrade macOS 13 devices to maintain Teams desktop access [MC1308857]
![Action required: Upgrade macOS 13 devices to maintain Teams desktop access [MC1308857] 4 pexels pixabay 163036](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- Microsoft 365 Copilot (including Copilot Chat): Admin notifications for Copilot mobile app on macOS [MC1308856]
![Microsoft 365 Copilot (including Copilot Chat): Admin notifications for Copilot mobile app on macOS [MC1308856] 5 pexels pixabay 290470](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)


![Collect Diagnostics change to Get Diagnostics for Outlook Mobile and Mac [MC1308855] 3 pexels megan forbes 347998 963436](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-megan-forbes-347998-963436-150x150.webp)
![Action required: Upgrade macOS 13 devices to maintain Teams desktop access [MC1308857] 4 pexels pixabay 163036](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-pixabay-163036-150x150.webp)
![Microsoft 365 Copilot (including Copilot Chat): Admin notifications for Copilot mobile app on macOS [MC1308856] 5 pexels pixabay 290470](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-pixabay-290470-150x150.webp)
