.NET Ramblings
open-menu closeme
Home
Videos
Feeds
About
Contribute
github linkedin
  • Efficiently serve dozens of fine-tuned models with vLLM on Amazon SageMaker AI and Amazon Bedrock

    calendar Feb 25, 2026 · aws.amazon.com/blogs/machine-learning
    Efficiently serve dozens of fine-tuned models with vLLM on Amazon SageMaker AI and Amazon Bedrock

    In this post, we explain how we implemented multi-LoRA inference for Mixture of Experts (MoE) models in vLLM, describe the kernel-level optimizations we performed, and show you how you can benefit from this work. We use GPT-OSS 20B as our primary example throughout this post. Link to article: …


    Read More
  • Building intelligent event agents using Amazon Bedrock AgentCore and Amazon Bedrock Knowledge Bases

    calendar Feb 25, 2026 · aws.amazon.com/blogs/machine-learning
    Building intelligent event agents using Amazon Bedrock AgentCore and Amazon Bedrock Knowledge Bases

    This post demonstrates how to quickly deploy a production-ready event assistant using the components of Amazon Bedrock AgentCore. We'll build an intelligent companion that remembers attendee preferences and builds personalized experiences over time, while Amazon Bedrock AgentCore handles the heavy lifting of production …


    Read More
  • Build an intelligent photo search using Amazon Rekognition, Amazon Neptune, and Amazon Bedrock

    calendar Feb 24, 2026 · aws.amazon.com/blogs/machine-learning
    Build an intelligent photo search using Amazon Rekognition, Amazon Neptune, and Amazon Bedrock

    In this post, we show you how to build a comprehensive photo search system using the AWS Cloud Development Kit (AWS CDK) that integrates Amazon Rekognition for face and object detection, Amazon Neptune for relationship mapping, and Amazon Bedrock for AI-powered captioning. Link to article: …


    Read More
  • Train CodeFu-7B with veRL and Ray on Amazon SageMaker Training jobs

    calendar Feb 24, 2026 · aws.amazon.com/blogs/machine-learning
    Train CodeFu-7B with veRL and Ray on Amazon SageMaker Training jobs

    In this post, we demonstrate how to train CodeFu-7B, a specialized 7-billion parameter model for competitive programming, using Group Relative Policy Optimization (GRPO) with veRL, a flexible and efficient training library for large language models (LLMs) that enables straightforward extension of diverse RL algorithms …


    Read More
  • Generate structured output from LLMs with Dottxt Outlines in AWS

    calendar Feb 24, 2026 · aws.amazon.com/blogs/machine-learning
    Generate structured output from LLMs with Dottxt Outlines in AWS

    This post explores the implementation of Dottxt’s Outlines framework as a practical approach to implementing structured outputs using AWS Marketplace in Amazon SageMaker. Link to article: https://aws.amazon.com/blogs/machine-learning/generate-structured-output-from-llms-with-dottxt-outlines-in-aws/


    Read More
  • Global cross-Region inference for latest Anthropic Claude Opus, Sonnet and Haiku models on Amazon Bedrock in Thailand, Malaysia, Singapore, Indonesia, and Taiwan

    calendar Feb 24, 2026 · aws.amazon.com/blogs/machine-learning
    Global cross-Region inference for latest Anthropic Claude Opus, Sonnet and Haiku models on Amazon Bedrock in Thailand, Malaysia, Singapore, Indonesia, and Taiwan

    In this post, we are exciting to announce availability of Global CRIS for customers in Thailand, Malaysia, Singapore, Indonesia, and Taiwan and give a walkthrough of technical implementation steps, and cover quota management best practices to maximize the value of your AI Inference deployments. We also provide guidance …


    Read More
  • Introducing Amazon Bedrock global cross-Region inference for Anthropic’s Claude models in the Middle East Regions (UAE and Bahrain)

    calendar Feb 24, 2026 · aws.amazon.com/blogs/machine-learning
    Introducing Amazon Bedrock global cross-Region inference for Anthropic’s Claude models in the Middle East Regions (UAE and Bahrain)

    We’re excited to announce the availability of Anthropic’s Claude Opus 4.6, Claude Sonnet 4.6, Claude Opus 4.5, Claude Sonnet 4.5, and Claude Haiku 4.5 through Amazon Bedrock global cross-Region inference for customers operating in the Middle East. In this post, we guide you through the capabilities of each Anthropic …


    Read More
  • Scaling data annotation using vision-language models to power physical AI systems

    calendar Feb 23, 2026 · aws.amazon.com/blogs/machine-learning
    Scaling data annotation using vision-language models to power physical AI systems

    In this post, we examine how Bedrock Robotics tackles this challenge. By joining the AWS Physical AI Fellowship, the startup partnered with the AWS Generative AI Innovation Center to apply vision-language models that analyze construction video footage, extract operational details, and generate labeled training datasets …


    Read More
  • How Sonrai uses Amazon SageMaker AI to accelerate precision medicine trials

    calendar Feb 23, 2026 · aws.amazon.com/blogs/machine-learning
    How Sonrai uses Amazon SageMaker AI to accelerate precision medicine trials

    In this post, we explore how Sonrai, a life sciences AI company, partnered with AWS to build a robust MLOps framework using Amazon SageMaker AI that addresses these challenges while maintaining the traceability and reproducibility required in regulated environments. Link to article: …


    Read More
  • Accelerating AI model production at Hexagon with Amazon SageMaker HyperPod

    calendar Feb 23, 2026 · aws.amazon.com/blogs/machine-learning
    Accelerating AI model production at Hexagon with Amazon SageMaker HyperPod

    In this blog post, we demonstrate how Hexagon collaborated with Amazon Web Services to scale their AI model production by pretraining state-of-the-art segmentation models, using the model training infrastructure of Amazon SageMaker HyperPod. Link to article: …


    Read More
  • Agentic AI with multi-model framework using Hugging Face smolagents on AWS

    calendar Feb 23, 2026 · aws.amazon.com/blogs/machine-learning
    Agentic AI with multi-model framework using Hugging Face smolagents on AWS

    Hugging Face smolagents is an open source Python library designed to make it straightforward to build and run agents using a few lines of code. We will show you how to build an agentic AI solution by integrating Hugging Face smolagents with Amazon Web Services (AWS) managed services. You'll learn how to deploy a …


    Read More
  • Amazon SageMaker AI in 2025, a year in review part 1: Flexible Training Plans and improvements to price performance for inference workloads

    calendar Feb 20, 2026 · aws.amazon.com/blogs/machine-learning
    Amazon SageMaker AI in 2025, a year in review part 1: Flexible Training Plans and improvements to price performance for inference workloads

    In 2025, Amazon SageMaker AI saw dramatic improvements to core infrastructure offerings along four dimensions: capacity, price performance, observability, and usability. In this series of posts, we discuss these various improvements and their benefits. In Part 1, we discuss capacity improvements with the launch of …


    Read More
  • Amazon SageMaker AI in 2025, a year in review part 2: Improved observability and enhanced features for SageMaker AI model customization and hosting

    calendar Feb 20, 2026 · aws.amazon.com/blogs/machine-learning
    Amazon SageMaker AI in 2025, a year in review part 2: Improved observability and enhanced features for SageMaker AI model customization and hosting

    In 2025, Amazon SageMaker AI made several improvements designed to help you train, tune, and host generative AI workloads. In Part 1 of this series, we discussed Flexible Training Plans and price performance improvements made to inference components. In this post, we discuss enhancements made to observability, model …


    Read More
  • Integrate external tools with Amazon Quick Agents using Model Context Protocol (MCP)

    calendar Feb 20, 2026 · aws.amazon.com/blogs/machine-learning
    Integrate external tools with Amazon Quick Agents using Model Context Protocol (MCP)

    In this post, you’ll use a six-step checklist to build a new MCP server or validate and adjust an existing MCP server for Amazon Quick integration. The Amazon Quick User Guide describes the MCP client behavior and constraints. This is a “How to” guide for detailed implementation required by 3P partners to integrate …


    Read More
  • Build AI workflows on Amazon EKS with Union.ai and Flyte

    calendar Feb 19, 2026 · aws.amazon.com/blogs/machine-learning
    Build AI workflows on Amazon EKS with Union.ai and Flyte

    In this post, we explain how you can use the Flyte Python SDK to orchestrate and scale AI/ML workflows. We explore how the Union.ai 2.0 system enables deployment of Flyte on Amazon Elastic Kubernetes Service (Amazon EKS), integrating seamlessly with AWS services like Amazon Simple Storage Service (Amazon S3), Amazon …


    Read More
  • Amazon Quick now supports key pair authentication to Snowflake data source

    calendar Feb 19, 2026 · aws.amazon.com/blogs/machine-learning
    Amazon Quick now supports key pair authentication to Snowflake data source

    In this blog post, we will guide you through establishing data source connectivity between Amazon Quick Sight and Snowflake through secure key pair authentication. Link to article: https://aws.amazon.com/blogs/machine-learning/amazon-quick-suite-now-supports-key-pair-authentication-to-snowflake-data-source/


    Read More

.NET Ramblings

Copyright  .NET RAMBLINGS. All Rights Reserved

to-top