SurfaceSURFACE BREAK

NVIDIA Nemotron 3 Nano now available on Amazon Bedrock serverless

Amazon Web Services has added NVIDIA Nemotron 3 Nano to Amazon Bedrock as a fully managed serverless model, making the compact language model available to developers on a pay-per-token basis without infrastructure provisioning.

VERIFIEDConfidence: 80%

Amazon Web Services has added NVIDIA Nemotron 3 Nano to Amazon Bedrock as a fully managed serverless model, making the compact language model available to developers without requiring any infrastructure provisioning. The addition means developers can call Nemotron 3 Nano through Bedrock's standard API on a pay-per-token basis, with AWS handling all underlying compute scaling automatically.

Previously, running NVIDIA's Nemotron models required self-managed infrastructure or direct access through NVIDIA's own APIs. The Bedrock serverless tier removes that barrier: developers pay only for tokens processed, with no minimum commitment and no idle compute costs. For teams already building on AWS, access requires no new service agreements -- Nemotron 3 Nano is available through the same Bedrock console and SDK used for other foundation models including Claude, Llama, and Mistral. The addition expands the Bedrock catalog's NVIDIA coverage and reflects the continued push by cloud providers to make specialized AI models accessible as managed, on-demand services rather than infrastructure challenges. Nemotron 3 Nano is designed for efficient inference at lower compute cost, making it a practical fit for the serverless, pay-as-you-go model that Bedrock offers.

Newsletter

Stay informed. The best AI coverage, delivered weekly.

BRIEFSurface

Google's Lyria 3 Pro extends AI music from a jingle to an actual song

Google announced Lyria 3 Pro on March 25, 2026, raising its AI music generator's output limit from 30 seconds to 3 minutes — six times longer than the base model released five weeks ago. The upgrade also teaches the model how songs are structured. That combination moves Lyria from novelty into something closer to a production tool.

Mar 25, 2026

BRIEFSurface

OpenAI's next model is nearly ready, and Altman says it can move the economy

OpenAI has completed pre-training on a new model codenamed "Spud," which CEO Sam Altman described to employees as "very strong" and capable of meaningfully accelerating the economy. The release is expected within weeks. The announcement comes with a consequential governance shift: Altman is stepping back from direct oversight of safety and security teams to focus on fundraising and infrastructure.

Mar 25, 2026

SIGNALSurface

OpenAI shutters AI video generator Sora

OpenAI has shut down Sora, its AI video generation platform, discontinuing both the consumer app and developer API with no replacement or migration path announced.

Mar 25, 2026

NVIDIA Nemotron 3 Nano now available on Amazon Bedrock serverless

Related

Google's Lyria 3 Pro extends AI music from a jingle to an actual song

OpenAI's next model is nearly ready, and Altman says it can move the economy

OpenAI shutters AI video generator Sora