Find your next role

Discover amazing opportunities across our network of companies committed to gender equality in the workplace.

Software Development Engineer- GENAI/ML, Amazon Catalog System Services(Level 5)

Amazon

Amazon

Software Engineering, Data Science
Seattle, WA, USA
Posted on Jul 3, 2025

DESCRIPTION

Join the Veritas team within Amazon's Selection and Catalog Systems (ASCS) organization as a Software Development Engineer focused on GENAI/ML initiatives. The Veritas team owns Amazon's premier LLM benchmarking and evaluation platform, which is critical for measuring and improving AI performance across the world's largest e-commerce product catalog.

In this role, you'll work directly with Large Language Models (LLMs) to enhance catalog data quality and customer experience at large scale. You'll have extensive opportunities to work with in-house LLM hosting and inference systems, Amazon Bedrock, prompt tuning, and optimization techniques. As part of the team that evaluates AI performance across billions of products and attributes, you'll help build and leverage AI agents at scale to assess LLM models, their applications, and the customer experiences they power.

The Veritas team provides a unique opportunity to combine advanced generative AI development with large-scale distributed systems engineering, while working on benchmarking and evaluation frameworks that teams across Amazon depend on for their AI development and deployment decisions.

Key job responsibilities
As a Software Development Engineer (SDE) in Veritas, you will develop distributed systems powered by LLMs and multi-modal ML models to enhance benchmarking and evaluation across Amazon's catalog ecosystem. Build GenAI-driven solutions that improve evaluation quality and automation for Nova, Rufus, Starfish, and other Store Agent systems. Design AI-driven workflows for various data to enhance LLM benchmarking and performance measurement. Work extensively with Amazon Bedrock, in-house LLM hosting, and inference systems to build scalable evaluation pipelines that assess catalog-related customer experiences including media content (Image, A+, Video). Partner with scientists and AI experts to integrate advanced developments in Generative AI, LLM evaluation, and prompt optimization. Create comprehensive evaluation methodologies spanning traditional attribute quality metrics to advanced use cases like attribute prioritization and consistency evaluation.

The ideal candidate brings experience in distributed systems, designing and implementing high-scale software services, and agile, continuous delivery practices. You are a Software Development Engineer who takes ownership of services, puts customers first, and is committed to delivering high-quality solutions.

About the team
The Veritas team is a specialized, innovation-focused group within Amazon Selection and Catalog Systems (ASCS) - Amazon's Catalog System Services (CSS) organization. We own Amazon's premier LLM evaluation and benchmarking platform (Veritas), used by teams across ASCS and the broader company to measure and improve model performance for catalog applications. We evaluate AI across billions of products and attributes, collaborating with science teams on catalog-specific AI research, prompt optimization, and model evaluation. You'll work with the latest generative AI technology, including custom model hosting infrastructure and advanced prompt engineering tools. Growth opportunities include leading industry-defining benchmarking standards for e-commerce AI and taking on leadership roles across Amazon's catalog ecosystem.

We foster a collaborative environment where innovation thrives, technical excellence is celebrated, and every team member shapes the future of AI-powered catalog systems while maintaining work-life balance and continuous learning.