Find your next role
Discover amazing opportunities across our network of companies committed to gender equality in the workplace.
Amazon
Software Engineering, Product
Cupertino, CA, USA
Do you like building software systems that power the world's largest cloud network? Would you like to play a key role in developing the tools, automation, and data infrastructure that keep AWS network interconnects running at peak performance?
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we're the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we're looking for talented people who want to help.
You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You'll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
The Core Networking team is looking for a Software Development Engineer II to join our Network Product Development (NPD) Interconnects Tools and Metrics (TMX) organization. In this role, you will design, develop, and operate the software systems that enable the NPD Interconnects team to monitor, qualify, and manage interconnect products across the AWS fleet. You will build and maintain tooling, automation, and data infrastructure across test infrastructure, observability and analytics, distributed systems for link operations, and ML model delivery. This role requires strong software engineering skills, the ability to navigate ambiguity across multiple technical domains, and a passion for building scalable, reliable systems that directly impact network availability for AWS customers.
Key job responsibilities
- Design, develop, deploy, and operate software systems that enable the NPD Interconnects team to monitor, qualify, and manage interconnect products across the AWS fleet
- Develop and maintain automated test frameworks and tooling that enable product engineers to validate optical transceivers and fiber connectivity products, scaling test infrastructure to support increasing qualification demands
- Build and maintain data ingestion, processing, and storage systems for optics and fiber telemetry data, enabling product owners to conduct fleet-wide analysis through self-service tooling and dashboards
- Design and deliver distributed systems that orchestrate link-level testing, validation, and troubleshooting workflows across AWS regions, ensuring resilience and scalability
- Collaborate with Applied Scientists to build ML/science model serving infrastructure, operationalizing models that optimize fleet performance and predict failures
- Independently clarify requirements and deliver system-level solutions for technically complex or operationally ambiguous problems, with guidance from senior engineers on architectural direction
- Participate in on-call rotation, lead troubleshooting of production issues, and drive resolution for both individual and large-scale fleet events
- Automate and simplify team operations processes, improving service resilience and performance
- Produce high-quality, well-tested code, actively participate in code reviews, and mentor newer team members to raise the engineering bar
- Communicate effectively about technical work, document system architecture and operations, and collaborate across team boundaries to deliver features in services owned by other organizations
A day in the life
On an everyday basis as part of our team, you have the unique opportunity to understand the growing AWS network and our internal customers’ requirement on interconnect solutions. You'll work backwards to devise hardware solutions by influencing the broad industry and/or to develop software tools with sister teams to maintain a highly available network that delights AWS customers. You design and implement processes and mechanisms that both help the team to deliver business impact to the organization in a systemic way, while also helping to raise the bar on our operational excellence.
Operating at the scale we do, there is no blueprint for how to do what we do, which encourages our engineers to identify and develop simple solutions to complex problems. We encourage durable solutions that look around corners while taking into consideration our customer needs from a cost, performance, and reliability perspective. We work closely with our internal partners that design, build and operate the network to ensure that our solutions meet their needs and exceed their expectations.
About the team
Within AWS Networking the NPD (Network Product Development) organization is responsible for, designing the hardware, building the software, and owning the interconnects for the routers that power the global AWS network. Beyond product delivery we actively manage the fleet or routers in a network that grows by 70% annually. This means tracking key business and operational metrics to ensure that we operate smoothly and minimize or eliminate customer impact due to device related issues for a transparent AWS customer experience.
About AWS
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.