Amazon
Do you like to use network and Unix systems engineering to deliver simple, sustainable, and repeatable solutions? Would you like to play a key role in driving the stability and sustainability of next-generation IP networks?
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.
You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
The Core Networking team is looking for an experienced Software Development Manager to lead our Network Product Development (NPD) Interconnects Tools and Metrics organization. In this role, you will build and scale high-performing engineering teams responsible for developing the software infrastructure, telemetry systems, and automation tools that support the interconnects and links underpinning Amazon's global network operations. AWS Core Networking builds and operates the foundational network infrastructure that connects data centers worldwide, supporting AWS customers and Amazon's business units with industry-leading performance and reliability.
As a Software Development Manager, you will define technical strategy, establish organizational capabilities, and drive the delivery of large-scale distributed systems that power network monitoring, quality assurance, and operational intelligence. You will lead teams building ML-driven predictive systems, self-service data platforms, and intelligent troubleshooting tools that transform how we deploy, monitor, and maintain network infrastructure at scale. This role requires strong technical leadership, excellent communication skills to collaborate with cross-functional partners, and a proven track record of building teams that deliver mission-critical software systems. You will stay abreast of emerging technologies and AI/ML capabilities, investigating and implementing innovations that accelerate delivery and improve operational excellence.
Key job responsibilities
- Build the organization and systems that deliver highly available, large-scale distributed software services for Amazon Network telemetry, operational monitoring, and reliability tools that improve network availability and performance
- Establish the strategy and team capabilities for automated quality monitoring and predictive systems for interconnect products that provide actionable intelligence to product owners, supply chain teams, and operational stakeholders
- Define our ML initiatives and build a team capable of delivering proactive network reliability, transforming reactive troubleshooting into preventive maintenance that reduces customer impact
- Own the strategy and execution of network validation and troubleshooting tools that improve deployment efficiency and provide intelligent, context-driven recommendations for operational issues
- Define the technical vision for scalable, self-service data infrastructure and telemetry platforms that reduce operational maintenance overhead and enable rapid data onboarding
- Set technical direction and build organizational mechanisms that ensure architectural excellence and scalability across all team deliverables
- Develop and communicate monthly, quarterly, and yearly strategy and roadmaps aligned with project priorities, business objectives, and team capabilities
- Build team processes and operational rhythms across the full development lifecycle, balancing on-call and operational workload with project delivery and milestone commitments
- Hire, mentor, and develop a high-performing software development team while fostering a culture of technical excellence and continuous improvement
- Collaborate and influence across multiple organizations to align partner and dependent teams, driving delivery of software infrastructure that spans organizational boundaries
- Communicate complex technical information clearly and concisely to diverse audiences, including engineers, leadership, and cross-functional stakeholders
- Set technical direction and establish engineering best practices that scale across the organization
- Establish metrics, mechanisms, and feedback loops that drive continuous improvement in team delivery and operational excellence
A day in the life
As part of our team, you have the unique opportunity every day to understand the growing AWS network and our internal customers’ requirements for interconnect solutions. You'll work backwards to devise hardware solutions by influencing the broader industry and/or to develop software tools with sister teams to maintain a highly available network that delights AWS customers. You'll design and implement processes and mechanisms that help the team deliver business impact to the organization in a systematic way while also raising the bar on our operational excellence.
Operating at the scale we do, there is no blueprint for how to do what we do, which encourages our engineers to identify and develop simple solutions to complex problems. We encourage durable solutions that look around corners while taking into consideration our customer needs from a cost, performance, and reliability perspective. We work closely with our internal partners that design, build and operate the network to ensure that our solutions meet their needs and exceed their expectations.
About the team
Within AWS Networking, the NPD (Network Product Development) organization is responsible for designing the hardware, building the software, and owning the interconnects for the routers that power the global AWS network. Beyond product delivery, we actively manage the fleet of routers in a network that grows by 70% annually. This means tracking key business and operational metrics to ensure that we operate smoothly and minimize or eliminate customer impact due to device-related issues for a transparent AWS customer experience.
About AWS
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
We value work-life harmony. Achieving success at work should never require sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.