Find your next role

Discover amazing opportunities across our network of companies committed to gender equality in the workplace.

Site Reliability Engineer Manager

IBM

IBM

Software Engineering, Operations
Budapest, Hungary
Posted on Jan 28, 2026
Introduction

At IBM Infrastructure, we design and operate the systems that keep the world running. From high-resiliency mainframes and hybrid cloud platforms to networking, automation, and site reliability. Our teams ensure the performance, security, and scalability that clients and industries depend on every day. Working in Infrastructure means tackling complex challenges with curiosity and collaboration. You’ll work with diverse technologies and colleagues worldwide to deliver resilient, future-ready solutions that power innovation. With continuous learning, career growth, and a supportive culture, IBM provides the opportunities to build expertise and shape the infrastructure that drives progress.

Your role and responsibilities

At Cloud Data Services, we deliver fully managed, highly-available data services running on IBM Cloud’s global infrastructure.

We are seeking a Database Reliability Engineering Manager to grow and lead engineers in Budapest responsible for running and improving our managed database portfolio, including PostgreSQL, Redis, Elasticsearch, RabbitMQ, MySQL, and MongoDB. This position is ideal for a technical leader who thrives at the intersection of distributed systems engineering, cloud operations, and people leadership.

You will provide both technical and people leadership. You will mentor your team, cultivate a culture of engineering excellence, and guide architectural decisions that enhance the reliability, scalability, and performance of our data services

You will collaborate and align team priorities and contributions with other regional database reliability managers who own the operational health of our database services, overseeing uptime, resiliency, and long-term stability. This includes enhancing management scripts for our database services, introducing new metrics and alerts, writing tooling to support investigations and resolution, and writing automation for self-healing and auto-tuning. You will lead incident response when issues arise, coordinate cross-team efforts to quickly contain and resolve problems, and ensure that post-incident reviews lead to meaningful, long-term improvements. You’ll partner with development teams to harden our database platforms, enhance observability, refine deployment pipelines, and build mechanisms that make our services more resilient.

We are a “You build it, You run it” culture. As a member of the management team, you will participate in a “follow-the-sun” rotation (daytime only, including weekends) where you serve as the primary escalation point for data service operators to work through and manage high urgency operational issues, client impacting events, or critical client tickets. You will organize response and resolution, support responders with your expertise, communicate status, and lead root cause analysis.

Collaboration will be essential as you work closely with our globally-distributed development, support, security, and infrastructure teams. You will help shape roadmap priorities, ensure customer-centric reliability improvements, and maintain alignment across technical and business stakeholders. Additionally, you will be responsible for forecasting capacity, planning for future scaling needs, and guiding long-term strategies that support the growth of IBM Cloud Databases and the evolving needs of customers.

Required education
Bachelor's Degree
Preferred education
Bachelor's Degree
Required technical and professional expertise
  • 3+ years of experience in SRE, DevOps, Cloud Platform Engineering, or Site Reliability Engineering roles.
  • 3+ years managing engineering teams operating large-scale distributed production systems.
  • Hands-on expertise with at least PostgreSQL, Redis, Elasticsearch, RabbitMQ, MySQL, or MongoD
  • Strong understanding of observability and diagnostics—metrics, logs, tracing, alerts, and dashboards.
  • Excellent communication and leadership skills, especially in complex, globally distributed environments.

Preferred technical and professional experience
  • Experience operating or building managed Database-as-a-Service (DBaaS) offerings.
  • Demonstrated success implementing operational rigor—on-call programs, playbooks, runbooks, automation, and incident response maturity.
  • Proficiency with automation languages (Python, Go, Bash) and orchestration (Kubernetes).

ABOUT BUSINESS UNIT

IBM Systems helps IT leaders think differently about their infrastructure. IBM servers and storage are no longer inanimate - they can understand, reason, and learn so our clients can innovate while avoiding IT issues. Our systems power the world’s most important industries and our clients are the architects of the future. Join us to help build our leading-edge technology portfolio designed for cognitive business and optimized for cloud computing.

YOUR LIFE @ IBM

In a world where technology never stands still, we understand that, dedication to our clients success, innovation that matters, and trust and personal responsibility in all our relationships, lives in what we do as IBMers as we strive to be the catalyst that makes the world work better.

Being an IBMer means you’ll be able to learn and develop yourself and your career, you’ll be encouraged to be courageous and experiment everyday, all whilst having continuous trust and support in an environment where everyone can thrive whatever their personal or professional background.

Our IBMers are growth minded, always staying curious, open to feedback and learning new information and skills to constantly transform themselves and our company. They are trusted to provide on-going feedback to help other IBMers grow, as well as collaborate with colleagues keeping in mind a team focused approach to include different perspectives to drive exceptional outcomes for our customers. The courage our IBMers have to make critical decisions everyday is essential to IBM becoming the catalyst for progress, always embracing challenges with resources they have to hand, a can-do attitude and always striving for an outcome focused approach within everything that they do.

Are you ready to be an IBMer?

ABOUT IBM

IBM’s greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.

Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we’re also one of the biggest technology and consulting employers, with many of the Fortune 500 companies relying on the IBM Cloud to run their business.

At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it’s time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.

IBM is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, genetics, pregnancy, disability, neurodivergence, age, or other characteristics protected by the applicable law. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.

OTHER RELEVANT JOB DETAILS

For additional information about location requirements, please discuss with the recruiter following submission of your application.