Find your next role

Discover amazing opportunities across our network of companies committed to gender equality in the workplace.

Site Reliability Engineer II - HashiCorp Platform DR

IBM

IBM

Software Engineering
Bengaluru, Karnataka, India
Posted on Oct 24, 2025
Introduction

A career in IBM Software means you'll be part of a team that transforms our customer's challenges into industry-leading solutions. We are an infinitely curious team, always seeking new possibilities, and dedicated to creating the world's leading AI-powered, cloud-native software solutions. Our renowned legacy creates endless global opportunities for our network of IBMers. We are a team of deep product experts, ensuring exceptional client experiences, with a focus on delivery, excellence, and obsession over customer outcomes. This position involves contributing to HashiCorp's offerings, now part of IBM, which empower organizations to automate and secure multi-cloud and hybrid environments. You will join a team managing the lifecycle of infrastructure and security, enhancing IBM's cloud solutions to ensure enterprises achieve efficiency, security, and scalability in their cloud journey.

Your role and responsibilities

At HashiCorp, we build the Infrastructure Cloud to help enterprises take a unified approach to reliability, disaster recovery, and operational resilience across cloud and enterprise environments. Our team ensures HashiCorp’s products meet the highest standards of availability, performance, and fault tolerance, enabling organizations to operate at scale with confidence.

As a Engineer on the HashiCorp Disaster Recovery team, you will help build and own solutions that strengthen disaster recovery (DR) governance and reliability across our cloud products. Your work will focus on system resilience, operational readiness, and high availability, making a real impact on the reliability of our platform.

We deliver the Infrastructure Cloud through our enterprise-grade SaaS platform, HCP, as well as through self-managed, on-premises solutions. Across our platform engineering teams, we’re looking for great engineers to help us build the future of reliable, scalable infrastructure!

What you’ll do (responsibilities)

  • Design, implement, and optimize disaster recovery (DR) solutions to enhance system resilience, ensuring high availability and fault tolerance across cloud products.

  • Develop and execute comprehensive DR testing strategies, identifying bottlenecks and failure points that impact Recovery Point (RPO) and Recovery Time Objectives (RTO).

  • Drive compliance and reliability initiatives, integrating DR best practices into system architecture and leveraging Chaos Engineering to validate failure scenarios.

  • Build scalable automation frameworks for testing, incident simulation, and recovery orchestration, reducing manual effort and improving operational efficiency.

  • Collaborate cross-functionally with engineering, product, and infrastructure teams to embed operational readiness into development lifecycles.

  • Lead incident/DR response drills and chaos experiments, analyzing test results, documenting findings, and implementing proactive improvements.

  • Monitor system performance and availability, developing dashboards and observability tools to provide actionable insights for reliability improvements.

  • Mentor engineers and foster a culture of resilience, promoting best practices in system design, testing, and disaster recovery preparedness.

Required education
Bachelor's Degree
Preferred education
Master's Degree
Required technical and professional expertise
  • 3+ years of experience in software development, reliability engineering, systems engineering, or non-functional testing, with a focus on disaster recovery, backup, and cloud resilience.

  • Proficiency in Golang and hands-on experience with version control systems such as Git or GitLab, ensuring maintainable and scalable code.

  • Strong understanding of microservices architecture and best practices for designing resilient, distributed systems in cloud environments.

  • Experience with CI/CD pipelines, ensuring automation, quality, and reliability in software delivery.

  • Exposure to cloud platforms (AWS, Azure, or GCP) and container orchestration technologies like Nomad or Kubernetes.

  • Strong collaboration and communication skills, with the ability to work cross-functionally and articulate technical concepts to diverse teams.

  • Commitment to continuous learning in reliability engineering, with an interest in enhancing disaster recovery strategies and system resilience.

  • Customer-centric and systems-thinking mindset, focused on delivering high-quality, scalable, and fault-tolerant solutions.

Preferred technical and professional experience
  • You have experience using HashiCorp products (Terraform, Packer, Waypoint, Nomad, Vault, Boundary, Consul).

  • Exposure to disaster recovery domain or worked on any product testing for DR is a plus

ABOUT BUSINESS UNIT

IBM Software infuses core business operations with intelligence—from machine learning to generative AI—to help make organizations more responsive, productive, and resilient. IBM Software helps clients put AI into action now to create real value with trust, speed, and confidence across digital labor, IT automation, application modernization, security, and sustainability. Critical to this is the ability to make use of all data, because AI is only as good as the data that fuels it. In most organizations data is spread across multiple clouds, on premises, in private datacenters, and at the edge. IBM’s AI and data platform scales and accelerates the impact of AI with trusted data, and provides leading capabilities to train, tune and deploy AI across business. IBM’s hybrid cloud platform is one of the most comprehensive and consistent approach to development, security, and operations across hybrid environments—a flexible foundation for leveraging data, wherever it resides, to extend AI deep into a business.

YOUR LIFE @ IBM

In a world where technology never stands still, we understand that, dedication to our clients success, innovation that matters, and trust and personal responsibility in all our relationships, lives in what we do as IBMers as we strive to be the catalyst that makes the world work better.

Being an IBMer means you’ll be able to learn and develop yourself and your career, you’ll be encouraged to be courageous and experiment everyday, all whilst having continuous trust and support in an environment where everyone can thrive whatever their personal or professional background.

Our IBMers are growth minded, always staying curious, open to feedback and learning new information and skills to constantly transform themselves and our company. They are trusted to provide on-going feedback to help other IBMers grow, as well as collaborate with colleagues keeping in mind a team focused approach to include different perspectives to drive exceptional outcomes for our customers. The courage our IBMers have to make critical decisions everyday is essential to IBM becoming the catalyst for progress, always embracing challenges with resources they have to hand, a can-do attitude and always striving for an outcome focused approach within everything that they do.

Are you ready to be an IBMer?

ABOUT IBM

IBM’s greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.

Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we’re also one of the biggest technology and consulting employers, with many of the Fortune 500 companies relying on the IBM Cloud to run their business.

At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it’s time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.

IBM is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, genetics, pregnancy, disability, neurodivergence, age, or other characteristics protected by the applicable law. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.

OTHER RELEVANT JOB DETAILS

When applying to jobs of your interest, we recommend that you do so for those that match your experience and expertise. Our recruiters advise that you apply to not more than 3 roles in a year for the best candidate experience. For additional information about location requirements, please discuss with the recruiter following submission of your application.