What you'll do
- Responsible for building and evolving Braze's internal infrastructure as a service (IaaS) platform to support engineering teams.
- Develop and maintain large-scale distributed processing frameworks handling trillions of jobs daily using technologies like Sidekiq.
- Collaborate closely with multiple engineering teams to define, implement, and automate infrastructure services for improved reliability and scalability.
- Participate in incident management including on-call rotations to ensure system availability and continuous improvement.
- Focus on automation, observability, and operational safety to reduce toil and enhance platform reliability at massive scale.
What you should know
- Candidates should be prepared to work in a fast-paced, high-scale environment supporting billions of users and data points.
- The role requires an enthusiastic, proactive attitude with a strong desire to automate and improve engineering workflows.
- Applicants will engage in cross-team collaboration and documentation to enable smooth, scalable infrastructure usage.
- The position involves on-call responsibilities and incident management to maintain enterprise-grade SLAs.
- There are strong opportunities for professional growth supported by formal career pathing and a learning stipend.
About the company
- Braze is a leading customer engagement platform recognized for innovation in marketing technology and AI-powered personalization.
- The company values a collaborative, kind, and passionate culture with a strong emphasis on equity, inclusion, and work-life harmony.
- Braze is a global company with offices worldwide and a workforce that embraces asynchronous collaboration across remote teams.
- They offer a comprehensive Total Rewards package including competitive compensation, equity, and extensive benefits.
- Braze has been repeatedly recognized as a Great Place to Work and a leader in marketing technology by industry analysts.
Key required skills
Ruby on RailsSidekiqKafkaKubernetesDistributed systems