What you'll do
- Lead design and development of next-generation AWS AI/ML cloud servers with a focus on performance and scalability.
- Collaborate cross-functionally with engineers, TPMs, and managers across AWS hardware and software teams.
- Own system architecture, debugging, and automation to improve server reliability, testability, and diagnostics.
- Develop and implement AI-driven automation tools and workflows to enhance engineering productivity.
- Drive innovation in cloud infrastructure hardware and software stack, from baremetal to userland software.
What you should know
- Ideal candidates are innovative self-starters with deep system-level knowledge across hardware and software.
- The role involves complex problem solving with a focus on reliability, scalability, and diagnostics.
- Applicants should be comfortable working in a fast-paced, collaborative, and high-impact environment.
- Opportunity to work at the intersection of AI automation and cloud platform development.
- Ownership and direct impact on product improvements and AWS’s bottom line are key aspects.
About the company
- AWS is a global leader in cloud computing, pioneering scalable and innovative cloud services.
- The company values diversity and inclusion, fostering an environment where bold ideas are welcomed.
- AWS operates at massive scale, supporting millions of customers with cutting-edge infrastructure.
- Strong emphasis on employee growth, mentorship, and work-life balance within a fast-paced environment.
- AWS Hardware Engineering is critical to the business, delivering industry-leading server designs.
Key required skills
C++PythonGox86 architectureLinux kernelPCIeSystem debuggingAutomation