Senior Infrastructure Software Engineer

NVIDIA

NVIDIA

Software Engineering, Other Engineering
Multiple locations
Posted on Nov 8, 2024

We are looking for a Senior Infrastructure Software Engineer to join our Cloud Engineering Services team. This role offers an outstanding opportunity to contribute to designing, constructing, and optimizing large-scale infrastructure for various foundational cloud services. If you possess a deep understanding of cloud infrastructure and distributed systems and thrive in a challenging, innovative environment, this could be your perfect role!

What you'll be doing:

  • You will engage with product engineering teams to gain a comprehensive understanding of their infrastructure use cases. Communicate design trade-offs effectively and construct scalable systems to meet their unique needs.

  • Develop advanced tooling to automate the build and deployment of microservices and infrastructure components, improving efficiency and productivity.

  • Proactively identify bottlenecks in the daily usage of core infrastructure and implement robust solutions to resolve them.

  • We reduce manual labor and increase operational efficiency through automation.

  • Monitor the infrastructure to alert on significant events, ensuring the highest system performance and reliability level.

What we need to see:

  • A Bachelor's, Master's, or Ph.D. in Computer Science or a related field, or equivalent experience.

  • 6+ years of hands-on experience designing and building infrastructure to support large-scale, fault-tolerant distributed services.

  • Strong experience with cloud infrastructure platforms such as AWS, Azure, or Google Cloud.

  • High proficiency in infrastructure as code (IaC) and configuration management tools such as Terraform.

  • Expertise in administering, operating, and configuring Kubernetes and Envoy.

  • Proven experience in Continuous Integration/Continuous Delivery (CI/CD) tools like Gitlab and GitOps model.

  • Proficiency in monitoring tools Prometheus, Grafana, Cloudwatch, and Thanos.

  • Strong background in one or more general-purpose programming languages such as Go and Python.

Ways to stand out from the crowd:

  • Establish guidelines and standards for design, development, lifecycle, and management for HTTP APIs and gRPC services.

  • Strong knowledge of API specifications such as OpenAPI, Swagger, Protocol buffers, JSON Schema, AsyncAPI, GraphQL schemas.

  • Experience in API Management solutions, data interchange formats, and delivering scalable APIs.