Senior Infrastructure Software Engineer
NVIDIA
We are looking for a Senior Infrastructure Software Engineer to join our Cloud Engineering Services team. This role offers an outstanding opportunity to contribute to designing, constructing, and optimizing large-scale infrastructure for various foundational cloud services. If you possess a deep understanding of cloud infrastructure and distributed systems and thrive in a challenging, innovative environment, this could be your perfect role!
What you'll be doing:
Engage with product engineering teams to gain a comprehensive understanding of their infrastructure use cases, communicate design trade-offs effectively, and construct scalable systems to meet their unique needs.
Develop advanced tooling to automate the build and deployment of microservices and infrastructure components, improving efficiency and productivity.
Proactively identify bottlenecks in the daily usage of core infrastructure and implement robust solutions to resolve them.
Reduce manual labor and increase operational efficiency through automation.
Monitor the infrastructure and alert on significant events to ensure the highest level of system performance and reliability.
What we need to see:
A Bachelor's, Master's, or Ph.D. in Computer Science or a related field, or equivalent experience.
6+ years of hands-on experience designing and building infrastructure to support large-scale, fault-tolerant distributed services.
Strong experience with cloud infrastructure platforms such as AWS, Azure, or Google Cloud.
High proficiency in infrastructure as code (IaC) and configuration management tools such as Terraform.
Expertise in administering, operating, and configuring Kubernetes and Envoy.
Proven experience with Continuous Integration/Continuous Delivery (CI/CD) tools such as GitLab and with the GitOps model.
Proficiency with the monitoring tools Prometheus, Grafana, CloudWatch, and Thanos.
Strong background in one or more general-purpose programming languages such as Go or Python.
Ways to stand out from the crowd:
Experience establishing guidelines and standards for the design, development, lifecycle, and management of HTTP APIs and gRPC services.
Strong knowledge of API specifications such as OpenAPI, Swagger, Protocol Buffers, JSON Schema, AsyncAPI, and GraphQL schemas.
Experience with API management solutions, data interchange formats, and delivering scalable APIs.