Senior DevOps & Cloud Infrastructure Engineer (GCP)
Budget: $20.0 - $30.0
HOURLY / FULL_TIME
⭐ 4.98 (53)
United States
cloud-architecture, python, google-cloud-platform, devops, kubernetes, docker, cicd-platforms, automated-deployment
We are looking for a highly capable Senior DevOps & Cloud Infrastructure Engineer to help design, scale, secure, and operate our cloud platform running on Google Cloud Platform (GCP).
This role is focused on cloud infrastructure, Kubernetes, platform engineering, automation, observability, and production reliability. While experience with edge devices or IoT systems is a plus, our primary need is a strong cloud infrastructure engineer who can build and operate reliable production systems at scale.
Responsibilities
Cloud Infrastructure & Platform Engineering
Design, deploy, and maintain scalable infrastructure on Google Cloud Platform (GCP)
Manage and optimize production services running on:
GKE (Google Kubernetes Engine)
Cloud Run
Pub/Sub
Cloud SQL
Artifact Registry
IAM
VPC Networking
Cloud Monitoring & Logging
Improve cloud scalability, reliability, performance, and operational efficiency
Implement Infrastructure-as-Code using Terraform
Kubernetes & Platform Operations
Design and manage production Kubernetes clusters
Implement deployment strategies including canary, blue-green, and staged rollouts
Improve cluster security, reliability, autoscaling, and cost efficiency
Manage containerized workloads and production environments
CI/CD & Developer Productivity
Build and maintain CI/CD pipelines using GitHub Actions and related tooling
Improve deployment automation and release management
Support development teams with platform tooling and operational best practices
Automate operational workflows and infrastructure provisioning
Observability & Reliability Engineering
Design monitoring, alerting, and logging systems
Improve visibility across services, infrastructure, APIs, and deployments
Drive incident response, root-cause analysis, and operational improvements
Establish reliability best practices including SLOs, SLIs, error budgets, and service ownership
Security & Compliance
Implement security best practices across cloud infrastructure
Manage IAM policies, secrets management, certificates, and access controls
Support SOC 2, HIPAA, and enterprise security requirements
Improve auditability and operational security posture
Collaboration
Work closely with AI engineers, backend engineers, robotics engineers, and product teams
Help define cloud architecture and infrastructure strategy
Support deployment and scaling of production healthcare systems
Required Qualifications
5+ years of DevOps, Platform Engineering, Cloud Infrastructure, or SRE experience
Strong hands-on experience with Google Cloud Platform (GCP)
Strong Kubernetes experience in production environments
Strong experience with Docker and containerized workloads
Experience with Terraform or similar Infrastructure-as-Code tools
Experience building and maintaining CI/CD pipelines
Strong Linux systems administration skills
Strong networking fundamentals (VPCs, VPNs, DNS, firewalls, load balancing)
Experience with monitoring and observability platforms
Strong scripting or programming skills in Python and/or Bash
Experience operating production systems with high availability requirements
Preferred Qualifications
Experience with GKE at scale
Experience with Cloud Run, Pub/Sub, and Cloud SQL
Experience with GitOps tools such as ArgoCD
Experience supporting AI/ML infrastructure
Experience with GPU workloads
Experience with distributed systems and event-driven architectures
Experience with SOC 2 or HIPAA compliance
Experience with edge computing or IoT deployments
Experience supporting NVIDIA Jetson or Linux-based edge devices