← Missions

AWS Cloud and DevOps Engineer needed to operate and innovate on large scale systems

Budget: - HOURLY / FULL_TIME ⭐ 4.94 (18974) United States

linux, ubuntu, ansible, git, configuration-management, amazon-web-services, devops, cicd, amazon-ec2, linux-system-administration, docker, kubernetes, github

Infrastructure Engineer Role Overview We are looking for an experienced Infrastructure Engineer who can architect, implement, and maintain scalable, secure, and reliable infrastructure. You will lead the setup of new greenfield builds, modernize our CI/CD ecosystems, and drive the transition to cloud-native architectures. You will collaborate across teams to ensure operational excellence, security, reliability, observability, and cost-efficiency while fostering a culture of technical innovation and collaboration. Responsibilities Architect & Modernize: Design and implement scalable AWS infrastructure and drive the modernization of Infrastructure-as-Code (IaC) using terraform enterprise/terraform cloud and Configuration-as-Code (CaC) systems such as chef/ansible to enhance maintainability and scalability. CI/CD Leadership: Oversee the development and maintenance of CI/CD pipelines, specifically driving consolidation and migration to high-performance, standardized pipeline platforms like Tekton. Container Orchestration: Lead the migration of services to EKS, ensuring high availability and optimal performance for containerized workloads. Service Mesh Management: Define and maintain robust Service Mesh architectures to manage service-to-service communication, security (mTLS), and traffic routing. Observability & Reliability: Define enterprise-wide observability strategies (logging, monitoring, distributed tracing) and implement Disaster Recovery (DR) and Business Continuity (BCP) plans to meet strict RTO/RPO objectives. Governance & Security: Architect enterprise-level Identity and Access Management (IAM) solutions and enforce Zero Trust principles across the infrastructure footprint. Cross-Functional Collaboration: Partner with engineering, data science, and security teams to meet business demands, maintain compliance, and foster a culture of technical excellence through mentorship. Audit / SOX: Must have working knowledge across audit and compliance and how various IT Tools and Technologies are integrated and work together to enhance enterprise security and compliance. What It Takes To Catch Our Eye Advanced IaC Expertise: Strong proficiency in Terraform with a history of re-architecting complex IaC systems at scale. Ability to work with enterprise grade terraform, optimize widely used modules and work with drifts without causing outages. AI Fluency: Need AI fluency in order to create repeatable context and prompt templates and use AI in a reliable manner. Select models efficiently based on tasks with appropriate context. Not just ad hoc vibe coding and talking about AI. Knowledge of Agentic AI will be beneficial. Kubernetes (EKS) Proficiency: Expert-level experience in container orchestration, specifically deploying and managing EKS at scale and migrating services from legacy systems (e.g., ECS/EC2). CI/CD Pipeline Design: Proven experience designing and managing Tekton pipelines or similar modern CI/CD ecosystems to improve developer velocity and pipeline reliability. Service Mesh Mastery: Deep understanding of Service Mesh technologies (e.g., Istio, Envoy) to manage microservices communication, traffic splitting, and observability. Networking & Zero Trust: Extensive experience in network architecture, including firewall management, load balancing, and implementing Zero Trust security models. Observability: In-depth knowledge of monitoring and distributed tracing frameworks (ELK, Opensearch, Prometheus, Grafana, Victoria logs, Redash, Pager Duty, Kafka). FinOps & Data Persistence: Demonstrated experience managing cloud spend on AWS and designing high-availability data persistence layers (SQL/NoSQL). Programming Skills: Strong scripting and automation skills in Python, Go, and Bash. Leadership: Excellent communication abilities with a history of mentoring distributed teams and driving large-scale infrastructure projects.
Ouvrir sur Upwork