Senior Python Engineer – Temporal.io (AWS ECS/EKS)
Budget: $15.0 - $15.0
HOURLY / FULL_TIME
⭐ 0.00 (0)
Pakistan
python, amazon-web-services
We are looking for a highly experienced Python engineer with strong hands-on expertise in Temporal.io to help design, implement, optimize, and operate distributed workflow systems in production.
This is a contract role for a project involving workflow orchestration, fault-tolerant background processing, and cloud-native deployment on AWS infrastructure.
Required Expertise
Strong Python development experience (5+ years preferred)
Extensive hands-on experience with Temporal.io
Deep understanding of:
- Workflows
- Activities
- Task Queues
- Workers
- Retry Policies
- Signals & Queries
- Child Workflows
- Workflow Versioning
Experience deploying and operating Temporal workloads on:
- AWS ECS
- AWS EKS (Kubernetes)
Experience with Docker and containerized deployments
Strong knowledge of distributed systems concepts:
- Idempotency
- Event-driven architecture
- Fault tolerance
- Resilience patterns
- Workflow recovery
- Observability
Responsibilities
- Design and implement Temporal workflows using the Python SDK
- Build scalable, reliable background processing systems
- Deploy and manage Temporal workers on ECS/EKS
- Optimize workflow performance and worker scalability
- Troubleshoot workflow execution, retries, and failure scenarios
- Implement monitoring, logging, and alerting
- Review architecture and recommend best practices
- Collaborate with our engineering team on system design and implementation
Screening Questions
Please answer the following in your proposal:
Describe a production system you built using Temporal.io.
How many workflows and workers were running in production?
Have you deployed Temporal workers on ECS, EKS, or both?
Have you managed Temporal Server yourself, or only used Temporal Cloud?
How do you handle workflow versioning and backward compatibility?
What monitoring and observability stack did you use with Temporal?
Share examples of complex workflows involving retries, signals, or child workflows.
What is the largest scale Temporal deployment you have worked on?