Website Femploy
Femploy by Infinite Code is recruiting for two Senior Site Reliability Engineers (SRE) to build the
reliability foundations for a high-impact, fast-scaling platform in New York City (Hybrid).
This is a premier “build-from-scratch” opportunity for an infrastructure specialist to define the SRE
culture of a rapidly growing firm. We are looking for AWS experts with 5+ years of experience who
can transition a platform from thousands to millions of users by implementing sophisticated
Observability, Terraform-based IaC, and blameless incident cultures.
What You’ll Do
*Reliability & Incident Management *
• Lead incident response and establish sustainable on-call practices
• Create runbooks and drive blameless postmortems
• Reduce MTTR through systematic improvements
Observability & Monitoring
• Build and maintain self-service observability systems
• Implement monitoring solutions that provide actionable insights
• Enable faster debugging and performance optimization
Infrastructure & Scalability
• Design and manage infrastructure-as-code (Terraform, CloudFormation)
• Architect scalable, secure AWS environments
• Improve reliability of databases, async workflows, and data pipelines
*CI/CD & Deployment *
• Partner with DevX to build robust CI/CD pipelines
• Implement advanced deployment strategies (blue/green, canary)
• Enable fast, safe, and reliable releases
Cross-Team Collaboration
• Work closely with engineering teams to embed reliability early in design
• Advocate for SRE best practices across the organization
Core Requirements
• 5+ years in SRE/DevOps OR 7+ years in Software Engineering (infrastructure-focused)
• Strong experience leading incident response & root cause analysis
• Expertise in designing high-availability systems
• Deep knowledge of AWS and infrastructure-as-code (Terraform preferred)
• Hands-on experience with CI/CD pipelines and automation
Key Skills
• Experience with tools like Datadog, Prometheus, ELK
• Ability to design monitoring systems that drive actionable insights
Nice to Have
• Strong communication and documentation skills
• Experience working in fast-scaling, high-growth environments
• Background in high-performance engineering cultures
• Evidence of initiative (side projects, startups, rapid career growth)
Why Join Us
• Opportunity to build SRE practices from the ground up
• Work on real scaling challenges (infra, data, reliability)
• High ownership and impact on system architecture
• Fast-paced, engineering-driven environment
How to Apply
Send your CV and a short summary of your experience to:
nelton@femploy.co.za
Only shortlisted candidates who meet the requirements and are available immediately will be
contacted.
To apply for this job please visit wellfound.com.
All job content is user‑submitted or from public web sources; web4.career is not liable for its accuracy or source.