Senior SRE, Software Engineering

  • Full Time
  • New York City
  • 205K - 225K USD / Year

Website Femploy

Femploy by Infinite Code is recruiting for two Senior Site Reliability Engineers (SRE) to build the
reliability foundations for a high-impact, fast-scaling platform in New York City (Hybrid).

This is a premier “build-from-scratch” opportunity for an infrastructure specialist to define the SRE
culture of a rapidly growing firm. We are looking for AWS experts with 5+ years of experience who
can transition a platform from thousands to millions of users by implementing sophisticated
Observability, Terraform-based IaC, and blameless incident cultures.

What You’ll Do

*Reliability & Incident Management *

• Lead incident response and establish sustainable on-call practices

• Create runbooks and drive blameless postmortems

• Reduce MTTR through systematic improvements

Observability & Monitoring

• Build and maintain self-service observability systems

• Implement monitoring solutions that provide actionable insights

• Enable faster debugging and performance optimization

Infrastructure & Scalability

• Design and manage infrastructure-as-code (Terraform, CloudFormation)

• Architect scalable, secure AWS environments

• Improve reliability of databases, async workflows, and data pipelines

*CI/CD & Deployment *

• Partner with DevX to build robust CI/CD pipelines

• Implement advanced deployment strategies (blue/green, canary)

• Enable fast, safe, and reliable releases

Cross-Team Collaboration

• Work closely with engineering teams to embed reliability early in design

• Advocate for SRE best practices across the organization

Core Requirements

• 5+ years in SRE/DevOps OR 7+ years in Software Engineering (infrastructure-focused)

• Strong experience leading incident response & root cause analysis

• Expertise in designing high-availability systems

• Deep knowledge of AWS and infrastructure-as-code (Terraform preferred)

• Hands-on experience with CI/CD pipelines and automation

Key Skills

• Experience with tools like Datadog, Prometheus, ELK

• Ability to design monitoring systems that drive actionable insights

Nice to Have

• Strong communication and documentation skills

• Experience working in fast-scaling, high-growth environments

• Background in high-performance engineering cultures

• Evidence of initiative (side projects, startups, rapid career growth)

Why Join Us

• Opportunity to build SRE practices from the ground up

• Work on real scaling challenges (infra, data, reliability)

• High ownership and impact on system architecture

• Fast-paced, engineering-driven environment

How to Apply

Send your CV and a short summary of your experience to:
nelton@femploy.co.za

Only shortlisted candidates who meet the requirements and are available immediately will be
contacted.

Labeled as: ,

To apply for this job please visit wellfound.com.

All job content is user‑submitted or from public web sources; web4.career is not liable for its accuracy or source.

Post a job Contact
SHARE
TOP
The AI Era Has Arrived !