SRE Full-Stack Engineer

Chennai
4–7 Years
Domain: Fintech | Cloud-native | Microservices
Role Summary
  • We are looking for an SRE Full-Stack Engineer who is equally comfortable writing application code and improving platform reliability
  • This role focuses on building reliability into the system, not just operating it
Reliability-Driven Development
  • Write production-grade code (Java / Node) with reliability in mind
  • Embed observability, metrics, and logging into services
  • Improve service performance, fault tolerance, and error handling
Platform & Tooling
  • Build internal tools and dashboards for monitoring and diagnostics
  • Contribute to CI/CD improvements and release safety mechanisms
  • Implement health checks, readiness probes, and resilience patterns
Observability
  • Design and improve monitoring using metrics, logs, and traces
  • Build dashboards and alerts aligned with SLOs
  • Help reduce alert noise and false positives
Production Support
  • Participate in on-call rotations
  • Troubleshoot production issues across application, database, and infrastructure layers
  • Contribute to root cause analysis and post-incident improvements
Required Skills & Experience
  • 4–7 years of experience in backend or full-stack development
  • Strong hands-on experience with Java and/or Node.js
  • Experience building REST APIs and microservices
  • Working knowledge of Docker and Kubernetes
  • AWS fundamentals
  • PostgreSQL and MongoDB
  • Strong debugging and problem-solving skills
Nice to Have
  • React experience for internal tools or dashboards
  • Exposure to distributed tracing (OpenTelemetry, etc.)
  • Prior experience working with SRE or platform teams