SRE Full-Stack Engineer
Chennai
4–7 Years
Domain: Fintech | Cloud-native | Microservices
Role Summary
- We are looking for an SRE Full-Stack Engineer who is equally comfortable writing application code and improving platform reliability
- This role focuses on building reliability into the system, not just operating it
Reliability-Driven Development
- Write production-grade code (Java / Node) with reliability in mind
- Embed observability, metrics, and logging into services
- Improve service performance, fault tolerance, and error handling
Platform & Tooling
- Build internal tools and dashboards for monitoring and diagnostics
- Contribute to CI/CD improvements and release safety mechanisms
- Implement health checks, readiness probes, and resilience patterns
Observability
- Design and improve monitoring using metrics, logs, and traces
- Build dashboards and alerts aligned with SLOs
- Help reduce alert noise and false positives
Production Support
- Participate in on-call rotations
- Troubleshoot production issues across application, database, and infrastructure layers
- Contribute to root cause analysis and post-incident improvements
Required Skills & Experience
- 4–7 years of experience in backend or full-stack development
- Strong hands-on experience with Java and/or Node.js
- Experience building REST APIs and microservices
- Working knowledge of Docker and Kubernetes
- AWS fundamentals
- PostgreSQL and MongoDB
- Strong debugging and problem-solving skills
Nice to Have
- React experience for internal tools or dashboards
- Exposure to distributed tracing (OpenTelemetry, etc.)
- Prior experience working with SRE or platform teams