n
nithishraju2699

Nithish S

@nithishraju2699

Site Reliability Engineer

India
English
About me
I am a Platform-focused Site Reliability Engineer with 4 years of experience operating large-scale, microservices-based production systems. I specialize in building and operating highly available, cloud-native platforms. I have led SEV-1 and SEV-2 incidents and reduced MTTR by ~25%.... Read more

Skills

n
nithishraju2699
Nithish S
Offline • 
Average response time: 1 hour

See my services

DevOps Containerization
I will provide devops and sre support
Infra as Code
I will automate devops and sre tasks using python and iac

Work experience

Tata_Consultancy Services

Site Reliability Engineer

Tata Consultancy Services • Full-time

Mar 2022 - Present4 yrs 2 mos

Platform SRE supporting Tier-1 banking applications running on Kubernetes (OpenShift/GCP). Owned platform reliability and availability for production-grade, microservices- based systems. Acted as Incident Commander for SEV-1 / SEV-2 incidents, leading end-to-end incident response and cross-team coordination. Reduced MTTR by ~25% through structured incident response, improved alerting, runbooks, and deep observability. Defined and operationalized SLIs/SLOs and built proactive monitoring and alerting using Splunk, New Relic, and ITRS Geneos, sustaining ~99% availability. Conducted blameless postmortems and root cause analysis as part of ITIL Problem Management. Established platform-level change and release workflows using GitOps and CI/ CD pipelines, aligning DevOps automation with ITIL Change Management. Improved deployment success rate by ~15% through controlled rollouts and rapid rollback strategies. Automated infrastructure provisioning using Terraform (IaC). Reduced operational toil by ~30% through automation and self-service workflows while owning 24/7 on-call for mission-critical systems. Investigated backend failures and Oracle SQL transaction issues, restoring data integrity and service functionality.