Ziad H

@ziad_hsn

Site Reliability Engineer

Egypt
English, Arabic
About me
I am a Site Reliability Engineer with over 4 years of experience building and operating high-availability distributed systems on AWS and GCP, with a strong focus on observability and automation. I have hands-on experience diagnosing production incidents and managing large-scale systems....

Services

DevOps Consulting
I will debug production incidents, Kubernetes, and CI/CD pipelines

Work experience

Senior SRE / DevOps Engineer

PixelogicMedia • Full-time

Sep 2022 - Apr 2025 • 2 yrs 7 mos

On-call rotation lead for large-scale distributed infrastructure. Diagnosed and resolved production incidents including storage failures, performance degradation, and TLS issues.

Key achievements:
- Significantly reduced security-related downtime through IAM key rotation automation (Lambda, Step Functions, CloudFormation)
- Optimized Elasticsearch storage costs through ILM hot-warm-cold tiering and shard tuning
- Built custom alerting system with state tracking and low memory footprint
- Led zero-downtime log collection migration across multiple nodes

Site Reliability Engineer

Capiter • Full-time

Oct 2021 - Sep 2022 • 11 mos

SRE focused on CI/CD pipelines and environment design for Kubernetes clusters.
- Built CI/CD pipelines for automated deployment across multiple services
- Designed staging environment from scratch using Helm and namespace isolation
- Integrated security scanning with auto-fail on critical vulnerabilities
- Optimized container builds with multi-stage builds and caching