Open Role /Senior Java Site Reliability Engineer/ McLean, VA (Hybrid), 15+ years only

2 views

Skip to first unread message

unread,

Jun 15, 2026, 10:09:47 AM (3 days ago) Jun 15

to s...@ekfrazo.com

Hello,

Hope you are doing well.

Please let me know your interest for the below role. If interested, kindly share with me your updated resume to move forward.

Role: Senior Java Site Reliability Engineer

Exp: 16-20 Years

Job Type: Contract

Project: Hybrid

Location: McLean, VA

Industry: Banking / Financial Services

Key Responsibilities

Support and maintain highly available production platforms across cloud and distributed environments. Drive incident management, root cause analysis, problem management, and platform stability initiatives.
Monitor and maintain uptime of Java applications and microservices.
Proactively identify and resolve application performance bottlenecks.
Conduct root cause analysis (RCA) for application outages and incidents.
Implement resiliency patterns including circuit breakers, retries, and failover mechanisms.
Lead reliability engineering efforts focused on system availability, performance optimization, and operational excellence. Implement and enhance observability solutions including monitoring, logging, alerting, and incident response automation.
Collaborate with development, infrastructure, and cloud engineering teams to improve deployment reliability and operational efficiency. Support infrastructure modernization, cloud transformation, and platform automation initiatives.
Coordinate disaster recovery testing, resiliency validation, capacity planning, and production readiness reviews. Provide technical leadership and mentor offshore/onshore engineering teams.

Required Experience

16–20 years of experience in Site Reliability Engineering (SRE), Production Engineering, Platform Engineering, or Application Support.
Strong experience supporting large-scale enterprise production environments. Proven background in incident management, problem management, and operational support.
Experience working within banking, financial services, fintech, or other highly regulated industries. Hands-on experience supporting mission-critical applications with stringent availability and performance requirements.

Required Skills

Java
Linux/Unix Administration
Kubernetes and Container Platforms
Docker
Cloud Platforms (AWS, Azure, or GCP)
CI/CD Tools (Jenkins, GitHub Actions, GitLab CI/CD, ArgoCD)
Infrastructure as Code (Terraform, Ansible)
Monitoring & Observability Tools (Splunk, Datadog, Grafana, Prometheus, Moogsoft)
ServiceNow, JIRA, Confluence
Python, Bash, or Shell Scripting
SQL and Database Troubleshooting
Application Performance Monitoring (APM)
Production Release Management
Disaster Recovery and High Availability Architectures

Education

Bachelor's degree in Computer Science, Information Systems, Engineering, or a related technical discipline. Top of FormBottom of Form

Thanks & Regards

Sai Kumar

Talent Head - Ekfrazo Technologies LLC

1603 Capitol Avenue Suite 413-A, Cheyenne, WY 82001

USA | India | South Africa | Nigeria

Reply all

Reply to author

Forward

0 new messages