Role: AWS/DevOps Engineer or Site Reliability Engineer (9893251)
Length: 6-month contract
Location: 4555 Mansell Road, Alpharetta, GA 30022, United States
Need only locals to
and No OPT and CPT
Description:
· Looking for a highly motivated Site Reliability Engineer who is capable of building and running large-scale, massively distributed, fault-tolerant systems. The individual will work with teams across the organization to ensure the reliability of core services and keep an eye on capacity and performance.
· Responsible for blameless postmortems and proactive identification of potential outages, which factor into iterative improvement.
· Experience in designing and deploying multi-data-center, large-scale web applications.
· Work closely with dev and ops teams to build highly available, cost-effective systems.
· Create new tools and scripts designed for auto-remediation of incidents.
· Design and implementation of Big Data technologies, including Hadoop, MongoDB, Kafka, RabbitMQ, ZooKeeper, Spark, ELK, etc.
· Responsible for establishing end-to-end monitoring and alerting on all critical aspects to ensure SLAs are met and to get proactive notifications of possible issues for all systems.
· Design platforms that meet extremely high uptime metrics.
· Works well independently and requires little or no supervision.
· Work with the cloud operations team to resolve trouble tickets, develop and run scripts, and troubleshoot.
· Fully understand the application and its microservice interactions.
· Design and implement containers/applications in scalable HA/DR multi-tier cloud environments, including new system design, documentation, implementation, and deployment.
· Participate in a 24x7 on-call rotation.
Job Requirements:
· 7+ years of experience in the following areas:
· Experience in providing L4 technical support for production, 24x7.
· Strong experience in production support and operations.
· Design and implementation of network and presentation tier technologies, including F5, Apache, Nginx, etc.
· Experience in performance testing, tuning, and monitoring; maximizing system uptime and availability; and ensuring functional and performance SLAs.
· Experience with monitoring application/infrastructure performance and availability.
· Automation experience with build/deployment, software configuration, continuous integration, continuous delivery, and release engineering tasks in Java EE/C++ environments.
· Experience in automating manual processes using Python, Ruby, Unix shell (bash, ksh), Perl, Ant, etc.
· Installing, configuring, administering, and tuning Java EE application servers/containers such as Tomcat, WebSphere, etc.
· Installing, maintaining, and administering software on Unix, Linux, and Windows servers.
· Experience with web service technologies, including REST, SOAP, JSON, and XML.
· Experience with cloud platforms and virtualization technologies.
· Deploying and automating infrastructure/applications in cloud environments using Chef, RPM, etc.
· Working closely with Development, QA, Product Management, and Production Ops teams to make sure product releases ship on time and with quality.
· Hands-on experience configuring and administering SCM (Git, SVN), build (CMake, Makefiles, Maven), CI (Jenkins), and CD automation tools.
· Experience with database (RDBMS, NoSQL) technologies is a plus.
· Experience with performance testing is a plus.
· Configuring and maintaining SDLC environments.
· Experience in Agile methodologies and processes.
· Strong automation and problem-solving skills, and the ability to follow through to completion.
· Demonstrated leadership skills through a variety of activities, including leading or mentoring technical staff.
· Strong verbal/written communication skills.
Thanks,
Ram Krishna M
ASAP Solutions Inc.