Site Reliability Engineer
Observability/Site Reliability Engineer
Malvern PA Duties and responsibilities: As an Observability/Site Reliability Engineer you will have the opportunity to put your operational savvy-ness and engineering skills to work! You'll be partnering with multiple Teams on the job, ensuring the "-ilities" (Availability, Reliability, Scalability, Usability; etc.) of GIFS systems in both test and production environments. Additionally, you can anticipate working with real-time monitoring, diagnostic data, and analyze trends. As a caretaker of these systems, you'll collaborate and plan activities with GIFS Technical Leads to ensure that application service level objectives are met What it takes: The ideal candidate will have: A holistic view of and genuine appreciation for reliability, borne of real-world experience operating production services: • Examples of using software engineering and SRE practices to solve operational problems • A background in software engineering and can confidently collaborate with engineers to identify and resolve issues • Outstanding interpersonal skills and can build strong relationships with your inclusive communication methods • Examples of working in distributed teams • 3+ years' experience in software development • Knowledge of public cloud environments AWS is a plus • Working experience of Splunk, Cloudwatch,Honeycomb, AWS X-Ray, Grafana • Experience creation of monitors and alerts in Splunk, HoneyComb, X-Ray • 2+ years' experience utilizing Agile methodologies | ||||