Sr. DevOps Engineer
Ref No.: 18-01215
Location: Philadelphia, Pennsylvania
Sr. DevOps Engineer in Philadelphia, PA 19103

Interview Logistics:
  • Phone Interview
  • F2F Interview Required - NO SKYPE

Required Skill Set:
Years of Experience: 8+
Education Required: Bachelor's Degree or Equivalent Work Experience
Qualifications:
  • 5+ years in a Site Reliability role, development operations role, or closely related position
  • Experience administering Linux systems in a production environment
  • Programming experience in one or more of the following languages: Go, Ruby, Java, Python, Shell
  • Bachelor's Degree in Computer Science or a related field, or relevant work experience
Additional Skills:
  • Ability to dive deep into complex technical problems
  • Experience with configuration management tools such as Ansible, CFEngine, Chef and Puppet
  • Experience building automation tools (for example with Packer, Ansible, or Terraform) that cover building, testing, releasing, monitoring and alarming
  • Excellent problem-solving skills with strong attention to detail
  • Experience with distributed version control systems such as Git or Mercurial
  • Experience with IaaS and PaaS providers such as AWS, OpenStack, Heroku, and CloudFoundry
  • A sense of ownership, initiative, and drive
  • Experience with enterprise monitoring solutions like AppDynamics, Graphite, Nagios, and Splunk
  • Familiarity with continuous integration/deployment processes and tools such as Artifactory, Gerrit, Git, Jenkins, Maven and Nexus

Project Description:
Development
  • Build tools and alarms that warn of potential problems or customer issues
  • Adapt what exists and build what doesn't to scale the system
  • Build tools and develop processes for continuous integration and delivery of services
  • Obsess over collecting and digesting metrics
  • Build and drive the automation systems that maintain system health
Site Reliability Engineering / Operations
  • Root-cause complex problems involving multiple parties, networks, hardware and software that relate to scaling and performance
  • Participate in the on-call rotation
  • Engender reliability and availability starting with metrics and measurements
  • Enable scaling by providing tools, developing training or augmenting processes
  • Secure the system from issues, be they real, perceived or notional

Physical Environment and Working Conditions:
Must be able to work on site in Philadelphia, PA