full time management tech

Job Details

The Manager, Site Reliability Engineering (SRE) will be working with the development & operations teams, focusing on AWS and Azure infrastructure and automation. This role is responsible for the day-to-day operations of the DevOps team and combines a mix of project management, team management, and engineering duties. As a Press Ganey SRE Manager you will have the opportunity to work with likeminded and capable engineers and managers from across the organization to help drive innovation and capability. The DevOps team are subject-matter experts within Press Ganey and provide insight and engineering advice to development and product teams, helping to shape requirements and craft solutions to complex problems.

What You’ll Do:

  • Act as primary point of contact on all cloud infrastructure projects
  • Work collaboratively with software engineering to define infrastructure and deployment requirements; be a sounding board and provide recommendations for engineering around AWS services
  • Be... the driving force behind our automation and observability initiatives
  • Train and mentor the DevOps team
  • Build and maintain operational tools for deployment, monitoring, and analysis of AWS infrastructure and systems
  • Perform infrastructure cost analysis and optimization
  • Provide project management, sprint planning, and road-mapping support to the DevOps team

What You’ll Bring:

  • Insatiable desire to learn and grow; curiosity about all things technology, development, operations, and cloud
  • At least 5 years of experience designing, building and maintaining AWS infrastructure (VPC, EC2, Security Groups, IAM, ECS, CloudFront, S3, MSK, CloudWatch, ACM)
  • Hands-on experience deploying and managing infrastructure with Terraform
  • Hands-on experience with configuration management tools like Ansible, Chef, Puppet, or SaltStack
  • Hands-on experience with devops tools such as Docker, Git, Consul, Nomad, Vault, Jenkins, and Grafana
  • Hands-on experience with monitoring and logging tools such as New Relic, Instana, AppDynamics, Datadog, Prometheus, Splunk, ELK, CloudWatch, and CloudTrail
  • Experience with and knowledge of cloud native architectures; ability to design highly available, resilient, multi-region systems in AWS
  • Experience with and deep knowledge of Linux systems
  • Strong bias for action and ownership

See something wrong with this listing?

Contact support