Company Name:

Avaya

Location:

Seattle, WA

Approximate Salary:

Not Specified

Date Posted:

April 7, 2019

Site Reliability Engineer

Job Description

About Employer

Employer enables the mission critical, real-time communication applications of the world s most important operations. As a global leader in delivering superior communications experiences, Employer offers a complete portfolio of software and services for contact center and unified communications offered on premises, in the cloud, or a hybrid. Today s digital world requires communications enablement, and no other company is better positioned to do this than Employer. For more information, please visit www. avaya. com.

Job Information

As the Site Reliability Engineer (SRE) on the SRE team, you have a unique opportunity to enable the Employer Cloud organization to adopt a cloud native infrastructure and application deployment methodology. On an on-going basis, this position will identify areas of improvement to the technical platform which developers utilizes to deliver and operate Employer Cloud application products. This position will focus on speed of delivery, reliability, security, and availability of services for the organization. A deep technical proficiency in both enterprise-scale systems as well as next generation cloud native applications is required.


Short Description


  • Instrument code, build tools and dashboards to help visualize and understand real-time system health, usage, and performance metrics.
  • Oversee the infrastructure and service health monitoring process to enable proactive issue mitigation and expedited issue resolution.
  • Design, manage, and maintain internal tools to support engineering, operations, research and/or support processes.
  • Troubleshoot and resolve issues in our development, test and production environments.
  • Work with the platform team to identify and fix software/system performance bottlenecks and stability issues.
  • Contribute to overall system scalability to ensure Employer Cloud s ability to deliver highly reliable services
  • Understand, implement, and automate security controls, governance processes, and compliance validation.
  • Oversee the continuous integration and deployment (CI/CD) toolchain to ensure that the system code for our high availability, mission critical cloud platform that supports all core Employer Cloud products and services is reliably tested and predictably released.
  • Stay up-to-date on relevant technologies, plug into user groups, understand trends and opportunities to ensure we are using the best possible techniques and tools.

Education and Experience


  • Strong background in Linux/Unix administration and scripting
  • Extensive experience managing and configuring public cloud providers using CFN or Terraform, specifically AWS
  • Experience with Docker
  • Experience with Kubernetes, EKS or ECS
  • Knowledge of Helm or similar tool
  • Experience with monitoring and analytics using Datadog or similar
  • Experience with configuring and maintaining Jenkins and Jenkins Pipeline or similar (Gitlab CI/CD Runners)
  • Experience with Aggregating logging platforms such as Elasticsearch, Logstash and kibana
  • Knowledge of best-practice security and networking techniques for public facing systems
  • Strong experience with MySQL and related database technologies
  • Knowledge of best practices and IT operations in an always-up, always-available mission critical service
  • Experience writing code in Python and/or Go

GLDR *li-post



  • Strong background in Linux/Unix administration and scripting
  • Extensive experience managing and configuring public cloud providers using CFN or Terraform, specifically AWS
  • Experience with Docker
  • Experience with Kubernetes, EKS or ECS
  • Knowledge of Helm or similar tool
  • Experience with monitoring and analytics using Datadog or similar
  • Experience with configuring and maintaining Jenkins and Jenkins Pipeline or similar (Gitlab CI/CD Runners)
  • Experience with Aggregating logging platforms such as Elasticsearch, Logstash and kibana
  • Knowledge of best-practice security and networking techniques for public facing systems
  • Strong experience with MySQL and related database technologies
  • Knowledge of best practices and IT operations in an always-up, always-available mission critical service
  • Experience writing code in Python and/or Go

GLDR *li-post


Experience

7 - 10 Years of Experience

Education

Bachelor degree or equivalent experience

Advance Degree preferred

Preferred Certifications

Apply Now