| Role | Site Reliability Engineer
| Required Technical / Functional Skills | ? Experience with modern container orchestration systems: Kubernetes, Mesos, DC/OS, Swarm ? Experience with infrastructure configuration and automations processes and tools: Terraform, Puppet, Ansible, Chef, Fabric ? Experience with security in the cloud: Intrusion, penetration, and vulnerability scanning ? Experience with monitoring solutions: ELK, Splunk, SUMO, Nagios, Prometheus, Dynatrace, Sitescope ? Experience with ServiceNow or other modern workflow systems ? Experience with Change Management processes and functions ? Experience with various data technologies including relational and non-relational databases and message queues ? Good working knowledge of build automation and continuous integration/delivery ecosystem: Git, Gitlab, Gerrit, Maven/Gradle, Jenkins, Docker, Nexus, Artifactory, Selenium
Responsibilities:
Required Skills ? Software development experience in Java ? Deep understanding of Linux systems ? Hands on experience with cloud infrastructure such as AWS, Google compute, Azure ? Deep expertise in monitoring distributed systems application architectures ? Exposure to and maintenance of configuration management and orchestration tools at scale ? Diagnosing and troubleshooting user facing service outages ? Exposure to system and application level telemetry for large distributed architectures ? Diagnosing and resolving problems in high-throughput web applications and network services ? Expert level troubleshooting skills across different levels of the stack ? Bachelor's degree in Computer Science, Engineering, or Information Systems or any equivalent combination of experience, education, and/or training in the computer systems engineering field.
| Desired Technical / Functional Skills | Java/J2EE Puppet, Kubernetes, Splunk, Nagios, Git