Site Reliability Engineer

Site Reliability Engineer (TSYSJP00004200) 
Location: Jacksonville, FL 32256
Duration: 6 months
 
Description:
 
As a Site Reliability Engineer at iMobile3 a TSYS Company, you’ll be helping us ensure that our critical services are battle tested. This role will require a generalist who can contribute to development, design & architecture, system operations, resiliency testing, security hardening, and performance engineering. 
 
A SRE’s responsibilities include: 
Chaos engineering - you’re expected to think laterally about how our systems might fail in theory, design tests to demonstrate how they behave in practice, and then formulate and implement remediation plans, as appropriate. 
Pushing our systems to their limits, and then coming up with designs for how to get them to the next performance tier. 
Safeguarding reliability. Ensuring that our services are highly available, resilient against disasters, self-monitoring, and self-healing. 
Running “game days” to test assumptions about reliability and learn what will break before it matters to customers. 
Reviewing designs with an eye toward increasing the holistic stability of our platform and identifying potential risks. 
Building systems to proactively monitor the health, performance and security of our production and non-production virtualized infrastructure. 
Improving our monitoring and alerting systems to make sure engineers get paged when it matters (and don’t get paged when it doesn’t). 
Troubleshooting systems and network issues, alongside our Technical Operations Team. 
Mentoring other engineers in reliability-related skills. 
Evolving our SDLC, practices, and tooling to account for Site Reliability considerations and best practices. 
Developing runbooks and improving documentation. 
No prior experience with payments is necessary. 
 
A qualified candidate should be experienced in the following areas: 
 
Senior level software engineer experience, or higher. A qualified applicant has been a software developer on previous software engineering teams, and wants to continue writing code in their new role.
Experience building and maintaining high-availability applications including redundancy, fail over, scalability, monitoring and performance. 
Proficiency in coding in either C# or Java - our platform is built in C#, but we'll train you if you've got a Java background. 
Demonstrated experience in deploying, managing, and operating scalable infrastructure in a public cloud, preferably Azure. 
Expertise in Operating Systems, Networking, and Database concepts. 
Experience with .NET stacks running in the cloud. 
Experience writing RESTful API services deployment and maintenance. 
Service Oriented or microservice architectures. 
Experience with virtualization, monitoring and automation. 
Hands on experience with orchestration and system configuration tools such as Salt, Ansible, Fabric, Puppet, Chef, Terraform, etc. 
Load balancing, storage, and clustering technologies. 
Linux and/or Windows System Administration. 
Systems and Network Engineering. 
Continuous Integration tools (eg. Jenkins, Hudson, TeamCity, Bamboo)
 
Thanks, 
Amit Sehdev
APN Software Services Inc.
Direct: 510-402-1061 | Fax: 510-623-5055 | Amit@apninc.com

Want to apply later?

Type your email address below to receive a reminder

Apply to Job

ErrorRequired field
ErrorRequired field
ErrorRequired field
Error
Error
insert_drive_file
insert_drive_file