At PayIt, we have a big vision to simplify the way that everyday citizens interact with government agencies. We are rapidly expanding and looking for engineers who can help us build a
world-class Gov-Tech platform.
The Site Reliability Engineering (SRE) Team is our critical link between applications and the infrastructure platform, building infrastructure as code and partnering with our application development teams to make their services more observable, scalable and reliable.
Job Responsibilities:
• You will own efforts from sprint planning to delivery and be expected to take on new problems to continually push our technology forward.
• Design, develop and implement solutions that improve the stability, scalability, availability, reliability, observability, and latency of PayIt products services.
In addition:
• Design, build and operate (cloud) infrastructure to enable reliable and rapid deployment of microservices with effective monitoring and resilient operations.
• Set up critical infrastructure, develop tools and framework to automate operational tasks, deployment of machines, services, and applications.
• Collaboratively represent SRE in design reviews while working cross functionally with Engineering teams on operational readiness and compliance qualifications.
• Identify and drive opportunities to improve automation for code deployment, management, and visibility of application services.
• Collaborative partner in end-to-end monitoring and alerting on all critical components of PayIt products while sharing the on-call rotation and be an escalation contact for incidents.
Job Requirements:
· Experience with the development and operation of high-traffic backend systems
· Ability to diagnose and troubleshoot complex distributed systems handling high volume transactions with troubleshooting skills that span application, OS, networking (TCP/IP), and system layers.
· Strong experience relational and non-relational DB’s.
· Familiarity building a CI/CD pipeline (Git, Jenkins, Kubernetes, Docker, etc.) leveraging configuration management tools such as AWS CloudFormation, Ansible, Chef, Puppet, Terraform, etc.
· Proficiency with a programming language like Python, Go and shell scripting to automate tasks.
· Excellent problem solving, critical thinking, communication, and teamwork skills
· Excellent written and verbal communication skills, and the ability to work on collaboratively.
· Self-disciplined, self-managed, self-motivated and strong sense of ownership, urgency, and drive
Bonus Points for:
• Experience in a site reliability or infrastructure engineering function for IaaS, PaaS, or SaaS company fostering a DevOps culture/mindset.
• Expertise in AWS cloud computing and its related services
• Passion for automation and monitoring instrumentation in the code.
What We Care About:
We’re about openness, integrity, accessibility and great communication - so these should be qualities that you have too.
The usability of our products, open exchange of ideas with our teams and the commitment to the cities, counties, and states we serve are values we won’t budge on, and so we’ll want you to exemplify these too.
Note: U.S. Citizens and all those authorized to work for any employer in the U.S. are encouraged to apply. We are unable to provide sponsorship at this time.
To all recruitment agencies: PayIt does not accept agency resumes. Please do not forward resumes to our career’s alias or PayIt employees. PayIt is not responsible for any fees related to unsolicited resumes.