Do you have a passion for innovation? Are you excited to leverage cutting edge technology to solve big business problems? If your response to those questions is “yes”, we would love for you to join us! At OrangePeople we consult for some of the most prestigious brands in the world. But more importantly, our consultants have a voice in the vision and future of the company. At OrangePeople, our focus is people. It’s right there in our name.
The Lead Systems Engineer works as part of a team responsible for end-to-end technical support of complex applications. The ideal candidate would be a world-class technologist with a strong desire to dive deep into complex technical challenges and resolve them. This role will split their time between 24x7 operational support and proactive availability threat hunting. Keeping the business up and running is the singular objective of this role. It requires someone who lives for the hunt of finding what went wrong, and fixing it fast. Must be willing and able to own the issue and drive it, with maniacal focus, towards resolution.
Responsibilities:
- Keen understanding and technical intuition for all potential Support and Operational problems that could happen in a matured enterprise environment
- Extensive experience in as many major Java frameworks as possible (Spring, Strut, Hibernate, JSF, Grails etc) in a full stack capacity
- Candidate must have strong technical fluency; comfortable understanding and discussing architectural concepts with management, architects, developers and systems & applications engineering teams.
- Deep expertise in Public Cloud technologies (AWS, Azure – IaaS, PaaS) and internal hosting infrastructure (server, storage, networking, etc.).
- Extensive experience with containers and orchestration: Docker, Kubernetes, AWS ECS, Azure Service Fabric, Lambda.
- Understanding of internet standards such as HTTP, DNS, FTP, SSH, HTML, XML, JDBC, ODBC, SNMP and other protocols.
- Experience with eventing queue management technologies such as Rabbit MQ and Websphere MQ.
- Expert level knowledge of Splunk with ability to mine large datasets for critical triage data.
- UNIX/LINUX server experience, including expertise in system installation, configuration, administration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures (Windows a plus, but not required)
- Web (Apache), .Net & Java application (Tomcat, Jboss, etc) server expertise including installation, administration, configuration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures
- Knowledge of networking and load balancers concepts.
- Strong ability and desire to troubleshoot and solve complex technology issues in the shortest amount of time possible while demonstrating complete ownership of restoration efforts.
- Able to follow be a sleuth on the trail of technical clues that ultimately leads to the root cause of an incident.
- Able to assess new application development to identify issues that may result in operational challenges after launch.
- Ability to apply strong knowledge of core technologies to various applications built on those technologies.
- Able to liaison with various development teams to drive stability and availability improvements as a result of deep technical analysis of their systems.
- Will be part of a team with 2 groups.
- While 1 half of the team is focused on the operational support, including on call support, the other half of the team will be focused on threat hunting.
- This will provide the team with a balance between operational support for major incidents and deep technical work that is not defined by incident restoration.
Required:
- Extensive Experience in as many major Java frameworks as possible (Spring, Strut, Hibernate, JSF, Grails etc) in a full stack capacity
- Candidate must have strong technical fluency; comfortable understanding and discussing architectural concepts with management, architects, developers and systems & applications engineering teams.
- Deep expertise in Public Cloud technologies (AWS, Azure – IaaS, PaaS) and internal hosting infrastructure (server, storage, networking, etc.).
- Experience in MySQL Database administration is a plus.
- Extensive experience with containers and orchestration: Docker, Kubernetes, AWS ECS, Azure Service Fabric, Lambda.
- Understanding of internet standards such as HTTP, DNS, FTP, SSH, HTML, XML, JDBC, ODBC, SNMP and other protocols.
- Experience with eventing queue management technologies such as Rabbit MQ and Websphere MQ.
- Expert level knowledge of Splunk with ability to mine large datasets for critical triage data.
- UNIX/LINUX server experience, including expertise in system installation, configuration, administration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures (Windows a plus, but not required)
Preferred Qualifications:
- Experience with AppDynamics, Splunk, SiteScope, Rundeck, or Jenkins are a plus
Additional Responsibilities:
- Participate in OrangePeople monthly team meetings, and participate in team building efforts.
- Contribute to OrangePeople technical discussions, peer reviews, etc.
- Contribute content and collaborate via the OP-Wiki/Knowledge Base.
- Provide status reports to OP Account Management as requested
-
About us:
Orange People is an Enterprise Architecture and Project Management solutions company. Our most valuable asset is our people: dynamic, creative thinkers who are passionate about doing quality work. As a member of the Orange People team you will have access to industry-leading consulting practices, strategies & technologies, innovative training & education. An ideal Orange Person is a technology leader with a proven track record of technical achievements and strong process/methodology orientation.