Role: Platform Monitoring Engineer
Location: San Jose , CA
Interview: Phone/Skype
Emp Type: Permanent Job
5 years, Overall 10+
Prometheus, Grafana, ELK, kibana, Fluentd
logstash/fluentd, Alertmanager
- ELK and monitoring development,
- Logstash
- fluentd grok
- filters development,
- Grafana
- Containerization experience
* Assumed skill: Linux experience
Preferred :
- Prometheus
- Docker / Kubernetes / Other Orchestrators
- Use ElasticSearch for mining logs to identify issue RCA
- Monitor the system 24*7 and invoke Apps Support or Platform Support teams as necessary
- Should be able to use Prometheus to monitor systems
- Responsible to create issues and involve the concerned teams
- Determine the priority of the issues based on the impact to users
- Initiate the troubleshooting bridge and track the issue until completion in case of crisis
- Should give feedback to Support teams on building and enhancing the monitoring and alerting tools
San Jose , CA