Monitoring Automation Engineer- Plano, Pennington
Listed on 2026-01-06
-
IT/Tech
Systems Engineer, SRE/Site Reliability
Position Title: Monitoring Automation Engineer
Contract duration: 12 months with a possibility of extension
Targeted
Start date:
May/June
Desired Core Location(s): Plano, TX or Jersey City, NJ
Remote Option: 3 days/week in the office
PR: $70.24/hour
Reason Position is open: New funding, originally planning to bring on a Spunk SME but instead pivoting in search and looking for someone that can configure the automation of monitoring (with Splunk experience) using code.
RequirementsHands on experience with Terraform for automating infrastructure deployment.
Proficiency in Jenkins and version control tools like Bitbucket.
Experience with Ansible for automated deployments.
Knowledge of Artifactory for package management.
Experience configuring and managing Splunk, Dynatrace, and Open Telemetry (OTel).
Proficiency in Python, and bash shell for automation.
Strong troubleshooting skills for diagnosing monitoring and performance issues.
- Design and implement monitoring pipelines using Splunk, Dynatrace, and Open Telemetry (OTel).
- Automate the deployment of monitoring tools using Terraform, Ansible, and Jenkins.
- Manage configuration and version control with Bitbucket and Artifactory.
- Ensure seamless integration of monitoring solutions into CI/CD pipelines.
- Develop and maintain alerting, logging, and tracing solutions to support observability best practices.
- Optimize monitoring configurations for performance, cost, and scalability.
- Troubleshoot monitoring issues and provide root cause analysis for system incidents.
- Document monitoring architectures, automation scripts, and best practices.
- Stay updated on new monitoring technologies and advocate for improvements.
- Knowledge of Prometheus, Grafana, or ELK (Elasticsearch, Logstash, Kibana).
- Experience with Kafka and Kubernetes.
What can they expect on day 1? Skilled Monitoring Engineer to design, configure, and maintain monitoring pipelines for applications in Global Markets. Collaborate with Dev Ops, SRE, and development teams to ensure our monitoring tools provide actionable insights into system performance and reliability.
Interview Process(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).