Sr. Software Engineer, Cloud SRE and Automation

Job in Tower, St. Louis County, Minnesota, 55790, USA
Listing for: F5 Networks
Full Time position
Listed on 2026-02-07
Job specializations:
  • IT/Tech
    Cloud Computing, SRE/Site Reliability

At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation.

Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.

Position Summary

We're seeking a Senior Software Engineer specializing in Cloud SRE and Automation with strong expertise in building reliable, scalable cloud infrastructure, implementing runbook automation, and driving operational excellence through intelligent automation and observability.

What You'll Do

Build and Scale Cloud Infrastructure with Reliability

  • Design, develop, and implement cloud-native automation and remediation services across AWS, Azure, and GCP platforms
  • Build and maintain highly available, scalable infrastructure using Infrastructure as Code (Terraform, CloudFormation, ARM Templates)
  • Develop and optimize cloud architectures for reliability, performance, and cost efficiency
  • Implement and manage Kubernetes-based containerized workloads across multi-cloud environments
  • Design and build self-healing systems with automated remediation and closed-loop automation
  • Create and maintain observability pipelines, monitoring solutions, and alerting systems across cloud platforms
  • Develop cloud-native CI/CD pipelines for rapid, reliable application and infrastructure deployments

Drive SRE Excellence and Automation

  • Apply Site Reliability Engineering principles including SLIs, SLOs, SLAs, and error budgets to cloud services
  • Design and implement runbook automation frameworks for incident response and operational tasks
  • Build automation tools and scripts to reduce toil and improve operational efficiency
  • Develop integration layers with ITSM platforms, incident management systems, and monitoring tools (ServiceNow, PagerDuty, Jira)
  • Implement chaos engineering and resilience testing to validate system reliability
  • Perform capacity planning, performance tuning, and cost optimization for cloud resources
  • Participate in incident response and on-call rotations, and conduct blameless postmortems

Monitor, Observe, and Optimize

  • Implement comprehensive observability solutions using Prometheus, Grafana, OpenTelemetry, CloudWatch, and other tools
  • Build automated alerting and intelligent runbook triggering based on system metrics and logs
  • Develop dashboards and metrics to track system health, performance, and reliability
  • Analyze system behavior and implement predictive analytics for proactive issue detection
  • Optimize application and infrastructure performance across distributed cloud environments

Collaborate and Mentor

  • Work closely with SREs, QA, development teams, and platform engineers to improve reliability and performance
  • Mentor junior engineers on SRE best practices, cloud architecture, and automation development
  • Participate in code reviews, technical design discussions, and architecture planning
  • Contribute to the evolution of SRE culture and practices within the organization
  • Document automation workflows, runbooks, cloud architectures, and operational procedures

Qualifications

Must-Have

  • Software Engineering & SRE Experience – 6-8 years of software development experience with 3+ years in SRE, DevOps, cloud engineering, or platform engineering roles
  • Programming Skills – Strong programming proficiency in Python
  • Cloud Platform Expertise – Hands-on experience with multi-cloud environments
  • AWS: EC2, ECS/EKS, Lambda, CloudWatch, CloudFormation, Step Functions, Systems Manager, Auto Scaling, VPC, IAM, S3, RDS/DynamoDB
  • Azure: Virtual Machines, AKS, Azure Functions, Azure Monitor, ARM Templates, Logic Apps, Azure Automation, Virtual Networks, Azure AD
  • GCP: Compute Engine, GKE, Cloud Functions, Cloud Monitoring, Deployment Manager, Cloud Workflows, VPC, IAM
  • Experience with at least two major cloud providers
  • Kubernetes & Containers – Strong experience with:
  • Kubernetes architecture, deployments, services, and…