Sr Software Engineer, Network Services
Listed on 2026-02-16
-
IT/Tech
Systems Engineer, Network Engineer
Job Summary
The Network Reliability Engineering (NRE) team is a newly established group within our global network team, focused on a software‑first approach to network engineering and operations. Our mission is to eliminate manual effort, improve reliability, and enable agility through automation. In this role, you’ll design and build automation systems and tools that reduce manual network operations, streamline workflows, and improve the reliability of our global infrastructure.
LocationWork from Home – Pennsylvania
DivisionSoftware Engineering
Supervisory OrganizationInternational Ticketing
Line ManagerSr Manager – Network Services
Contract TermsPermanent / Full Time
The TeamThe NRE team partners closely with the core network team to identify operational challenges and build scalable automation solutions. We also collaborate with other software engineering teams within the infrastructure group to deliver end‑to‑end tooling and services. The team plays a critical role in evolving how networks are managed—leveraging automation frameworks, observability tools, and SRE best practices to support a resilient and secure global infrastructure.
TheJob
Your work will span a broad range of automation efforts—including provisioning network hardware, implementing new configurations, enforcing access policies across data centers and AWS environments, and developing observability and monitoring systems. You’ll collaborate closely with the Core Networking team and infrastructure software engineering teams to deliver scalable, resilient automation aligned with architectural and security standards.
What You Will Be Doing- Designing and building internal tools and automation systems to streamline network workloads.
- Automating the provisioning and configuration of network devices and services.
- Implementing systems to enforce network access policies across global data centers and AWS environments.
- Supporting systems the team has developed.
- Developing observability and monitoring tools to track network health, performance, and reliability.
- Collaborating closely with the core networking team to translate architecture and design into scalable automation.
- Integrating with internal APIs and systems built by other infrastructure engineering teams.
- Writing Infrastructure‑as‑Code (e.g., Terraform) to manage cloud and on‑prem network infrastructure.
- Applying SRE best practices to improve availability, scalability, and operational efficiency.
- Maintaining high standards for code quality, testing, and documentation in automation projects.
- Participating in incident response and troubleshooting efforts, using data‑driven analysis to drive improvements.
- Identifying opportunities to reduce operational toil and proactively proposing automation solutions.
- Contributing to a culture of continuous improvement, knowledge sharing, and strong team collaboration. Complete assigned project related work from Jira tickets following SAFe methodology.
- Support PCI / security compliance requirements (upgrades, defect management, etc).
- Participate in on‑call and potentially some after‑hours support as required.
- 5+ years in roles such as Software Engineering, Network Automation, Network Reliability, or SRE/Dev Ops with a strong track record of delivering automation for mission‑critical infrastructure.
- Proficient in Python (required), with experience in Go or Rust preferred; comfortable building tools, scripts, and services that interface with network devices and APIs.
- Hands‑on experience with CI/CD tools (e.g., Git Lab CI) for automating build, test, and deployment pipelines for infrastructure changes.
- Strong with Ansible (playbooks, roles, modules).
- Experience with open‑source network tooling such as Nautobot, Net Box, NAPALM, Netmiko.
- Experience using Docker and Kubernetes to build and manage network tools and services in a cloud‑native environment.
- Deep experience with Terraform (required) and/or Pulumi for provisioning infrastructure declaratively and integrating it into workflows.
- Understanding of networking concepts and hardware.
- Experience with tools like Solarwinds, Prometheus, Grafana for tracking metrics, logs, and building dashboards/alerts.…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).