Principal Network SRE Engineer - North America Software Center
Listed on 2025-12-14
-
IT/Tech
Systems Engineer, Network Engineer, Cloud Computing, Cybersecurity
Principal Network SRE Engineer - North America Software Center
Join TSMC Washington and help power the future of technology. At TSMC, we don't just make semiconductors; we innovate to transform industries and enhance lives. As the world’s leading semiconductor foundry, we partner with top tech companies to drive advancements in industries such as healthcare, automotive, consumer electronics, and renewable energy. At TSMC Washington, you'll thrive where innovation meets precision manufacturing, and integrity guides our high standards and customer trust.
Our visionary leaders collaborate with clients to achieve groundbreaking results, ensuring our leadership in the semiconductor sector. Explore career opportunities with TSMC Washington and join a company with a commitment to excellence and innovation.
Who We're Looking For:
We are seeking a highly skilled Network SRE Engineer to join our team and manage our large‑scale data center network infrastructure. In this role, you will be responsible for ensuring the reliability, scalability, and performance of our critical network systems. You will leverage your expertise in network engineering and SRE principles to design, deploy, and maintain resilient network systems, automate operational tasks, and troubleshoot complex issues in mission‑critical environments.
Key Responsibilities:
- Architect and implement robust, scalable, and fault‑tolerant data center network solutions, including Spine‑Leaf architectures and SDN technologies.
- Evaluate, design, and implement routing and switching protocols for optimal network performance and reliability. Establish disaster recovery strategies and high availability mechanisms for mission‑critical network environments.
- Develop automation solutions using scripting languages and Dev Ops tools. Automate routine network configuration, deployment, and management tasks to improve operational efficiency and reduce manual toil.
- Use Infrastructure as Code (IaC) practices to build repeatable, version‑controlled, and scalable network infrastructure.
- Apply SRE principles (e.g., SLOs, SLIs, error budgets) to design and operate resilient network systems. Implement self‑healing mechanisms for detecting and resolving failures without manual intervention.
- Diagnose and resolve complex network issues in large‑scale, distributed environments. Develop detailed runbooks and response procedures for minimizing downtime during outages.
- Identify opportunities to improve reliability, scalability, and cost‑efficiency of the network infrastructure.
- Collaborate with cross‑functional teams, including system engineers, Dev Ops teams, application developers, and security teams, to align network operations with business objectives.
- Mentor and guide junior engineers on SRE methodologies and network troubleshooting.
- 8+ years of experience in highly available and scalable network infrastructures, which including deployment, configuring, and maintaining routers, switches, firewalls, load balancers, and other networking equipment.
- 6+ years hands‑on experience with network troubleshooting, monitoring, observability tools, and popular network protocols in LAN and WAN.
- Familiar with SRE principles to improve the reliability, availability, and scalability of the data center network.
- Automate routine network operations, configuration management, and device provision by using tools and mainstream programming languages (e.g., Python, Ansible, Go). Optimize existing network processes to improve efficiency and reduce manual intervention.
- Familiar with CI/CD pipelines and infrastructure as code (IaC). Knowledge of container orchestration systems (e.g., Kubernetes, Docker).
- Plus:
Certifications such as CCNP, CCIE, or equivalent.
Personal Attributes:
- Teamwork skills across diverse and remote teams.
- Proactive and adaptable to shifting priorities for our needs.
- Have a background in high‑performing team environments.
- Agile with new technologies.
- Expert in diagnosing and resolving complex issues.
- Must have legal authorization to work in the U.S. Please note that we are unable to provide sponsorship or take over sponsorship of an employment visa at this time.
- Employment contingent on background…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).