Senior IT Platform Engineer; Senior Systems Engineer
Listed on 2026-06-03
-
Engineering
Systems Engineer
Overview
The Senior Systems Engineer designs, implements, and supports enterprise on‑premises compute, virtualization, Active Directory, and backup systems, with responsibility for reliability, performance, and recoverability across plants, data centers, and offices. This role is hands on and focused on engineering excellence, operational stability, and resilient system design in support of business critical workloads.
While the environment is primarily on prem today, this position will help prepare systems and practices for future hybrid and cloud adoption, including increased automation and infrastructure standardization over time.
The full salary range for this position is $111,200 – $166,800
. However, our current budget for a new hire is $111,200 – $150,000
, depending on the candidate s specific experience and skills.
Petaluma, 109 Kentucky Street, Petaluma, CA 94952, USA
Key Responsibilities- SYSTEMS ENGINEERING & ARCHITECTURE
- Engineer and support enterprise compute, virtualization, Active Directory, and backup platforms, primarily on premises
- Design systems for high availability, fault tolerance, scalability, and recoverability
- Define and maintain engineering standards for server builds, clustering, hypervisors, and storage systems
- Lead upgrades, migrations, hardware refreshes, and platform modernization initiatives
- Serve as a senior technical resource and mentor within the infrastructure team
- Integrate infrastructure with network, security, identity, and application platforms
- Support limited hybrid workloads and contribute to future cloud readiness
- Provide clear architecture diagrams, standards, and operational documentation
- Collaborate with application, network, security, and cloud teams on cross functional initiatives
- OPERATIONS & RELIABILITY
- Ensure production systems meet uptime, performance, and recovery objectives
- Act as Tier 3 escalation for complex infrastructure incidents and participate in on call rotations
- Troubleshoot issues across compute, virtualization, storage, and backup layers
- Partner with managed service providers to ensure consistent execution of operational and recovery procedures
- Perform capacity planning and performance optimization
- ARCHITECTURE AND ENGINEERING
- Design and implement resilient, secure, and scalable compute, virtualization, and storage architectures
- Define and maintain standards, reference designs, and best practices for server builds, cluster design, hypervisor configuration, and storage layout
- Lead platform upgrades, hypervisor migrations, storage refreshes, and modernization initiatives
- Ensure integration with adjacent platforms such as network, security, cloud, identity, data, and applications
- Support hybrid environments spanning on-premises infrastructure, cloud compute platforms (Azure AWS), and SaaS workloads
- Design and maintain high-availability clusters and disaster recovery configurations
- AUTOMATION & STANDARDIZATION
- Use scripting and tooling to improve efficiency in provisioning, patching, validation, and lifecycle management
- Reduce manual effort and configuration inconsistency through standardized builds and processes
- Maintain clear documentation for configurations, procedures, and support handoffs
- Contribute to longer term automation and Infrastructure as Code efforts as platform maturity increases
- AUTOMATION & OPERATIONAL EXCELLENCE
- Automate server provisioning, patching, lifecycle management, validation, recovery, and compliance validation
- Reduce manual operational effort through scripting and workflow automation
- Partner with MSPs to ensure consistent execution of backup, recovery, and infrastructure runbooks
- Improve monitoring signal quality across compute, storage, and virtualization layers
- Design self-healing or auto-remediation capabilities where appropriate
- Continuously optimize resource utilization, performance, and capacity planning
- Design, maintain, and operate backup and disaster recovery solutions
- Own recovery procedures and participate in periodic DR testing
- Apply system hardening, secure configuration, and patching standards
- Support enterprise initiatives related to resiliency, ransomware protection, and business continuity
- Bachelor’s degree…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).