Position: Senior IT Platform Engineer (Senior Systems Engineer)

Overview

The Senior Systems Engineer designs, implements, and supports enterprise on-premises compute, virtualization, Active Directory, and backup systems, with responsibility for reliability, performance, and recoverability across plants, data centers, and offices. This role is hands-on and focused on engineering excellence, operational stability, and resilient system design in support of business-critical workloads.

While the environment is primarily on-premises today, this position will help prepare systems and practices for future hybrid and cloud adoption, including increased automation and infrastructure standardization over time.

The full salary range for this position is $111,200 – $166,800. However, our current budget for a new hire is $111,200 – $150,000, depending on the candidate's specific experience and skills.

Locations

109 Kentucky Street, Petaluma, CA 94952, USA

Key Responsibilities

SYSTEMS ENGINEERING & ARCHITECTURE
Engineer and support enterprise compute, virtualization, Active Directory, and backup platforms, primarily on-premises
Design systems for high availability, fault tolerance, scalability, and recoverability
Define and maintain engineering standards for server builds, clustering, hypervisors, and storage systems
Lead upgrades, migrations, hardware refreshes, and platform modernization initiatives
Serve as a senior technical resource and mentor within the infrastructure team
Integrate infrastructure with network, security, identity, and application platforms
Support limited hybrid workloads and contribute to future cloud readiness
Provide clear architecture diagrams, standards, and operational documentation
Collaborate with application, network, security, and cloud teams on cross-functional initiatives

OPERATIONS & RELIABILITY
Ensure production systems meet uptime, performance, and recovery objectives
Act as the Tier 3 escalation point for complex infrastructure incidents and participate in on-call rotations
Troubleshoot issues across compute, virtualization, storage, and backup layers
Partner with managed service providers to ensure consistent execution of operational and recovery procedures
Perform capacity planning and performance optimization

ARCHITECTURE & ENGINEERING
Design and implement resilient, secure, and scalable compute, virtualization, and storage architectures
Define and maintain standards, reference designs, and best practices for server builds, cluster design, hypervisor configuration, and storage layout
Lead platform upgrades, hypervisor migrations, storage refreshes, and modernization initiatives
Ensure integration with adjacent platforms such as network, security, cloud, identity, data, and applications
Support hybrid environments spanning on-premises infrastructure, cloud compute platforms (Azure, AWS), and SaaS workloads
Design and maintain high-availability clusters and disaster recovery configurations

AUTOMATION & STANDARDIZATION
Use scripting and tooling to improve efficiency in provisioning, patching, validation, and lifecycle management
Reduce manual effort and configuration inconsistency through standardized builds and processes
Maintain clear documentation for configurations, procedures, and support handoffs
Contribute to longer-term automation and Infrastructure as Code efforts as platform maturity increases

AUTOMATION & OPERATIONAL EXCELLENCE
Automate server provisioning, patching, lifecycle management, recovery, and compliance validation
Reduce manual operational effort through scripting and workflow automation
Partner with MSPs to ensure consistent execution of backup, recovery, and infrastructure runbooks
Improve monitoring signal quality across compute, storage, and virtualization layers
Design self-healing or auto-remediation capabilities where appropriate
Continuously optimize resource utilization, performance, and capacity planning
Design, maintain, and operate backup and disaster recovery solutions
Own recovery procedures and participate in periodic DR testing
Apply system hardening, secure configuration, and patching standards
Support enterprise initiatives related to resiliency, ransomware protection, and business continuity

Qualifications

Bachelor’s degree…