Systems Engineer
Job in
Irving, Dallas County, Texas, 75084, USA
Listed on 2026-02-16
Listing for:
TRISTAR Insurance Group
Full-time position
Job specializations:
- IT/Tech: Systems Engineer, Cloud Computing
Education Level: Bachelor's Degree
## Systems Engineer
In-office position
POSITION SUMMARY:
The Senior Systems Engineer is a hands-on senior individual contributor responsible for designing, building, and operating TRISTAR’s core infrastructure platform, with a strong emphasis on Linux systems, Kubernetes, and automation. This role will own the Kubernetes platform end to end (cluster build, lifecycle management, operational standards, reliability, and day-2 operations) while partnering closely with development teams as TRISTAR transitions toward a DevOps operating model.
Success in this role requires deep technical ownership, strong troubleshooting skills across distributed systems, and the ability to improve reliability through thoughtful design, observability, and repeatable automation.
ESSENTIAL DUTIES AND RESPONSIBILITIES:
Kubernetes Platform Engineering & Lifecycle:
• Design, build, and operate Kubernetes clusters in production, including upgrades, patching, scaling, and reliability improvements.
• Establish platform standards and operating practices as the environment matures (cluster configuration, access patterns, resource governance, and runbooks).
• Serve as the senior escalation point for Kubernetes platform issues and drive resolution through root-cause analysis and prevention.
Kubernetes Storage, Backup/Restore & Disaster Recovery:
• Design and implement Kubernetes storage patterns (StorageClasses, PV/PVC lifecycle, capacity planning) and support stateful workloads.
• Implement, test, and maintain Kubernetes-native backup/restore and recovery procedures.
• Integrate Kubernetes persistence needs with enterprise storage platforms, including Dell ObjectScale and existing virtualization/storage systems.
Ingress, Load Balancing & Kubernetes Networking:
• Own Kubernetes traffic entry, including ingress controllers, load balancers, routing patterns, and TLS/certificate handling.
• Define repeatable patterns for exposing services and troubleshooting connectivity across platform components.
Linux Systems Engineering:
• Administer and harden Linux systems that support the platform, including patching, performance tuning, service reliability, logging, and baseline configuration.
• Troubleshoot system and platform issues across compute, storage, and network dependencies.
Automation, Scripting & API Integrations:
• Build automation to reduce manual work and increase consistency across infrastructure operations using Python/PowerShell/Bash and API-driven workflows.
• Evaluate, recommend, and help implement an automation / configuration management approach (tooling, patterns, and standards) to support repeatable tasks such as provisioning, configuration enforcement, patching, drift detection, and validation.
• Develop reusable automation assets (modules/playbooks/templates/scripts) and establish version-controlled workflows (Git), documentation, and operational handoff practices.
• Leverage RESTful APIs to integrate systems and create operational workflows (health checks, reporting, event-driven automations, and change validation).
Monitoring, Alert Response & Operational Reporting:
• Monitor alert sources and observability tooling (including SolarWinds on-prem), investigate events, and drive issues to completion.
• Document incidents, actions taken, and final resolutions, and contribute to improved alerting quality and operational visibility.
Data Center Support (Occasional):
• Provide occasional on-site support as needed in the data center for infrastructure prep and troubleshooting (racking equipment, cabling, and physical connectivity verification).
• Maintain working familiarity with server hardware and data center best practices to support rare hands-on needs.
Cloud Readiness & Future-State Hosting:
• Partner with development and infrastructure teams to plan and progress TRISTAR’s long-term transition toward cloud-hosted deployments of the application stack.
• Contribute to cloud design discussions with a practical understanding of core cloud concepts (networking, identity/access, security, reliability, scalability, and cost considerations) across major providers (AWS/Azure/GCP).
• Translate…