Site Reliability Engineer

Job in Fort Worth, Tarrant County, Texas, 76102, USA
Listing for: Big Quest Solutions
Full Time position
Listed on 2025-12-24
Job specializations:
  • IT/Tech
    Cloud Computing, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 100,000 - 130,000 USD yearly
Job Description & How to Apply Below

Client’s Enterprise Data Machine Learning (EDML) employs innovative minds like you to design and develop software systems that can meet the demands of our ever-growing customer base.

Like a startup inside an enterprise, EDML takes a customer-centric approach to building our product, enabling data-driven conversations with our customers.

As one of the Site Reliability Engineers, you’ll work closely with customers, product management, and other subject matter experts in the technology industry to drive solutions that have an immediate impact on the day-to-day ability of other data scientists and machine learning engineers to productionize their models. You will do this by iteratively improving how we operate and scale our cloud-based containerized service.

What You’ll Do
  • Develop, deploy, and operate our secure infrastructure built on cloud services (AWS, Kubernetes, etc.).
  • Ensure the high availability, resiliency, performance, business continuity, and compliance capabilities of our cloud services.
  • Define SLA standards for SaaS solutions that are used by several groups within the company.
  • Work with our engineering teams to deploy and operate cloud services and to scale our development, QA, and production environments.
  • Build solutions for developer productivity. Develop and operate our build automation and continuous delivery systems.
  • Participate in an on-call rotation, drive incident resolution, and improve platform resiliency.
Basic Qualifications
  • Experience with container management technologies including Docker and Kubernetes.
  • Experience with AWS including EKS, ECS, IAM, S3, RDS, Security Groups, Route 53, VPC Flow Logs, etc.
  • Experience with automation/configuration management using Terraform or similar solutions.
  • Experience with CI tools such as Jenkins.
  • Experience with operational monitoring tools, such as Datadog, New Relic and Splunk.
  • Proficiency in Linux tools and shell scripting or other Linux automation.
  • An interest in designing, analyzing and troubleshooting large-scale distributed systems.
  • Well-versed in the entire software development lifecycle, DevOps, and SRE practices.
Preferred Qualifications
  • Experience with automated unit and integration testing of infrastructure code
  • Experience with container security and vulnerability management
  • Experience in one or more languages such as Python or Go
  • Certified Kubernetes Administrator (CKA)