×
Register Here to Apply for Jobs or Post Jobs. X

Senior Site Reliability Engineer

Job in Peoria, Peoria County, Illinois, 61639, USA
Listing for: Caterpillar Financial Services Corporation
Full Time position
Listed on 2026-02-12
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer, SRE/Site Reliability, IT Support
Salary/Wage Range or Industry Benchmark: 128470 - 208770 USD Yearly USD 128470.00 208770.00 YEAR
Job Description & How to Apply Below

Career Area:

Technology, Digital and Data

Job Description:

Your Work Shapes the World at Caterpillar Inc.

When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live.

Together, we are building a better world, so we can all enjoy living in it.

Job Summary:

As a Site Reliability Engineer, you will be responsible for ensuring the reliability, availability, and performance of our D365 ERP systems, connectivity, and infrastructure. You will collaborate with cross-functional teams to develop and implement strategies to improve system stability, automate repetitive tasks, and enhance service delivery and performance.

If you have a passion for delivering reliable, high-performance services and thrive in a fast-paced environment, we'd love to hear from you. Apply now to join our team as a Site Reliability Engineer!

What You Will Do:
  • Monitor and troubleshoot production and QA systems to identify and resolve performance, scalability, and reliability issues proactively.
  • Participate in the on-call rotation to provide 24/7 critical incident support for eCommerce platform systems
  • Design, implement, and maintain automated processes and tools to streamline deployment and release processes.
  • Collaborate with cross-functional teams to define, document, and implement operational processes, best practices, and procedures.
  • Implement and maintain system monitoring tools and dashboards to provide real‑time insights into system performance and identify potential issues.
  • Work closely with developers to identify and fix bugs and performance bottlenecks in the application code.
  • Ensure that systems and infrastructure comply with security, compliance, and regulatory requirements.
  • Continuously evaluate systems and processes to identify areas for improvement and implement changes as needed.
What

You Will Have:
  • Effective Communications: Strong understanding of communication concepts, tools and techniques; ability to effectively transmit, receive, and accurately interpret ideas, information, and needs through the application of appropriate communication behaviors.
  • Technical Troubleshooting: Extensive knowledge of technical troubleshooting approaches, tools and techniques; ability to anticipate, recognize, and resolve technical issues on hardware, software, application or operation.
  • Performance Measurement and Tuning: Knowledge of system performance, testing and programming; ability to monitor, measure, and optimize system performance and network communication.
  • Software Release Management: Knowledge of strategies, practices and tools for managing versions and distribution of software products and enhancements; ability to evaluate and improve release management practices and tools.
  • Software Reliability Management: Knowledge of software reliability management; ability to develop and use principles, methodologies and metrics that increase software product performance and reliability.
Considerations for top Candidates:
  • Bachelor's degree in Computer Science, Information Technology, a related field, or equivalent experience.
  • 6+ years of experience in site reliability engineering, Dev Ops, QA, or a related field.
  • Strong experience with Microsoft D365 or general Azure based services
  • Experience with AWS infrastructure and services
  • Experience with IaC solutions like Cloud formation and Terraform
  • Experience with CI/CD solutions - Github, Azure Dev Ops
  • Strong troubleshooting and critical thinking skills
  • 6+ years of experience and proficiency in one or more programming languages, such as Python (preferred), Java script (preferred).
  • Solid understanding of networking, load balancing, on prem hosting solutions, and web application architectures.
  • Experience with containerization technologies, such as Docker and Kubernetes.
  • Excellent problem‑solving skills and a strong attention to detail.
  • Strong IT and Business communication skills and ability to collaborate effectively with…
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary