More jobs:
IT - Network Engineer
Job in
Fort Mill, York County, South Carolina, 29715, USA
Listed on 2026-07-01
Listing for:
Apex Systems
Full Time
position Listed on 2026-07-01
Job specializations:
-
IT/Tech
IT Support
Job Description & How to Apply Below
Location:
Fort Mill, South Carolina
Hybrid
Schedule:
3 days onsite/2 days remote
12 mo. Contract-to-Hire position
Pay Range: $52-58/hr.
Role Overview
We are seeking a motivated Recovery Engineer / Analyst to join a Production Services team for a large financial client. This role is for a hands-on technical professional who performs well during high-severity production incidents, enjoys problem-solving, and is interested in long-term growth within the organization. You will work closely with senior recovery managers and various technical teams across the enterprise.
Key Responsibilities
Participate in major and critical incident bridges, assisting with triage, diagnostics, and recovery activities.
Gather and analyze logs, metrics, and alerts to support rapid issue identification.
Assist in identifying the impacted service, symptoms, and contributing factors during incidents.
Perform initial analysis using available diagnostics, observability tools, and documentation.
Support post-incident reviews and root cause analysis (RCA) efforts by collecting data and timelines.
Create, update, and validate runbooks, standard operating procedures (SOPs), and recovery playbooks based on incident learnings.
Analyze incident trends to identify areas for operational improvement.
Learn and apply SRE and reliability principles under the guidance of senior teammates.
Required Qualifications
Experience:
Experience in production support, operations, NOC, SRE, Dev Ops, or application support roles is required.
Technical
Skills:
Candidates must have a working knowledge of application, infrastructure, or cloud environments, as well as logs, monitoring, alerts, and basic diagnostics.
The ability to read and follow technical runbooks and SOPs is essential.
Familiarity with observability tools such as APM, logging platforms, and metrics dashboards.
Scripting or development exposure, including Power Shell, Python, or Bash.
Core Competencies:
Strong problem-solving and analytical skills.
Ability to remain calm and organized during high-severity incidents.
Clear verbal and written communication skills.
A willingness to learn from senior engineers and a strong sense of ownership.
Ability to work effectively across teams.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×