Senior Devops Engineer - MN
Listed on 2026-04-18
-
IT/Tech
Cloud Computing, Systems Engineer, Systems Administrator
LOCATION: Ouroffice is located in Minnetonka, MN. Candidates must reside in the Twin Cities area and will be required to be in-office 5 days per week. We do offer a flexible work environment.
OverviewWe are seeking an experienced Senior Dev Ops Engineer to join our engineering team at RAZR Marketing. In this role, you will be responsible for maintaining and optimizing AWS infrastructure, ensuring system reliability, and implementing disaster recovery strategies. Working with our enterprise‑scale Fibonacci NX monorepo containing 50+ applications and libraries, you will focus on operational excellence, system maintenance, and AWS best practices to support our customer‑facing loyalty platform and banking integrations.
We are looking for someone passionate about maintaining reliable and resilient systems in AWS, enjoys implementing operational best practices, and thrives in a collaborative environment supporting enterprise‑scale applications.
- Design, implement, and maintain infrastructure using Pulumi.
- Automate infrastructure provisioning and configuration management.
- Manage environment configurations across dev, staging, and production.
- Implement version control and code review processes for infrastructure changes.
- Develop reusable infrastructure components and modules.
- Manage and optimize AWS services including ECS, Lambda, RDS, S3, Cloud Front, and VPC configurations.
- Implement AWS best practices for security, performance, and cost optimization.
- Monitor and maintain system health across multiple environments (dev, staging, production).
- Conduct regular infrastructure audits and implement improvements.
- Design, implement, and maintain disaster recovery plans for critical systems and databases.
- Develop and execute backup strategies for RDS databases, S3 data, and application configurations.
- Conduct regular DR drills and validate recovery procedures.
- Document and maintain runbooks for disaster recovery scenarios.
- Implement multi‑region failover strategies for high‑availability services.
- Perform routine system maintenance including patching, updates, and security hardening.
- Manage database maintenance tasks including backups, performance tuning, and capacity planning.
- Monitor system performance and proactively address potential issues.
- Coordinate maintenance windows and communicate with stakeholders.
- Maintain and rotate secrets, certificates, and access credentials.
- Support application deployments for Angular containers, NestJS servers, and Java Spring Boot services.
- Manage Docker container deployments to AWS ECS and serverless environments.
- Coordinate release of deployments and provide rollback support when needed.
- Maintain deployment documentation and standard operating procedures.
- Implement and maintain AWS security best practices including IAM policies, security groups, and encryption.
- Conduct regular security assessments and vulnerability remediation.
- Ensure compliance with industry standards for banking and financial services integrations.
- Manage access controls and audit logging across AWS environments.
- Maintain comprehensive monitoring solutions using Cloud Watch, application logs, and custom metrics.
- Configure and tune alerting thresholds for infrastructure and application health.
- Participate in on‑call rotation for production incidents.
- Conduct post‑incident reviews and implement preventative measures.
- Monitor and optimize AWS costs through resource rightsizing and reserved capacity planning.
- Identify and eliminate unused or under‑utilized resources.
- Implement cost allocation tags and provide cost reporting to stakeholders.
- Create and maintain operational documentation, runbooks, and architecture diagrams.
- Document system configurations, procedures, and troubleshooting guides.
- Share knowledge with development and operations teams through training and mentorship.
- Work closely with SRE team on infrastructure reliability and incident response.
- Partner with development teams to understand operational requirements.
- Collaborate with security team on compliance and vulnerability management.
- Participate in Scrum ceremonies and contribute to operational planning.
- Bachelor's degree in…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).