Cloud Platform Engineer
Listed on 2026-02-16
IT/Tech
Systems Engineer, Cloud Computing, Cybersecurity, Data Security
Overview
Veteran-Owned Firm Seeking a Cloud Platform Engineer for an Onsite Assignment in Brooklyn, NY
My name is Stephen Hrutka. I lead a Veteran-Owned management consulting firm in Washington, DC. We specialize in Technical and Cleared Recruiting for the Department of Defense (DoD), the Intelligence Community (IC), and other advanced defense agencies.
At HRUCKUS, we support fellow Veteran-Owned businesses by helping them recruit for positions across organizations such as the VA, SBA, HHS, DARPA, and other leading-edge R&D-focused defense agencies.
We seek to fill a Cloud Platform Engineer position for the Department of Social Services (DSS) of the City of New York.
The ideal candidate is a NY resident with at least 7 years of hands-on AWS experience utilizing core services such as EC2, RDS, S3, IAM, and Lambda. They possess 7+ years of expertise in Linux/Unix administration and automation scripting (Bash, Python) and a strong 5-year foundation in CI/CD pipelines and IaC for deploying AI agents and supporting MLOps using Kubernetes, ECS, or EKS.
If you’re interested, I’ll gladly provide more details about the role and discuss your qualifications further.
Thanks,
Executive Summary
HRUCKUS is looking for an experienced Cloud Platform Engineer to assist the Information Technology Services (ITS) department of the NYC Department of Social Services (DSS) in managing the day-to-day activities of the organization.
Position Description
The Cloud Platform Engineer will support the SNAP Payment Error Rate (CAP) Reduction Project. The SNAP Payment Error Rate (CAP) Reduction Project is a high-priority, agency-wide strategic initiative at the NYC Department of Social Services designed to mitigate federal oversight findings and prevent significant financial penalties. In partnership with McKinsey, the agency is deploying cutting-edge technologies, including Artificial Intelligence, Robotic Process Automation (RPA), and advanced analytics, to modernize error detection and strengthen eligibility verification, case processing accuracy, and quality control reviews.
By introducing proactive prevention capabilities and enhancing operational decision-making through data-driven insights, this project seeks to reduce payment inaccuracies, accelerate case resolution, and ultimately increase public trust in the integrity of the SNAP program.
Job Duties
- Monitor database and system performance using CloudWatch metrics, alarms, and logs; troubleshoot proactively.
- Develop, deploy, and optimize AI/ML solutions using AWS AI services including SageMaker and Bedrock, supporting model training, inference, and integration into production systems.
- Automate operational tasks using AWS Lambda, Systems Manager (SSM), and Infrastructure-as-Code tools such as CloudFormation or Terraform.
- Design, build, and maintain scalable, fault-tolerant data processing and analytics workflows on AWS using services such as API Gateway, S3, EC2, RDS, Lambda, Glue, Athena, DynamoDB, EMR, Kinesis, and DataSync.
- Design and integrate agentic AI systems, including LLM-based agents, multi-agent workflows, and autonomous orchestration pipelines using frameworks such as LangChain and LangGraph.
- Implement ETL/ELT pipelines and data architectures that support machine learning, analytics, and intelligent agent-based applications.
- Support CI/CD pipelines for AI models and data workflows using Jenkins and container-based platforms such as ECS, EKS, or Kubernetes.
- Apply security best practices across AI and data platforms, including IAM least-privilege access, encryption, audit logging, and compliance controls.
- Maintain technical documentation for AI architectures, data pipelines, infrastructure configurations, and operational runbooks.
- Minimum 7 years of hands-on AWS experience: EC2, RDS, S3, CloudWatch, CloudTrail, IAM, KMS, AWS Backup, and Lambda.
- Minimum 7 years of experience in Linux/Unix administration and automation scripting (Bash, Shell, Python).
- Minimum 7 years of experience with Infrastructure as Code (IaC) and automation tools, including CloudFormation, Terraform, and Ansible, for provisioning and maintaining.
- Minimum 7 years of…