Detection Analytics Software Engineer
Listed on 2026-02-18
IT/Tech
Systems Engineer, Data Engineer
Description
5-month project with a chance of extension; the project is not expected to take more than 8 months. Remote candidates will be accepted, but candidates who can come onsite 3 days/week are preferred.
This program is made up of 2 projects, with one team on each. Each project team will have 2 developers + 1 PM + 1 Data Scientist (and possibly an ME/EE SME). Two separate teams will execute in parallel (retrofit HRU vs new-build CDU) with minimal crossover due to scope, timeline, and deployment constraints. This role will pair with a Platform Integration Role.
Team Overview
The Signals Quality team in Microsoft's CO+I IDEA group deploys high‑reliability detections across data‑center telemetry. The team blends analytics (Python/KQL), cloud‑native engineering (Azure Functions/Logic Apps), and CI/CD hardening via Azure DevOps. Close collaboration with Solutions Engineering enables rapid low/no‑code deployments that later graduate into production‑grade components.
Project Overview
- Workstream 1: Retrofit sites using HRU (Heat Rejection Units)
- Workstream 2: New‑build sites using CDU (Cooling Distribution Units)
- Deployments target a Phoenix‑area site and a Milwaukee‑area site; no travel is expected.
You will design, implement, validate, and tune time‑series detections over large telemetry sets. You'll author advanced KQL (ADX) for data shaping and replay, build Python‑based analytics (often in notebooks) to iterate quickly, and package logic into Azure Functions/Logic Apps with CI/CD in Azure DevOps so detections can be rolled out reliably at scale.
Top Skills
- Advanced KQL / Azure Data Explorer (ADX)
  - Author time‑series queries (e.g., make-series, make_list, joins, windows) to extract, aggregate, and diagnose telemetry at scale.
  - Build replay datasets and run historical backtests to validate threshold/anomaly logic and severity classification before production.
  - Identify data gaps and propose telemetry enhancements; map operational scenarios to available signals.
- Python for Analytics (incl. notebooks) – 7+ YOE
  - Wrangle/transpose large datasets; implement feature engineering and transformations needed by detection logic.
  - Prototype and harden anomaly/threshold detection logic; collaborate with DS to quantify precision/recall tradeoffs and alert fatigue reduction.
  - Package analytics into Azure Functions or glue code around Logic Apps when appropriate.
- Azure Functions / Logic Apps + Azure DevOps CI/CD – 3+ YOE
  - Convert proven analytics into deployable components; manage pipelines, artifacts, secrets, and approvals in ADO.
  - Integrate detection outputs into downstream systems and operator‑facing surfaces; contribute to runbooks and validation plans.
  - Partner with Solutions Engineering to deliver rapid low/no‑code wins, then graduate those into hardened CI/CD‑backed services.
- Analyze large telemetry sets; shape/replay datasets; validate detection effectiveness and tune severity to reduce alert fatigue.
- Translate prototype logic into production‑ready analytics (Python/KQL); package into Functions/Logic Apps and wire up CI/CD.
- Work with Solutions Engineering on low/no‑code accelerators; coordinate with Engineering to harden and scale solutions.
- Validate data paths and telemetry onboarding; triage signal gaps with domain SMEs (CDU/HRU).
- Track work in ADO boards; contribute to design notes, detection descriptions, validation plans, and weekly status for the PM.
Best candidates move fluently between notebooks and ADX, quickly build/replay time‑series detections, collaborate cross‑functionally to land Logic Apps/Functions with robust CI/CD, and communicate clearly about tradeoffs and outcomes.
Average candidates are strong coders but slow on time‑series KQL or dataset shaping, rely heavily on others to productionize logic, and struggle to tell the story of results for operators and PMs.
Additional Skills & Qualifications
N/A
Experience Level
Expert Level
Benefits
- Medical, dental & vision
- 401(k)/Roth
- Insurance (Basic/Supplemental Life & AD&D)
- Short‑ and long‑term disability
- Health and Dependent Care Spending Accounts (HSA & DCFSA)
- Transportation benefits
- Employee Assistance Program
- Time…