Senior Software Engineer
Job in
Sunnyvale, Santa Clara County, California, 94089, USA
Listed on 2026-06-23
Listing for:
Wal-Mart
Full Time position
Job specializations:
- IT/Tech
AI Engineer (Applied/Software), Machine Learning/ ML Engineer
Job Description & How to Apply Below
About Team:
The Search PTE-DevOps team processes billions of queries for millions of products on Walmart sites and apps worldwide. Whenever a user types in a query or browses through product categories on the web or mobile, our service goes to work. We mine structured and semi-structured data from product catalogs, the social web, transactions, query logs, and AI-generated signals at an unprecedented scale.
We work on big data problems and apply cutting-edge relevance algorithms from information retrieval, machine learning, and AI-powered ranking to deliver a high-availability, low-latency service that directly impacts business metrics.
Position Summary
Being part of the Search PTE-DevOps team at Walmart provides deep insight into the full lifecycle of a product, from content acquisition to sale. As a Senior Software Engineer in DevOps & AI Platform, you must support all systems and services to ensure high availability and reliability, while embracing AI-augmented workflows to accelerate engineering velocity. You will work closely with developers, AI/ML engineers, and platform teams to support new application features, AI model deployments, and service launches.
You will design, build, and operate the tools used to develop, scale, and monitor cutting-edge technology, including GenAI and LLMOps pipelines. You must be able to triage complex technical issues in collaboration with engineering, NOC, NetEng, and Platform teams. If you are passionate about five-nines reliability and excited about the intersection of AI and platform engineering, this position is for you.
We are looking for an expert in continuous integration and delivery pipelines, containerized infrastructure, and AI-assisted development practices. You will play a critical role in all search application and AI model release cycles, working closely with Engineering, QE, and DevOps.
What You'll Do:
* Build, manage, and evolve QE & Release Automation frameworks, incorporating AI-assisted test generation and self-healing test capabilities
* Build and support Kubernetes-based containerization in production, including GPU-backed workloads for AI/ML inference
* Independently lead the investigation and resolution of high-impact, critical search system and AI service incidents
* Build, manage, and support comprehensive monitoring and observability for applications and AI model performance (drift, latency, accuracy)
* Maintain and improve automation pipelines supporting application build, release, and AI model deployment cycles (CI/CD + MLOps/LLMOps)
* Integrate AI coding assistants and GenAI tooling (e.g., Wibey, GitHub Copilot) into engineering workflows to accelerate development
* Design and implement AI-powered observability solutions using intelligent alerting, anomaly detection, and predictive incident management
* Collaborate with AI/ML teams to operationalize LLM-based features within search, including prompt pipeline management and vector search infrastructure
* Drive execution and lead medium- to large-scale projects from Dev to Ops, including AI/ML platform initiatives
* Analyze, design, and build frameworks using cutting-edge technology and AI tools to achieve operational excellence
* Identify opportunities to improve and optimize the software development and AI deployment lifecycle (SDLC + MLOps)
* Provide engineering and QE teams with architectural guidance on solutions, automation frameworks, and AI integration patterns
* Work with product and engineering teams to review new functional and AI-driven requirements; develop comprehensive test plans and automate test cases - including AI model validation
* Perform quality assurance for large-scale eCommerce backend search services and AI-powered features
* Write programs and scripts to automate testing and validation of search backend services and LLM/AI inference pipelines
* Apply expertise in WCNP, Concord, Looper, Python, Golang, and Java, along with hands-on experience in AI/ML tooling, LLMOps, and GenAI platforms
What You'll Bring:
* Bachelor's or Master's Degree in Computer Science, Engineering, or related field
* 5+ years of experience building scalable eCommerce applications or distributed backend services
* 3+ years of industry experience in application releases, CI/CD pipelines, and distributed system testing
* Strong expertise in containerization and orchestration using Kubernetes (including multi-cluster and GPU-node management)
* 2+ years of programming experience in Python, Go, Java, and Shell scripting, with exposure to REST and gRPC API frameworks
* Experience with modern CI/CD platforms (e.g., Concord, GitHub Actions, Looper) and GitOps workflows (e.g., ArgoCD, Flux)
* Working knowledge of AI/ML workflows: model serving, inference optimization, or LLM deployment pipelines
* Familiarity with observability stacks: OpenTelemetry, distributed tracing, log aggregation (e.g., Splunk,…
Position Requirements
10+ years of work experience