Senior Software Engineer - AI Infrastructure Development. Concord LilyLifestyle
Listed on 2026-05-30
-
Software Development
Software Engineer, DevOps, Cloud Engineer - Software
Join Our Team as a Senior Software Engineer in AI Infrastructure Development!
As a vital member of our Technical Staff, take ownership of the design and development of key components in our Cloud Infrastructure. We’re looking for a talented developer who is a curious problem solver with a solid foundation in distributed systems and Linux engineering. Your expertise will allow you to navigate and enhance complex interactions within our systems.
This position is with our Compute Bare Metal Provisioning team, which plays an essential role in automating the entire server lifecycle—from the creation of new platform shapes (AMD/Intel/Arm/Nvidia) to hardware bring-up and customer-ready instance provisioning. You will directly work at the intersection of bare metal hardware and full-stack orchestration, where your skills in distributed systems and Linux will shine. Our team interacts with various components such as BMCs, NICs, Smart
NICs, ILOMs, GPUs, and custom firmware stacks, developing high-performance, scalable microservices that replicate, configure, secure, and validate server platforms across OCI’s extensive Compute and GPU Infrastructure. Collaborate closely with teams in Compute, Networking, Security, Data Center Engineering, and Hardware Development to ensure OCI can efficiently launch, scale, and maintain new server platforms with minimal operational overhead and maximum reliability.
We offer you the chance to work with cutting-edge GPU hardware and see your contributions make a real difference in our business. You will be part of a dynamic, motivated, and diverse team where your best work is supported and encouraged.
Key Responsibilities:- Own the software design and development for Oracle Cloud Infrastructure components.
- Deep dive into the stack and low-level systems, including Linux, Docker, Java web services, and Infrastructure as Code with Terraform.
- Proactively improve system performance and reliability by seeking out challenges to solve.
- Ensure simplicity and scalability in collaborative, agile environments.
- 3‑8+ years of experience in delivering and operating large-scale distributed systems, with a strong foundation in Linux development and systems debugging.
- Proficient in Object‑Oriented programming languages like C++ or Java, and experienced in scripting languages such as Python.
- In-depth knowledge of data structures, algorithms, operating systems, and distributed systems fundamentals.
- Hands‑on experience with tools like Terraform and familiarity with networking protocols (TCP/IP, HTTP).
- Solid understanding of databases, No
SQL systems, and distributed storage technologies. - Strong troubleshooting and performance tuning abilities.
- Experience in building multi‑tenant, virtualized infrastructure is a plus.
The salary range for this position is between $79,200 and $178,100 per year, with potential eligibility for bonuses and equity. Oracle offers comprehensive benefits, including health, vision, and dental insurance, retirement plans, paid time off, and more.
Apply now to be part of a company leading innovation in AI and cloud technology. Let’s build the future together!
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).