Customer Support Engineer; GPU Cluster
San Francisco, San Francisco County, California, 94199, USA
Listed on 2026-06-26
-
IT/Tech
Technical Support, Cloud Computing: Infrastructure & Operations
About the Role
As a Customer Support Engineer at a pioneering AI company, you will be the first line of defense to support customers as they build out training, fine‑tuning, and inference solutions with Together AI. You will dive deep into complex technical challenges, providing swift and effective solutions while serving as a product expert. As part of the Customer Experience organization, you will collaborate closely with product and sales, driving continuous improvement of our offerings.
This is an exciting opportunity for a deeply technical professional passionate about AI and customer success to make a significant impact in a fast‑paced, innovative environment.
- Engage directly with customers to tackle and resolve complex technical challenges involving our cutting‑edge Kubernetes GPU clusters; ensure swift and effective solutions every time.
- Become a product expert in our GPU Cluster service, serving as the last line of technical defense before issues are escalated to Engineering and Product teams.
- Collaborate seamlessly across Engineering, Research, and Product teams to address customer concerns and work with senior leaders both internally and externally to ensure the highest levels of customer satisfaction.
- Transform customer insights into action by identifying patterns in support cases and working with Engineering and Go‑To‑Market teams to drive Together’s roadmap (e.g., future models to support).
- Maintain detailed documentation of system configurations, procedures, troubleshooting guides, and FAQs to facilitate knowledge sharing with the team and customers.
- Be flexible in providing support coverage during holidays, nights, and weekends as required by business needs to ensure consistent and reliable service for our customers.
- 3+ years of experience in a customer‑facing technical role with at least 1 year in a support function in AI or supporting a mission‑critical API in SaaS.
- Strong technical background, with knowledge of AI, ML, GPU technologies and their integration into high‑performance computing (HPC) environments.
- Familiarity with infrastructure services (e.g., Kubernetes, SLURM), infrastructure as code solutions (e.g., Ansible), high‑performance network fabrics, NFS‑based storage management, container infrastructure, and scripting and programming languages.
- Foundational understanding in the installation, configuration, administration, troubleshooting, and securing of compute clusters.
- Complex technical problem solving and troubleshooting, with a proactive approach to issue resolution.
- Ability to work cross‑functionally with teams such as Sales, Engineering, Support, Product, and Research to drive customer success.
- Strong sense of ownership and willingness to learn new skills to ensure both team and customer success.
- Excellent communication and interpersonal skills, with the ability to explain complex technical concepts to non‑technical stakeholders.
- Ability to operate in dynamic environments, adept at managing multiple projects, and comfortable with frequent context switching and prioritization.
We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full‑time position is $160,000–$230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job‑related knowledge.
Equal OpportunityTogether AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).