Senior/Principal ML Systems Architect; TensorFlow + Python
Job in
Bristol, Bristol County, BS1, England, UK
Listed on 2025-12-20
Listing for:
SoftInWay Inc
Full Time
position Listed on 2025-12-20
Job specializations:
-
Software Development
AI Engineer, Machine Learning/ ML Engineer, Data Scientist
Job Description & How to Apply Below
Senior/Principal ML Systems Architect (Tensor Flow + Python) Overview
We are seeking a highly experienced ML Systems Architect to design and implement a scalable, production-grade architecture for our machine learning solver. This role bridges research prototypes and commercial deployment, ensuring reliability, maintainability, and performance in a mixed technology stack.
Responsibilities- Architect the ML Solver Platform
:- Define modular architecture for data preprocessing, model execution, and post-processing.
- Establish clear API contracts between Python/Tensor Flow and C# services.
- Convert research code into robust, testable, and observable services.
- Implement CI/CD pipelines, automated testing, and reproducibility standards.
- Design REST/gRPC endpoints for cross-language communication.
- Ensure compatibility with C#/.NET services.
- Performance & Scalability
:- Optimize GPU/CPU utilization, batching strategies, and memory management.
- Plan for multi-model and multi-tenant scenarios.
- MLOps & Lifecycle Management
:- Implement model versioning, artifact registries, and deployment workflows.
- Set up monitoring, logging, and alerting for solver performance.
- Security & Compliance
:- Apply best practices for secrets management, dependency scanning, and secure artifact storage.
- ML Frameworks
:
Expert in Tensor Flow (TF2/Keras), experience with ONNX Runtime for inference. - Programming
:
Advanced Python for ML; strong understanding of packaging, type checking, and performance profiling. - APIs
:
Proficiency in gRPC/Protobuf and REST for cross-language integration. - Performance Optimization
: GPU acceleration (CUDA/cuDNN), mixed precision, XLA, profiling. - Observability
:
Metrics, tracing, structured logging, dashboards. - Security
: SBOM, image signing, role-based access, vulnerability scanning.
- Experience with ONNX Runtime Training, PyTorch, or hybrid ML architectures.
- Familiarity with distributed training strategies and multi-GPU setups.
- Knowledge of feature stores and data validation frameworks.
- Exposure to regulated environments and compliance frameworks.
- ML
:
Tensor Flow, ONNX Runtime, tf2onnx. - APIs
:
FastAPI, gRPC.
- Work on cutting-edge ML solutions integrated into commercial engineering software.
- Define architecture that scales across global deployments.
- Collaborate with a team of experts in ML, software engineering, and UI development.
Position Requirements
10+ Years
work experience
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×