Senior AI/ML Engineer, CH
Vector8
Prepare for this role
Job Type
Description
As a Senior AI/ML Engineer at vector8, you will design, implement, and deploy AI solutions that bridge the gap between research and production. Your work will focus on integrating and fine-tuning AI Models, optimizing the model performance, and ensuring enterprise-grade reliability, security, and scalability.
This is a hands-on engineering role where you will:
- Develop andoptimizeLLMand VLM-powered solutions for enterprise use cases
- Develop andoptimizeTTS, STT and ML models.
- Apply software engineering best practices (testing, CI/CD, modular design, documentation)
- Collaborate with cross-functional teams (data engineers,MLOps, cloud architects, and business stakeholders)
- Solve real-world enterprise challenges (security, compliance, legacy system integration)
- Own the full lifecycle of AI models, from data exploration to production monitoring
You will work closely with vector8’s Engineers and Project Managers to co-design AI foundations that enable organizations to scale AI from individual use cases to enterprise-wide capabilities.
The role is primarily based in Zurich, with occasional travel to client sites and collaboration with teams across Europe.
Responsibilities
1. End-to-End Model Development
- Design, implement, and deploy distributed, high-volume, high-performance, low-latency machine learning solutions, with a focus on GenAI models, and especially LLM integrations and API-driven architectures
- Take ownership of your models throughout their entire life cycle:
- Data exploration and cleaning to build reproducible, versioned datasets
- State-of-the-artresearch toidentifythe best architectures for the problem (e.g., transformers, RAG, fine-tuning)
- Implementation, training, and optimization in reproducible environments
- Deployment, monitoring, and maintenance in production
- Optimizemodels for performance, latency, and cost efficiency, especially in LLM serving and inference
2. Software Engineering for AI
- Write clean, modular, and well-documented code in Python (FastAPI,Pydantic,asyncio)
- Apply best practices in:
- Testing (unit, integration, end-to-end)
- CI/CD (GitHub Actions, GitLab CI,ArgoCD)
- Observability (logging, monitoring, tracing)
- Ensure security and compliance (data protection, access controls, encryption)
- Integrate models and code into CI/CD pipelines for seamless deployment
3. AI & ML Integration & API Development
- Design and implementAI-powered solutions that integrate with APIs, microservices, and event-driven architectures
- Develop andoptimizeAIpipelines for:
- Dataset cleaning,preprocessingand model training
- Fine-tuning (domain adaptation, instruction tuning)
- Retrieval-Augmented Generation (RAG) (vector databases, semantic search)
- Prompt engineering (optimizinginputs for performance, cost, and accuracy)
- Model evaluation (benchmarking, bias detection, drift analysis)
- Build scalable, secure, and cost-efficient serving infrastructure (e.g.,FastAPI,vLLM)
- Debug andoptimizeperformance (latency, throughput, token efficiencyfor Transformer based architectures)
4. Enterprise AI & MLOps
- Deploy andmonitorAI models in production
- Design and implementMLOpspipelines for:
- Model training, fine-tuning, and evaluation
- Model versioning and lineage tracking
- A/B testing and canary deployments
- Ensure scalability and reliability (auto-scaling, fault tolerance, disaster recovery)
- Collaborate with data engineers to build data pipelines (batch, streaming, real-time)
5. Collaboration & Technical Leadership
- Work closely with product owners, DevOps, and quality assurance in an agile, cross-functional team
- Mentor junior engineers and promote best practices in AI/ML and software engineering
- Translate product requirements into technical solutions and architectural decisions
- Document architectures, decisions, and best practices for internal and client-facing use
- Develop relationships with internal and external stakeholders, including clients and partners
6. Innovation & Continuous Improvement
- Stay ahead of the latestAI and MLarchitectures (transformers, Mixture of Experts, sparse attention).
- Experiment withcutting-edgetechniques (quantization, distillation, speculativedecoding).
- Evaluate and benchmark open-source and proprietary models (Llama, Mistral,Mixtral, GPT-4, Claude).
- Bring your own ideas through vector8’s ideation process.
- Contribute to vector8’s AI accelerators (reusable components for common industry problems).
- Embrace a strategic and continuous improvement mentality to drive innovation.
Requirements
- 5+ years of experience in AI/ML engineering, software development, or a related field
- Expertisein LLM architectures and training methodologies:
- Transformers, attention mechanisms, fine-tuning, RAG, quantization
- Prompt engineering, model evaluation, bias detection
- Strong knowledge of machine learning architectures: fully connected, CNN, LSTM,transformersand classical ML models
- Strong software engineering skills:
- Proficient in Python (FastAPI,Pydantic,asyncio, type hints)
- Experience with API development
- Familiarity with modern toolchains (Docker, Kubernetes, Terraform)
- Hands-on experience with LLM integrations:
- LLM providers
- Vector databases (Pinecone,Weaviate, Milvus)
- Model serving (vLLM, TGI,KServe)
- Experience withMLOpsand production deployments
- Understanding of enterprise challenges:
- Security, compliance, scalability, costoptimization.
- Experience with relational and non-relational databases.
- Strong problem-solving and debugging skills.
- Excellent communication and collaboration skills (fluent in English; German is a strong plus).
- Bachelor’s orMaster’s degree in Computer Science, Mathematics, Physics, ora relatedfield.
- Experience with multi-cloud environments (AWS, Azure, GCP).
- Experience with code optimization (e.g., model quantization, parallelization).
Benefits
- A role at the forefront of AI transformation for leading Swiss enterprises in financial services, insurance, and beyond
- Work withcutting-edgeAI technologies and innovative solutions that create real, measurable business value
- A dynamic, entrepreneurial work environment where your contributions directly drive company growth
- Supportive team culture that prioritises continuous learning, professional development, and personal growth
- A leadership ethos focused on empowering people, fostering collaboration, excellence, authenticity, and diversity
- Competitive salary package, 25 days of vacation, development budget, and a flat hierarchy
This job is found at InterviewStack.io
Skills
About Vector8
Vector8: Unlocking AI’s True Value. Elevate your business with cutting-edge AI solutions and seamless human-machine collaboration.