Senior AI/ML Engineer, France

Vector8

France, France (3 months ago)

Benefits

Flexible Hours, Remote Work, Paid Time Off, Retirement Plan, Wellness Program, Home Office Stipend

Job Type

Full-time

Description

As a Senior AI/ML Engineer at vector8, you will design, implement, and deploy AI solutions that bridge the gap between research and production. Your work will focus on integrating and fine-tuning AI models, optimizing model performance, and ensuring enterprise-grade reliability, security, and scalability.

This is a hands-on engineering role where you will:

  • Develop and optimize LLM- and VLM-powered solutions for enterprise use cases
  • Develop and optimize TTS, STT, and ML models
  • Apply software engineering best practices (testing, CI/CD, modular design, documentation)
  • Collaborate with cross-functional teams (data engineers, MLOps, cloud architects, and business stakeholders)
  • Solve real-world enterprise challenges (security, compliance, legacy system integration)
  • Own the full lifecycle of AI models, from data exploration to production monitoring

You will work closely with vector8’s Engineers and Project Managers to co-design AI foundations that enable organizations to scale AI from individual use cases to enterprise-wide capabilities.

The role is primarily based in Paris, with occasional travel to client sites and collaboration with teams across Europe.

Responsibilities

1. End-to-End Model Development

  • Design, implement, and deploy distributed, high-volume, high-performance, low-latency machine learning solutions, with a focus on GenAI models, and especially LLM integrations and API-driven architectures
  • Take ownership of your models throughout their entire lifecycle:
    • Data exploration and cleaning to build reproducible, versioned datasets
    • State-of-the-art research to identify the best architectures for the problem (e.g., transformers, RAG, fine-tuning)
    • Implementation, training, and optimization in reproducible environments
    • Deployment, monitoring, and maintenance in production
  • Optimize models for performance, latency, and cost efficiency, especially in LLM serving and inference

2. Software Engineering for AI

  • Write clean, modular, and well-documented code in Python (FastAPI, Pydantic, asyncio)
  • Apply best practices in:
    • Testing (unit, integration, end-to-end)
    • CI/CD (GitHub Actions, GitLab CI, ArgoCD)
    • Observability (logging, monitoring, tracing)
  • Ensure security and compliance (data protection, access controls, encryption)
  • Integrate models and code into CI/CD pipelines for seamless deployment

3. AI & ML Integration & API Development

  • Design and implement AI-powered solutions that integrate with APIs, microservices, and event-driven architectures
  • Develop and optimize AI pipelines for:
    • Dataset cleaning, preprocessing, and model training
    • Fine-tuning (domain adaptation, instruction tuning)
    • Retrieval-Augmented Generation (RAG) (vector databases, semantic search)
    • Prompt engineering (optimizing inputs for performance, cost, and accuracy)
  • Model evaluation (benchmarking, bias detection, drift analysis)
  • Build scalable, secure, and cost-efficient serving infrastructure (e.g., FastAPI, vLLM)
  • Debug and optimize performance (latency, throughput, token efficiency for Transformer-based architectures)

4. Enterprise AI & MLOps

  • Deploy and monitor AI models in production
  • Design and implement MLOps pipelines for:
    • Model training, fine-tuning, and evaluation
    • Model versioning and lineage tracking
    • A/B testing and canary deployments
  • Ensure scalability and reliability (auto-scaling, fault tolerance, disaster recovery)
  • Collaborate with data engineers to build data pipelines (batch, streaming, real-time)

5. Collaboration & Technical Leadership

  • Work closely with product owners, DevOps, and quality assurance in an agile, cross-functional team
  • Mentor junior engineers and promote best practices in AI/ML and software engineering
  • Translate product requirements into technical solutions and architectural decisions
  • Document architectures, decisions, and best practices for internal and client-facing use
  • Develop relationships with internal and external stakeholders, including clients and partners

6. Innovation & Continuous Improvement

  • Stay ahead of the latest AI and ML architectures (transformers, Mixture of Experts, sparse attention).
  • Experiment with cutting-edge techniques (quantization, distillation, speculative decoding).
  • Evaluate and benchmark open-source and proprietary models (Llama, Mistral, Mixtral, GPT-4, Claude).
  • Bring your own ideas through vector8’s ideation process.
  • Contribute to vector8’s AI accelerators (reusable components for common industry problems).
  • Embrace a strategic and continuous improvement mentality to drive innovation.

Requirements

  • 5+ years of experience in AI/ML engineering, software development, or a related field
  • Expertise in LLM architectures and training methodologies:
    • Transformers, attention mechanisms, fine-tuning, RAG, quantization
    • Prompt engineering, model evaluation, bias detection
  • Strong knowledge of machine learning architectures: fully connected, CNN, LSTM, transformers and classical ML models
  • Strong software engineering skills:
    • Proficient in Python (FastAPI, Pydantic, asyncio, type hints)
    • Experience with API development
    • Familiarity with modern toolchains (Docker, Kubernetes, Terraform)
  • Hands-on experience with LLM integrations:
    • LLM providers
    • Vector databases (Pinecone, Weaviate, Milvus)
    • Model serving (vLLM, TGI, KServe)
  • Experience with MLOps and production deployments
  • Understanding of enterprise challenges:
    • Security, compliance, scalability, cost optimization
  • Experience with relational and non-relational databases.
  • Strong problem-solving and debugging skills.
  • Excellent communication and collaboration skills (fluent in English; German is a strong plus).
  • Bachelor’s or Master’s degree in Computer Science, Mathematics, Physics, or a related field.
  • Experience with multi-cloud environments (AWS, Azure, GCP).
  • Experience with code optimization (e.g., model quantization, parallelization).

Benefits

  • A great compensation package with competitive benefits, including:
  • Flexible working hours, including remote work options (hybrid model).
  • 25 days of paid vacation per year, plus additional flex days.
  • Private health and life insurance and a pension plan for long-term security.
  • Home office allowance and lunch vouchers to enjoy meals on us.
  • Discounted fitness memberships to stay active.
  • 50% reimbursement of public transport costs.
  • Free coffee, fruit, and snacks to keep you fueled.
  • Access to the latest technologies (LangDock, Claude Code for developers).
  • Grants for training, coaching, and conferences to support your continuous learning.
  • Opportunities to attend industry events and represent vector8 as a thought leader.
  • A less-formal work environment where authenticity and collaboration thrive.
  • A diverse and inclusive team that values curiosity, ownership, and innovation.

Skills

ci/cd, mlops, monitoring, transformers, rag, llm, python, fastapi, asyncio, github actions, gitlab, argocd, observability, encryption, apis, microservices, vector databases, a/b testing, scalability, data pipelines, agile, machine learning, docker, kubernetes, terraform, debugging, aws, azure, gcp, model evaluation, model training, prompt engineering, fine tuning, system integration, relational databases, disaster recovery, quality assurance, api development

About Vector8

Vector8: Unlocking AI’s True Value. Elevate your business with cutting-edge AI solutions and seamless human-machine collaboration.

software, insurance