Senior AI Engineer - LLM Focus
Scope Merge
Tunis, Tunisia · 1 year ago
Benefits: Remote Work
Job Type: Full-time
Description
At Scope Merge, we connect top Tunisian engineers with leading European companies. We offer long-term roles, international exposure, and above-market compensation. You’ll work on exciting international projects while we handle employment, payroll, and benefits.
We are hiring a Senior AI Engineer with deep expertise in Large Language Models (LLMs) and client-facing experience to join a fast-growing European startup applying cutting-edge AI to solve real-world problems. You’ll work on the development, fine-tuning, and deployment of LLMs in production.
## Tasks
* Design, train, and fine-tune LLMs for specific business use cases.
* Build and optimize inference pipelines for LLM-based applications, ensuring low latency and scalability.
* Evaluate and integrate open-source LLMs (e.g., LLaMA, Mistral, Falcon) or APIs (e.g., OpenAI, Anthropic) depending on use case and cost constraints.
* Collaborate with backend engineers to deploy models efficiently using tools like Triton, vLLM, or ONNX Runtime.
* Design and run evaluation frameworks (e.g., prompt quality, hallucination detection, latency).
* Monitor models in production and implement mechanisms for feedback loops and continuous improvement.
* Stay up to date with advances in generative AI, open-source LLM tooling, and fine-tuning strategies.
## Requirements
* 4+ years of experience in applied ML or NLP, with at least 1–2 years focused on LLMs.
* Strong knowledge of transformer architectures and experience working with model libraries like Hugging Face Transformers, LangChain, or LLM orchestration tools.
* Proven experience deploying LLMs into production (custom or API-based) and optimizing them for inference.
* Familiarity with techniques such as LoRA, QLoRA, PEFT, RAG, or prompt engineering.
* Solid Python skills, especially in the ML stack (e.g., PyTorch, TensorFlow, FastAPI for serving).
* Experience working with cloud infrastructure (AWS/GCP) and containerized deployments (Docker, Kubernetes).
* Bonus: Experience with data pipelines, vector databases (e.g., Weaviate, Pinecone, FAISS), or hybrid search.
## Benefits
* Work on international projects with top startups and tech companies.
* Collaborate with global teams and gain cross-border experience.
* Grow your skills through hands-on challenges and real-world impact.
* Modern offices in Lac 2, Tunis.
* Supportive sick leave policy that respects your health and well-being.
* Receive above-market salary and financial stability.
CV in English.
This job is found at InterviewStack.io
Skills
payroll, llms, apis, openai, onnx, generative ai, llm, nlp, transformers, langchain, rag, python, pytorch, tensorflow, fastapi, aws, gcp, docker, kubernetes, data pipelines, vector databases, large language models, prompt engineering, fine tuning