InterviewStack.io LogoInterviewStack.io
Browse more AI Engineer jobs

AI Model Evaluation Specialist

Inizio Partners Corp

New York, New York, United States1 month ago
13 views4 saves0 applies

Prepare for this role


Job Type

full time

Description

Key Responsibilities:

  • Perform scoring and qualitative evaluations of
    LLM-generated responses across multiple use cases.
  • Develop and maintain scoring guidelines and rubrics to
    ensure consistency and objectivity.
  • Collaborate with data scientists, product managers, and
    engineering teams to align scoring with project goals.
  • Assist in the creation and labeling of high-quality
    evaluation datasets for prompt tuning or model fine-tuning.
  • Utilize NLP-based metrics and tools (e.g., ROUGE, BLEU,
    cosine similarity) for automated scoring support.
  • Document scoring patterns, common model errors, and
    improvement opportunities.
  • Contribute to prompt experimentation and help compare
    effectiveness of different prompt strategies.

Qualifications:

  • Prior experience with LLMs (e.g., GPT, Claude, LLaMA,
    etc.) or AI/NLP projects is highly preferred.
  • Strong analytical skills and attention to detail,
    especially in assessing language quality.
  • Familiarity with prompt engineering, generative AI, or
    conversational AI tools is a plus.
  • Hands-on experience with Python, Jupyter, or evaluation
    libraries (optional but desirable).
  • Experience working with evaluation frameworks or
    annotation tools (Label Studio, Prodigy, etc.) is a bonus.
  • Excellent written and verbal communication skills

This job is found at InterviewStack.io

Skills

llmsgptnlpgenerative aipythonprompt engineeringfine tuningexperimentation

About Inizio Partners Corp

Full-service recruitment agency specializing in executive search, permanent placements, and contingency staffing.

recruitment, staffing