Senior AI/ML Ops Engineer - New York / Jersey City

PHTN.ai

United States

Posted 3 months ago

Benefits

Dental & Vision, Paid Time Off, 401k, Retirement Plan

Job Type

Full-time

Description

Key Responsibilities

AI/ML Model Operations

  • Deploy, manage, and monitor machine learning and AI models in production environments.
  • Implement model performance monitoring including accuracy, latency, and inference metrics.
  • Detect and mitigate concept drift, data drift, and model degradation.

AI Observability

  • Design and implement AI observability frameworks to track model behavior and reliability.
  • Monitor LLM outputs, hallucination rates, and response quality.
  • Implement logging, tracing, and evaluation pipelines for AI systems.

Agentic Systems Monitoring

  • Monitor agent-based AI workflows and autonomous systems.
  • Track agent actions, tool usage, decision paths, and execution outcomes.
  • Implement guardrails, safety monitoring, and failure detection for AI agents.

Data Pipeline Monitoring

  • Monitor and maintain data ingestion, transformation, and feature pipelines.
  • Ensure data quality, schema consistency, and pipeline reliability.
  • Detect and resolve pipeline failures and anomalies.

Infrastructure & Automation

  • Build and maintain CI/CD pipelines for ML models and AI systems.
  • Manage model versioning, experiment tracking, and reproducibility.
  • Automate monitoring alerts, incident response, and remediation.

Collaboration

  • Work closely with data scientists, ML engineers, platform teams, and product teams.
  • Support continuous improvement of AI system reliability and governance.

Compensation, Benefits and Duration

    Minimum Compensation: USD 50,000
    Maximum Compensation: USD 177,000
    Compensation is based on the candidate's actual experience and qualifications. The range above is a reasonable, good-faith estimate for the role.
    Medical, vision, and dental benefits, a 401k retirement plan, variable pay/incentives, paid time off, and paid holidays are available for full-time employees.
    This position is not available for independent contractors.
    No applications will be considered if received more than 120 days after the date of this post.

Skills

machine learning, monitoring, observability, llm, data pipelines, automation, ci/cd, data quality, incident response, data ingestion