InterviewStack.io LogoInterviewStack.io
Browse more Data Engineer jobs

Agentic AI Data Engineer

EXL Service

United States$150,000+1 month ago
15 views3 saves2 applies

Prepare for this role


Job Type

full time

Description

Key Responsibilities

  • Design and implement agentic AI systems that autonomously orchestrate data workflows and decision pipelines
  • Build scalable data pipelines for structured and unstructured data (batch + real-time)
  • Develop and manage LLM-powered applications using retrieval-augmented generation (RAG), tool use, and multi-agent frameworks
  • Integrate AWS AI/ML services into production-grade architectures
  • Develop and optimize data lakes, warehouses, and lakehouse architectures
  • Build APIs and microservices to expose AI/ML capabilities
  • Ensure data quality, governance, and security across pipelines
  • Collaborate with data scientists, ML engineers, and product teams to deploy AI solutions
  • Implement monitoring, logging, and observability for AI agents and pipelines
  • Optimize cost and performance of cloud-based AI workloads

Required Technical Skills

Cloud & AWS Ecosystem

  • Strong experience with AWS services, including:
    • Amazon S3, Glue, Lambda, Step Functions
    • Amazon Redshift / Athena
    • Amazon SageMaker (training, deployment, pipelines)
    • Amazon Bedrock (foundation models, agents, knowledge bases)

AI/ML & Agentic Systems

  • Experience with LLMs and generative AI systems
  • Hands-on with agent frameworks (e.g., multi-agent orchestration, tool calling, planning systems)
  • Familiarity with AgentCore / agent orchestration platforms
  • Understanding of RAG architectures, embeddings, and vector databases
  • Experience with model deployment, inference optimization, and prompt engineering

Data Engineering

  • Strong proficiency in Python and SQL
  • Experience with ETL/ELT tools and frameworks
  • Distributed data processing (Spark, PySpark, or similar)
  • Streaming technologies (Kafka, Kinesis, or similar)
  • Data modeling and schema design

Data & AI Infrastructure

  • Experience with vector databases (e.g., Pinecone, FAISS, OpenSearch)
  • Knowledge of data lakehouse architectures (Delta Lake, Iceberg, Hudi)
  • Containerization (Docker) and orchestration (Kubernetes)
  • CI/CD for ML and data pipelines

Key Responsibilities

  • Design and implement agentic AI systems that autonomously orchestrate data workflows and decision pipelines
  • Build scalable data pipelines for structured and unstructured data (batch + real-time)
  • Develop and manage LLM-powered applications using retrieval-augmented generation (RAG), tool use, and multi-agent frameworks
  • Integrate AWS AI/ML services into production-grade architectures
  • Develop and optimize data lakes, warehouses, and lakehouse architectures
  • Build APIs and microservices to expose AI/ML capabilities
  • Ensure data quality, governance, and security across pipelines
  • Collaborate with data scientists, ML engineers, and product teams to deploy AI solutions
  • Implement monitoring, logging, and observability for AI agents and pipelines
  • Optimize cost and performance of cloud-based AI workloads
 

Required Technical Skills

Cloud & AWS Ecosystem

  • Strong experience with AWS services, including:
    • Amazon S3, Glue, Lambda, Step Functions
    • Amazon Redshift / Athena
    • Amazon SageMaker (training, deployment, pipelines)
    • Amazon Bedrock (foundation models, agents, knowledge bases)

AI/ML & Agentic Systems

  • Experience with LLMs and generative AI systems
  • Hands-on with agent frameworks (e.g., multi-agent orchestration, tool calling, planning systems)
  • Familiarity with AgentCore / agent orchestration platforms
  • Understanding of RAG architectures, embeddings, and vector databases
  • Experience with model deployment, inference optimization, and prompt engineering

Data Engineering

  • Strong proficiency in Python and SQL
  • Experience with ETL/ELT tools and frameworks
  • Distributed data processing (Spark, PySpark, or similar)
  • Streaming technologies (Kafka, Kinesis, or similar)
  • Data modeling and schema design

Data & AI Infrastructure

  • Experience with vector databases (e.g., Pinecone, FAISS, OpenSearch)
  • Knowledge of data lakehouse architectures (Delta Lake, Iceberg, Hudi)
  • Containerization (Docker) and orchestration (Kubernetes)
  • CI/CD for ML and data pipelines
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field
  • 4+ years of experience in data engineering or ML engineering
  • Hands-on experience with production-grade AI/ML systems

 

This position may pay a base salary of up to $150k per year based on skills and experience. 

This job is found at InterviewStack.io

Skills

data pipelinesragawsapismicroservicesmonitoringobservabilitys3lambdaredshiftsagemakerbedrockllmsgenerative aiembeddingsvector databasespythonsqletlsparkpysparkkafkadata modelingdelta lakeicebergcontainerizationdockerkubernetesci/cddata qualitymodel deploymentprompt engineering

About EXL Service

EXL Service is a global analytics and digital solutions company that partners with businesses to improve operational processes and customer experiences. They provide services in data analytics, automation, and digital transformation across various industries including insurance, healthcare, and finance.

large companyconsulting, data_analyticspublicWebsite