Search - Workchat - Applied Data Scientist II Job at Elastic, United States

Ri96UjZtVkM4eitXVXMvdlkzWnZxRlllWkE9PQ==
  • Elastic
  • United States

Job Description

What is The Role

The Search Conversational Experiences team builds Elastic’s new conversational (agentic) platform that lets customers chat with their own data in Elasticsearch. We own the quality layer for RAG, agents and tools, retrieval/citations, streaming, memory, and—crucially—the evaluation signals that turn open-ended questions into grounded, reliable answers. As a  Data Scientist , you’ll be part of a cross-functional team (backend, DS, PM, UX) driving chat quality end-to-end: designing and running evaluation pipelines, improving prompts and tool behaviors, and turning measurements into product decisions that customers can feel.se

You’ll help tackle frontier problems—folding RAG and vector search into an agent’s knowledge base, dynamically enriching model context to boost groundedness, shaping agent routing and tool selection policies, lighting up agent-driven visualizations on top of Elasticsearch data, and exploring multimodality and reasoning strategies where they truly move the needle. This is an applied role: you will prototype, evaluate, and partner with engineers to ship.

What you will be doing



  • Own well scoped pieces of the offline and online evaluation pipeline for agent workflows: retrieval coverage, reranking quality, reasoning traces, tool selection accuracy, citation integrity, and final answer helpfulness and faithfulness

  • Calibrate and validate LLM-as-judge rubrics against human labels, track agreement with statistics, and add periodic checks to prevent drift

  • Instrument agent runs with traces so you can localize errors to retrieval, reasoning, tool execution, or grounding, then contribute CI checks that block merges on regressions

  • Translate evaluation readouts into product calls such as model choice, routing policy, tool gating thresholds, prompt and chunking updates, and agent customization for Elastic use cases

  • Collaborate with backend engineers on contracts for ES|QL, citations, and telemetry schemas, and with PM and UX to land findings in shipped features

  • Share outcomes through clear docs, notebooks, and PRs, and contribute utilities that make evaluation faster and more reproducible for the team

What you will bring



  • 3 to 5 years in applied DS or ML with production ownership, including at least 1 to 2 years focused on evaluating LLM or agent workflows in shipped systems

  • Proven experience designing and running stepwise evaluations for agent pipelines: retrieval coverage, reranking quality, reasoning traces, tool selection accuracy, citation grounding, and final answer helpfulness and faithfulness

  • Golden set hygiene: stratified dataset design, leakage controls, reviewer guidelines, inter-rater checks, and versioned labels

  • Fluent with offline IR metrics and guardrails: Recall at k, nDCG, MRR, groundedness or citation support, plus latency and cost tracking; can move from offline gains to online A or B tests

  • Telemetry and traces for agent runs that localize failures to retrieval, reasoning, tool execution, or grounding; ability to add CI quality gates that block merges on regressions

  • Practical Elasticsearch experience or a similar search system; ES|QL familiarity is a plus

  • Strong written communication and async collaboration habits in a distributed team

Job Tags

Full time,

Similar Jobs

D.R. Horton, Inc

Junior Sales Representative - Savannah Job at D.R. Horton, Inc

 ...construction and sale of high-quality homes designed principally for the entry-level and first time move-up markets. The Company also provides...  ...We offer an excellent benefits package including: ~ Medical, Dental and Vision ~401(K)~ Employee Stock Purchase Plan... 

The Permanente Medical Group, Inc.

Don't See a Posting for Your Medical Specialty - Check Our Website: Job at The Permanente Medical Group, Inc.

 ...Job Description Kaiser Permanente / The Permanente Medical Group - Northern California FULL-TIME, PART-TIME, and PER-DIEM PHYSICIAN OPPORTUNITIES in a VARIETY OF SPECIALTIES in NORTHERN CALIFORNIA! Please visit our website for all our available opportunities... 

Tenet-AZ Region

Level II - Pharmacy Technician Job at Tenet-AZ Region

 ...Job Description Tenet-AZ Region is seeking a Pharmacy Technician Level II for a job in Tucson, Arizona. Job Description & Requirements ~ Specialty: Pharmacy Technician ~ Discipline: Allied Health Professional ~ Duration: Ongoing ~40 hours per week ~... 

ULG Staffing

Fire Watch Job at ULG Staffing

 ...Job Description Job Description NSC Staffing is hiring Fire Watch personnel for upcoming long-term shipyard projects in Jacksonville. If you have fire watch experience and are ready to work in a fast-paced marine environment, we want to hear from you. Position... 

Central Maintenance & Service

Evening Office Cleaner Job at Central Maintenance & Service

 ...ensuring safe and healthy facilities for our customers. We want you to join our locally owned and operated team! We are looking for a Cleaner in Warrendale , PA, who is dedicated, consistent, organized, and proud of the work they produce. Our competitive wages and...