Emprego
Meus anúncios
Meus alertas email de emprego
Fazer login
Encontrar um emprego Fichas de empresas
Procurar

Llm quality & model response analyst (remote)

Braga
Odixcity Consulting
Modelista
Anunciada dia 4 junho
Descrição

Job Title: LLM Evaluator (Model Response Analyst)Location: Remote (Worldwide)Job Summary: We are seeking a detail-oriented and analytical LLM Evaluator to assess, analyze, and improve the performance of large language models (LLMs).
In this role, you will evaluate AI-generated content for accuracy, coherence, factual reliability, bias, safety, and alignment with defined guidelines.ResponsibilitiesEvaluate and rank model-generated text based on complex rubrics covering dimensions such as factuality, coherence, safety, instruction- following, and creativity.Review multiple model responses to the same prompt and determine which output a human would prefer, providing justifications for your choices.Provide clear, concise feedback to the modeling and training teams regarding recurring failure models observed during evaluation sessions.Attempt to "break" the model by crafting prompts designed to elicit biased, harmful, or insecure outputs to help patch safety vulnerabilities.Collaborate with the quality assurance team to suggest improvements to evaluation guidelines when you encounter ambiguous or unclassifiable edge cases.Participate in regular "cross-checking" sessions with other evaluators to calibrate scoring standards and ensure inter-rater reliability across the global team.When a model underperforms, dig deeper than the surface score to hypothesize "why" the model made a specific error (e.G., training data vs. prompt misinterpretation).
Identify and flag novel or unexpected model behaviors to the research team, contributing to a living library of unique model outputs and failure modes.RequirementsMinimum of 2 years of professional experience in a relevant field such as; Computational Linguistics, Data Analysis, Technical Writing, Quality Assurance (specifically for NLP/AI), or cognitive science.Bachelor's degree in Computer Science, or a relating field.Deep understanding of how-to craft prompts to elicit specific behaviors and test model limits.Ability to look at a text output and explain "why" it is "good" or "bad" based on logic, tone, factuality, and instruction adherence.Experience working with Reinforcement Learning from Human Feedback (RLHF) data collection.Proven experience monitoring and improving consistency among evaluation teams.
Ability to analyze IAA scores and conduct calibration sessions to align judgement.Experience sourcing, cleaning, and annotating datasets specifically for the fine-tuning or evaluating LLMs.
Understanding of data distribution and its impact on model performance.Familiarity with A/B testing concepts applied to AI.
Ability to help design experiments to test if a new model version is truly "better" than the previous one.
#J-*****-Ljbffr

Se candidatar
Criar um alerta
Alerta activado
Salva
Salvar
Oferta parecida
Bim modeler
Braga
Quadrante
Modelista
Oferta parecida
Bim modeler - industrial hvac
Braga
Quadrante
Modelista
Oferta parecida
Bim modeler
Braga
Quadrante
Modelista
Ofertas parecidas
Emprego Arquitectura em Braga
Emprego Braga
Emprego Distrito de Braga
Página principal > Emprego > Emprego Arquitectura > Emprego Modelista > Emprego Modelista em Braga > Llm Quality & Model Response Analyst (Remote)

Jobijoba Portugal

Encontre ofertas

  • Ofertas de emprego por função
  • Pesquisa de ofertas de emprego por sector
  • Empregos por empresas
  • Empregos por localização

Contacto / Parceria

  • Entre em contacto
  • Publique as suas ofertas no site Jobijoba

Menções legais - Menções legais e termos de utilização - Política de dados - Gerir os meus cookies - Acessibilidade: Não conforme

© 2026 Jobijoba Portugal - Todos os direitos reservados

Se candidatar
Criar um alerta
Alerta activado
Salva
Salvar