Dutch AI Quality Analyst (Personalization)

Careerflow

Software Engineering, IT, Data Science, Quality Assurance

Netherlands

Posted on May 19, 2026

Role Overview

We are hiring Dutch-speaking AI Quality Analysts to evaluate a new personalization feature for Gemini. In this role, you will assess how effectively the AI uses information from past Gemini conversations, Gmail, Google Search, and YouTube activity to generate more relevant and helpful responses.

This role combines creativity, critical thinking, and strong language skills. You will create realistic prompts based on personal context and evaluate the AI’s responses for quality, relevance, grounding, and usefulness.

Responsibilities

  • Design and execute multi-turn prompts (1 to 5 turns) based on real-life personal context

  • Evaluate AI responses for relevance, personalization quality, and helpfulness

  • Identify hallucinations, unsupported claims, and incorrect inferences

  • Review whether personal data is integrated naturally into responses

  • Compare side-by-side responses and rank the stronger output

  • Write clear rationales explaining scoring decisions

  • Validate debug information and data source usage

  • Maintain data hygiene by deleting completed evaluation chats

  • Collaborate with remote global teams and meet quality benchmarks

Required Qualifications

  • Fluent in Dutch with strong reading and writing skills

  • Good English communication skills

  • Willing to use your primary personal Google account (not a test account) and enable personal data sources for evaluation

  • Strong analytical thinking and judgment

  • Excellent written communication and attention to detail

  • Comfortable working independently in a remote environment

  • Desktop or laptop with a stable internet connection

  • Available for a minimum of 20 hours per week

  • Able to provide 2 hours of overlap with the PST time zone

Preferred Qualifications

  • Experience in data annotation, AI quality review, content moderation, localization, or similar roles

  • BS/BA degree or equivalent experience in Linguistics, Journalism, Policy, Ethics, Computer Science, or related fields

  • Familiarity with LLMs, AI tools, or prompt engineering

Engagement Details

  • Contract Length: 1+ Month

  • Schedule: Part-time to full-time, depending on project needs

  • Immediate start

  • Global 24-hour operations environment

Hiring Process

  1. Application Review

  2. Mandatory Assessment (must complete within 24 hours)

  3. Language Vetting

  4. Final Shortlist and Pre-Onboarding Discussion