Dutch AI Quality Analyst (Personalization)
Careerflow
Software Engineering, IT, Data Science, Quality Assurance
Netherlands
Role Overview
We are hiring Dutch-speaking AI Quality Analysts to evaluate a new personalization feature for Gemini. In this role, you will assess how effectively the AI uses information from past Gemini conversations, Gmail, Google Search, and YouTube activity to generate more relevant and helpful responses.
This role combines creativity, critical thinking, and strong language skills. You will create realistic prompts based on personal context and evaluate the AI’s responses for quality, relevance, grounding, and usefulness.
Responsibilities
Design and execute multi-turn prompts (1 to 5 turns) based on real-life personal context
Evaluate AI responses for relevance, personalization quality, and helpfulness
Identify hallucinations, unsupported claims, and incorrect inferences
Review whether personal data is integrated naturally into responses
Compare responses side by side and identify the stronger output
Write clear rationales explaining scoring decisions
Validate debug information and data source usage
Maintain data hygiene by deleting completed evaluation chats
Collaborate with remote global teams and meet quality benchmarks
Required Qualifications
Fluent in Dutch with strong reading and writing skills
Good English communication skills
Willing to use your primary personal Google account (not a test account) and enable personal data sources for evaluation
Strong analytical thinking and judgment
Excellent written communication and attention to detail
Comfortable working independently in a remote environment
Desktop or laptop with stable internet connection
Available for a minimum of 20 hours per week
Able to provide 2 hours of overlap with the PST time zone
Preferred Qualifications
Experience in data annotation, AI quality review, content moderation, localization, or similar roles
BS/BA degree or equivalent experience in Linguistics, Journalism, Policy, Ethics, Computer Science, or related fields
Familiarity with LLMs, AI tools, or prompt engineering
Engagement Details
Contract Length: 1+ Month
Schedule: Part-time to full-time, depending on project needs
Immediate start
Global 24-hour operations environment
Hiring Process
Application Review
Mandatory Assessment (must complete within 24 hours)
Language Vetting
Final Shortlist and Pre-Onboarding Discussion