Review and improve AI-generated Arabic responses for accuracy, localization quality, factual correctness, reasoning quality, and response formatting
Evaluate AI-generated research trajectories including search queries, source selection, verification steps, and reasoning paths used to answer search-based prompts
Analyze Arabic and English responses for localization quality, contextual appropriateness, factual consistency, clarity, and cultural relevance for Arabic-speaking audiences
Assess the credibility and reliability of online sources used to support factual or time-sensitive claims
Detect unsupported claims, factual inaccuracies, weak reasoning, inefficient research workflows, misleading conclusions, and localization inconsistencies
Edit and refine AI-generated answers to improve factual accuracy, structure, localization quality, readability, and adherence to project guidelines
Optimize search efficiency by evaluating query quality, verification strategies, evidence gathering approaches, and research-step effectiveness
Provide structured written feedback explaining reasoning flaws, factual corrections, source-quality concerns, and response improvement opportunities
Support AI model improvement through annotation workflows, multilingual evaluation tasks, fact-checking reviews, response ranking, and quality assurance processes
Contribute to AI training datasets that improve multilingual search reasoning, factual verification, localization quality, and web-research performance in Arabic
Requirements
Education: Bachelor's degree preferred in Linguistics, Communications, Journalism, Translation, Research, Information Science, or a related field; equivalent professional experience may also qualify
Native or near-native Arabic proficiency with excellent writing, editing, localization, and comprehension skills
Strong English proficiency (written and spoken) for guideline interpretation, bilingual evaluation tasks, and structured feedback writing
Previous experience in AI training, data annotation, localization QA, editorial review, journalism, research, fact-checking, or content evaluation workflows preferred
Strong understanding of search-based reasoning, factual verification, online research workflows, and source credibility assessment
Ability to evaluate AI-generated search trajectories, including query effectiveness, verification quality, evidence reliability, and reasoning consistency
Excellent analytical thinking and attention to detail when identifying factual inaccuracies, unsupported claims, misleading reasoning, or localization issues
Comfortable working with structured rubrics, annotation guidelines, response evaluation systems, and high-volume contractor workflows
Strong written communication skills with the ability to provide concise, actionable, and well-structured feedback
Experience evaluating multilingual content and culturally localized responses for Arabic-speaking audiences is strongly preferred
Familiarity with AI systems and tools such as ChatGPT, Gemini, Perplexity, and Claude preferred
Experience annotating or reviewing research/search trajectories is strongly preferred
Reliable remote work practices, independent time management, and consistency across quality-review workflows required