Newark, California, United States
Performed daily evaluation of multimodal AI outputs (images, audio, and video) against prompts to assess model performance and alignment. Applied systematic judgment techniques to ensure consistency in quality, relevance, and coherence across diverse outputs. Maintained accuracy and efficiency in large-scale annotation workflows, contributing to reliable training data for model fine-tuning and optimization. Collaborated with cross-functional AI teams to identify trends, provide actionable feedback, and improve generative model capabilities. Utilized specialized labeling platforms and guidelines to support preference-based reinforcement learning and human feedback loops.