Bryan Dinh

Conversational AI & LLM Data Analyst

Los Angeles Metropolitan Area

About

I am experienced in data annotation & labelling, QA annotation, data entry, and audio transcription. As a heritage speaker of Vietnamese, I have researched, designed, and tested the efficacy of my own generative LLM-based chatbots for heritage language learning for my culminating Master's project. My linguistic interests include conversation design, voice-user interface design, and sociolinguistics.

Experience

Data Labeling Analyst II @ Meta at Tundra Technical Solutions
Jun 2024 - Present · 2 yrs 1 mo
• Label, quality-check, and audit datasets to improve model accuracy for Meta’s commerce AI chatbot • Apply linguistic expertise and conversation design practices to evaluate conversational data in English, Vietnamese, and Tagalog • Achieved and maintained labeling accuracy, ensuring high-quality training data for AI models • Collaborate with engineers and researchers to refine labeling guidelines and boost model performance • Flag edge cases, patterns, and ambiguities to inform product and research teams • Support cross-functional teams in scaling labeling processes and optimizing AI/ML workflows • Deliver data-driven feedback that reduce annotation ambiguity and increase model training efficiency
LLM Data Annotator at e2f, inc.
Mar 2024 - May 2024 · 3 mos
• Assess prompt clarity, ambiguity, and potential harm • Determine if prompts are seeking information or time-sensitive • Utilize enhanced fact-checking to ensure AI-generated response accuracy • Evaluate responses for naturalness, comprehensiveness, and adherence to guidelines
AI Data Annotator at RWS Group
Aug 2023 - Oct 2023 · 3 mos
• Employ linguistic expertise to score and improve upon existing prompts (questions) and AI-generated responses across a range of topics in English for client's LLM • Reference online resources to rephrase and write cohesive, accurate, responsive, and sometimes empathetic answers to prompts • Test (QA) and label other annotators' work based on guidelines • Correct incorrect prompt responses using natural language and examples • Evaluate AI model responses to prompts through scoring, ranking, A/B testing, etc.
Instructional Student Assistant at California State University, Long Beach
Sep 2021 - Dec 2021 · 4 mos
• Provided instructional support for the Department of Advanced Studies in Education and Counseling • Graded assignments and provided feedback according to provided rubrics • Created annotated bibliography on emerging instructional materials for faculty
Linguistic Transcriptionist at UCLA Henry Samueli School of Engineering and Applied Science
May 2019 - Jun 2020 · 1 yr 2 mos
• Employed knowledge of acoustic and articulatory phonetics to accurately transcribe audio files • Collaborated with fellow transcribers to extrapolate linguistic patterns from data • Interpreted waveforms and acoustics via Praat software • Labelled and sorted transcribed audio via Microsoft Office Suite • Followed the International Phonetic Alphabet (IPA) for proper transcription of detailed utterances • Results in improving development of automatic speech recognition software (ASR) for children's speech pathology assessments