Berlin, Berlin, Germany
I am a data scientist and an AI/ML engineer specializing in natural language processing (NLP) and large language models (LLMs), with a current focus on developing advanced multi-agent systems. I have been writing Python code for ten years, with five years of professional experience. My deep proficiency in Python, combined with a strong foundation in AI and NLP principles, enables me to swiftly develop prototype solutions for complex problems. I bring extensive experience in applying a variety of machine learning techniques to address unique challenges across different domains. I thrive in tackling demanding tasks and excel in high-pressure environments. In addition to my technical expertise, I am a highly effective communicator. I have a knack for understanding complex, unfamiliar concepts and translating them into clear explanations for diverse audiences. I am experienced in leading technical teams of up to 40 people, charting strategic direction in high-entropy projects and managing client communications. I am known for my strong sense of responsibility and ownership, consistently ensuring that I deliver high-quality results on every project I undertake.
Working on Generative Engine Analytics.
- Designed, developed, deployed and maintained an advanced multi-agent patient analytics platform that empowers pharmaceutical clients with deeper, data-driven insights into patient journeys.
- Built custom evaluation and optimization workflows to measure and improve system performance.
In this role, I work with a prominent US-based big tech company, recognized as a key competitor in the AI industry. My responsibilities have expanded to include:
- Leading and overseeing multiple teams of data scientists to evaluate a large language model (LLM)-based agent and curate synthetic data to address the model's identified shortcomings.
- Providing strategic guidance to Pod Leads, ensuring alignment and consistency across all teams under my leadership.
- Analyzing evaluation results to uncover actionable insights and drive effective synthetic data curation strategies.
- Driving alignment on priorities and strategy through regular engagements with the client.
In this role, I am working with a prominent US-based big tech company, recognized as a key competitor in the AI industry. My key responsibilities include:
- Leading a team of data scientists to evaluate a large language model (LLM)-based agent and curate synthetic data to address the model's identified shortcomings.
- Analyzing evaluation results to extract actionable insights and formulate effective synthetic data curation strategies.
- Aligning with the client on project priorities and strategic direction to ensure optimal outcomes.
In this role, I am working with a prominent US-based big tech company, recognized as a key competitor in the AI industry. My responsibilities include evaluating a large language model (LLM)-based agent and curating synthetic data to address and mitigate the model's identified shortcomings.
I was part of the AI/ML team tasked with developing NLP modules that detect sensitive information in unstructured data to ensure GDPR compliance.
My responsibilities as a Data Scientist at Octimine were:
* Use state-of-the-art machine learning models to improve semantic search.
* Design and evaluate novel search strategies in a data-driven way.
* Work with large textual datasets exceeding 500k documents.
* Apply NLP techniques to legal and scientific text.
* Draw insights by analyzing large datasets.
* Query SQL and NoSQL databases.
* Work on remote servers over SSH.
* Maintain a well-documented code base.
* Work autonomously to plan and execute projects spanning several months.
* Communicate my insights to other team members in regular meetings.