New York City Metropolitan Area
Interested in high-performance systems, the Python data science ecosystem, distributed ML infrastructure, and applied machine learning.
Alkera turns your entire data pipeline into a single source of truth that AI agents can manipulate. Every query, transformation, and model always gets the full context. Unified context and lineage for your entire data pipeline, from your database to transformations.
Developed a multi-modal biology reasoning model with latent reasoning. *Presented talk for proceeding paper at ISMB 2024* - Designed a Transformer-based multi-modal model using PyTorch Geometric for biomedical text and molecular data. Experimented with innovative improvements to text and code generation metrics to enhance correlation with human evaluation.
Designed FUSE filesystem for performantly serving from caches. Created efficient Python tooling for analyzing performance across distributed compute jobs.
Led the development of a card delivery platform to increase usage of card payments across entire product and support build-out of features necessary for large enterprise customers.
One of the winners of an internal hackathon with a project involving RAG-based LLM to enrich inbound leads.
Developed and maintained insurance billing portals, using Vue.js/Nuxt and Java/Spring Boot. Constructed Docker images for CI/CD with Jenkins & OpenShift to reduce resource usage.