San Francisco Bay Area
ædifico, ergo sum
- Scaling Gateway API Infrastructure and building a MultiCluster Inference Gateway. - Contributor to the Kubernetes OSS projects - Gateway API and Inference Extension.
- Integrated Gemini (Google's flagship LLM Model) into Assistant for several platforms (Pixel Tablet, GoogleTV). - Experimented with on-device fulfillment to run requests on TV's but didn't got far (Not recommend running LLM workloads on CPU.
- Worked on search engine optimization and improved enterprise clients' talent discovery and matching process. - Started technical book club.
- Built a computer vision solution with C++ OpenCV library to automate device inspection, utilizing rule-based algorithms and machine learning, saving 100+ human-month hours. - Added data replication and splet read-write workload distribution for on-prem database.
- Deployed machine learning models to detect and even prevent medical fraud for Medicare and Medicaid users.