Braşov, Romania
I work on AI systems. I’ve been doing it since before the it became mainstream. I’m mostly interested in what happens after the demo: data, scale, infrastructure, reliability, and the ways things fail in production. I write The AI Merge, a newsletter explaining AI foundations and systems without the fluff. Currently building an AI Systems course for engineers who want to understand the full end-to-end stack. 🗞️ https://read.theaimerge.com
Writing code, saving lives.
Helping 9.5k+ engineers design, build, and deploy production AI systems.
✅ Actively involved in all stages of ML project lifecycle. 𝐄𝐯𝐞𝐫𝐬𝐞𝐞𝐧 is an AI company that empowers end-to-end visibility across the entire supply chain for the biggest retail clients in the world, aiming to reduce shrink and increase perpetual inventory accuracy using it’s VisualAI platform. ☑ Worked on a theft-prevention system that was showcased at 𝐍𝐞𝐰𝐘𝐨𝐫𝐤 𝐍𝐑𝐅2023, and at numerous demo-sessions that 𝐚𝐭𝐭𝐫𝐚𝐜𝐭𝐞𝐝 3 𝐧𝐞𝐰 𝐜𝐥𝐢𝐞𝐧𝐭𝐬 into wanting the product for their stores. ☑ 𝐇𝐚𝐧𝐝𝐥𝐢𝐧𝐠 project/tooling 𝐫𝐚𝐦𝐩-𝐮𝐩 phase for new members within the team, successfully reducing the period from 1 𝐦𝐨𝐧𝐭𝐡 𝐭𝐨 𝐥𝐞𝐬𝐬 𝐭𝐡𝐚𝐧 1 𝐰𝐞𝐞𝐤. ☑ Handled 𝐞𝐧𝐝-𝐭𝐨-𝐞𝐧𝐝 engineering and 𝐝𝐞𝐩𝐥𝐨𝐲𝐦𝐞𝐧𝐭 to 5 𝐩𝐢𝐥𝐨𝐭 𝐬𝐭𝐨𝐫𝐞𝐬. ☑ Improved the 𝐢𝐭𝐞𝐫𝐚𝐭𝐢𝐨𝐧 𝐬𝐩𝐞𝐞𝐝 within the team by 𝐚𝐮𝐭𝐨𝐦𝐚𝐭𝐢𝐧𝐠 𝐌𝐋𝐎𝐩𝐬 practices such as Structured Model Experimenting, Resources Monitoring, Model Registry, CI/CD and Jira Integrations. ☑ 𝐎𝐫𝐠𝐚𝐧𝐢𝐳𝐞𝐝 and established a 𝐝𝐨𝐜𝐮𝐦𝐞𝐧𝐭𝐚𝐭𝐢𝐨𝐧 𝐬𝐭𝐫𝐮𝐜𝐭𝐮𝐫𝐞 which enabled us to 𝐫𝐞𝐚𝐜𝐡 > 70% 𝐜𝐨𝐯𝐞𝐫𝐚𝐠𝐞 both on code,tools documentation and custom models tailored for various clients. ☑ Worked on a live camera feed processing pipeline that ran 3 𝐝𝐞𝐞𝐩 𝐥𝐞𝐚𝐫𝐧𝐢𝐧𝐠 𝐦𝐨𝐝𝐮𝐥𝐞𝐬 while achieving 24-25 FPS on a 4K video stream with high QPS rate. ☑ Implemented a 𝐦𝐨𝐧𝐢𝐭𝐨𝐫𝐢𝐧𝐠 𝐩𝐢𝐩𝐞𝐥𝐢𝐧𝐞 𝐟𝐨𝐫 𝐨𝐮𝐫 𝐞𝐝𝐠𝐞 𝐝𝐞𝐩𝐥𝐨𝐲𝐦𝐞𝐧𝐭𝐬 that saved time with hardware requirements and optimisations using Prometheus, Grafana, Kibana and ES. ☑ Actively 𝐩𝐚𝐫𝐭𝐢𝐜𝐢𝐩𝐚𝐭𝐢𝐧𝐠 𝐢𝐧 𝐭𝐡𝐞 𝐥𝐢𝐟𝐞𝐜𝐲𝐜𝐥𝐞 of the current project, providing support to QA, Development and Deployment processes. ☑ 𝐀𝐮𝐭𝐨𝐦𝐚𝐭𝐞𝐝 the multi-gpu 𝐦𝐨𝐝𝐞𝐥 𝐜𝐨𝐧𝐯𝐞𝐫𝐬𝐢𝐨𝐧 and integration with high-throughput Model Serving technologies like Nvidia Triton Server.
Sharing insights and actionable advice towards building production-ready ML systems.
✅ Contributed to a free hands-on course on end-to-end LLMops. Worked alongside 𝐏𝐚𝐮 𝐋𝐚𝐛𝐚𝐫𝐭𝐚 𝐁𝐚𝐣𝐨 𝐚𝐧𝐝 𝐏𝐚𝐮𝐥 𝐈𝐮𝐬𝐳𝐭𝐢𝐧 into developing this end-to-end course to help students 𝐛𝐮𝐢𝐥𝐝 𝐚 𝐜𝐨𝐦𝐩𝐥𝐞𝐭𝐞 𝐋𝐋𝐌 𝐚𝐩𝐩𝐥𝐢𝐜𝐚𝐭𝐢𝐨𝐧 while following the 𝐛𝐞𝐬𝐭 𝐩𝐫𝐚𝐜𝐭𝐢𝐜𝐞𝐬 both in software development, model deployment and MLOps integrations. ☑ Implemented the EDA stage on LLM fine-tuning dataset. ☑ Implemented the LangChain logic and modules that connected the QDrant vector database to inference pipeline and model registry. ☑ Worked on the training/fine-tuning pipeline and prompt monitoring using CometML. ☑ Provided support and approved Pull Requests. ☑ Implemented and documented best practices on Prompt Templates and LLM workflow. Check the course here: http://bit.ly/3Rxjyvp