Rodolfo Kohn

Principal Engineer - Embedded systems, Kubernetes, distributed systems, cloud systems

Portland, Oregon, United States

About

Rodolfo is an entrepreneur, software architect, and developer whose goal is to advance the Site Reliability state of the art from observability tools and manual investigation to automated full coverage, detection, and root-cause identification. He has extensive experience architecting, developing, and operating robust, highly available, reliable, monitoring-aware, and horizontally scalable distributed systems. He has worked in software engineering for 25+ years and has led technical teams, products, new processes, and new practices at companies like Accenture, Motorola, Intel, and McAfee, with outstanding achievements in delivered products and in team (vs. individual) success.

On the technical side, he has devoted most of his career to ensuring that cloud and on-prem systems, microservices, streaming, and data storage scale horizontally, are highly available, are designed for failure, and perform well enough to deliver an excellent user experience. Rodolfo has also worked with different teams to find and solve the most difficult system issues that may jeopardize the business.

He is above all a team player who strives for team and company success, not only through a high-quality product but also through the personal development and sense of achievement of every player. He teams up with upper management to advise on, and own, technical direction, and with product management to create a common view of product and architectural requirements. Furthermore, Rodolfo has extensive experience in network protocols, an essential skill for finding and solving problems in distributed systems.

Finally, Rodolfo participated in Due Diligence teams for Keiretsu Forum, an angel investor community. As part of those teams, he analyzed and evaluated the product, technology, IP, and management team of a number of startups, co-writing the corresponding reports for investors.

Main technologies and skills: distributed systems and cloud services. AWS, OCI. Multi-region. Hierarchical and p2p systems. Kafka streaming. Cassandra. Victoria Metrics, MongoDB, Elasticsearch. Cloud services scalability, availability, design for failures, performance, and manageability. Kubernetes. Docker. Networking Protocols (IPv4, IPv6, TCP, UDP, HTTP internals, MQTT, Gossip protocols, proprietary networking protocols, BSD socket interface, TLS, Asynchronous communication, OpenSSL). APM, SNMP, CIM, WBEM, WS-Management. Linux, C/C++, Python, Go, Java. Machine Learning. Data Science. SIEM. Security.

Stack Overflow activity: http://stackoverflow.com/users/2906820/rodolk

Experience

  • Principal Engineer at Panasonic Avionics Corporation
    May 2025 - Present · 1 yr 2 mos

    Responsible for Head End systems design.

  • Wayaga LLC (5 yrs 6 mos)
    • Founder, Principal Consultant, and Distributed Systems Architect
      Apr 2021 - Present · 5 yrs 3 mos

      Wayaga helps customers improve performance, scalability, and availability of their cloud, on-prem, and hybrid systems to assure business continuity and excellent user experience. Technologies: AWS, Kubernetes (EKS), Kafka, Prometheus, Grafana, Time-series databases, InfluxDB, JMeter, Golang, Python, Bash, C/C++, Data Streaming Pipelines for AI, Networking Protocols, Data packet analysis, Cloud, Performance, Scalability, Manageability, High Availability. IoT industry.

    • DrNetwork creator and developer
      Jan 2021 - Present · 5 yrs 6 mos

      Developed DrNetwork, a product that observes communication protocols between distributed applications in any environment and detects error patterns, making them visible to developers, SREs, DevOps, and IT Ops. It shows system problems in the network that no other systems show. Moreover, in Kubernetes environments it triggers investigation actions to automatically detect an issue and find the root cause with great accuracy. Technologies: C/C++, Golang, AWS, Kubernetes, EKS, embedded systems, Elasticsearch, Redis, TCP/IP, HTTP, TLS.

    • Consulting for Lucid Motors
      Mar 2022 - Mar 2023 · 1 yr 1 mo

      Rodolfo solved challenging performance and scalability issues in Lucid's data streaming architecture that affected a key business area. Rodolfo thoroughly evaluated and helped set up a new database technology that can scale out horizontally in the cloud and can support the insertion of more than 100M signals per second.

  • Staff Database Reliability Engineer at Lucid Motors
    Dec 2023 - May 2025 · 1 yr 6 mos

    Assurance of high availability and scalability of the data insertion pipeline and data storage for vehicle signals at a throughput higher than 100 million signals per second. Implementation of APM for cloud services. Performance improvement of log shipping and storage. Data streaming pipelines, time-series databases, document databases, monitoring systems. Multi-region cloud systems. Kafka, Victoria Metrics, Go, Java, IoT, MQTT, AWS.

  • Systems and Software Architect at Autonopia
    Apr 2023 - Jul 2023 · 4 mos

    Rodolfo was in charge of ensuring that Autonopia's robot software was designed with the proper quality attributes for high availability, reliability, and flexibility. He was responsible for software quality and for the implementation of agile tools and processes that support CI/CD. Technologies: ROS, C/C++, Python, Linux, Embedded Systems, Networking Protocols.

  • McAfee (3 yrs 10 mos)
    • Principal Engineer and SIEM Architect
      Apr 2018 - Jan 2021 · 2 yrs 10 mos

      Security Information and Event Management (SIEM). SOC solutions. Led an amazing team in the re-architecture of McAfee SIEM to scale its database horizontally and incorporate microservices and state-of-the-art technology like Kafka for real-time data streaming and an open data platform. As a result, McAfee can now provide large-scale SIEM services, as in the new deal with Oracle (links below). Our product is in the leaders' quadrant of the Gartner report for SIEM. From Gartner’s Magic Quadrant for Security Information and Event Management: “McAfee has implemented a modern SIEM architecture that leverages big data technologies, such as Kafka and Elasticsearch. The open nature of the data tier allows organizations looking to feed data into or out of ESM to have flexible options”.

    • Sr. Distributed Systems Architect
      Apr 2017 - Apr 2018 · 1 yr 1 mo

      Enterprise Architect of SIEM (Security Information and Event Management) and its scalable database. Definition and implementation of scalable and highly available security solutions for Corporate. Architecture definition and implementation of a peer-to-peer, scalable, highly available, and high-performance database. Architecture definition for a scalable and highly available SIEM. Worked with a wonderful team of people that can do what others deem impossible. Design of a cluster management solution for a peer-to-peer, proprietary, high-performance database for SIEM. Design and implementation of a gossip protocol over TLS. Leadership and training of teams on design-for-failure patterns in distributed systems. Leadership of the design of performance, scalability, and design-for-failure test solutions. Training and empowerment of each team member with a focus on each person's skills and goals. I strongly believe a good leader is responsible for product success, the team's future, and team members' potential growth. Technologies: C/C++, Linux, Networking protocols, Gossip protocols, P2P, Kafka, Docker, distributed systems.