Linh Le

Big Data Engineer

Hanoi Capital Region

About

I started my career as an AI Researcher, but now I work as a Data Engineer and I enjoy this position. I have experience in working with terabytes of data in a data lake, building and operating ETL pipelines and big data infrastructure such as Hadoop Distributed Platform. This has given me strong skills in writing SQL queries, especially for telecommunications data, and in object-oriented and functional programming with Python, Java, and Scala. In addition, I have knowledge of Data Structures and Algorithms, Machine Learning, Data Modeling, and other essential topics in Computer Science.

Experience

  • Big Data Engineer at Trusting Social
    Nov 2022 - Jan 2026 · 3 yrs 3 mos

    I am currently working with both on-premise and cloud data platforms: - Developing and operate an on-premise system with Hortonworks Data Platform, which includes Hadoop, Spark, and other tools for managing and processing big data. - Building frameworks to process data on Google Cloud Platform, using services such as Cloud Storage, BigQuery, Dataproc, Dataflow, and Cloud Functions. - Having experiences with devops tool like Docker, Terraform

  • Data Engineer at Viettel DGD
    Apr 2021 - Jun 2022 · 1 yr 3 mos

    Participate in the Viettel Data Lake Project, which is a large-scale data platform for telecommunications data: - Operating periodic data streams for analysis and reporting & develop new data ingestion and aggregation flows. - Working with the Hadoop ecosystem, Apache Spark, Hive QL, a data integration tool, Oracle Database, FTP systems, HDFS and Linux command line environment. - Following agile software development principles.