Greater Houston
Hi! I'm a Data Engineer with 2 years of relevant experience, specializing in building data pipelines within Databricks. EDUCATION: University of Houston C.T Bauer College of Business Business Honors Program DATA SCIENCE SKILLS: Python Pandas Scikit-learn TensorFlow R Shiny Seaborn Numpy Matplotlib SQL Tableau Statistics Calculus LANGUAGES: Python R Java PROJECTS: 2032 US Primaries LSTM Stock Prediction NBA Data Analysis and Linear Regression AWARDS: 1st Place J.P. Morgan Data For Good Hackathon
Databricks, Spark, Azure, Python, SQL,
Co-Founded the first Data Science and AI student organization at the University of Houston
● Applied bootstrapping algorithms in Python to convert German contract data to other languages using NLP. ● Programmed SQL queries to pull 500 MB of data to match team members to opportunities to close CLM deals. ● Created Data Visualizations in Google Data Studio to demonstrate how deals are affected when a solutions engineer is attached.
• Created a Data Pipeline using Databricks and Spark to transform genomic data to usable data. Coded using Packages in R such as TidyR, DPlyr, and GGplot. • Created a web application with interactive PCA, box and whisker, and other graphs using Shiny in R for the Bioinformatics team. Worked on UI and Server in RStudio • Used R and Shiny to visualize genomic data in order to apply deep learning methods to predict clinical biomarkers.