Greater Philadelphia
Always accepting new challenges, I believe growth comes from taking initiative, staying adaptable, and turning opportunities into meaningful impact. Over the years, I have grown from an intern into a Data Engineer at Children's Hospital of Philadelphia, where I have spent the last 7+ years building data solutions, modernizing platforms, and helping improve operational efficiency across teams.

My academic background includes a Bachelor’s degree in Computer Science from Rutgers University and a Master’s degree in Data Science. Early in my career, I gained experience in Analytics and Big Data through internships at Children's Hospital of Philadelphia and Larsen & Toubro Infotech.

Throughout my professional journey, I have worked on ETL pipelines, cloud and analytics platforms, automation initiatives, and scalable data engineering solutions using technologies such as Snowflake, Airflow, DBT, Python, and SQL. In addition to technical development, I have increasingly taken ownership of projects, collaborated across teams, mentored engineers, and contributed to improving engineering processes and delivery standards.

I enjoy working in environments where continuous learning, communication, and leadership are valued just as much as technical expertise. I believe that with curiosity, consistency, and sincere effort, every challenge can become an opportunity to learn, lead, and create long-term impact.
Worked as an intern on the Analytics & Reporting team for 8 months.
Technologies: Java, QlikView, SAP BO, XML, SQL, API, Postman, Informatica, ETL, Batch Script
Dashboard:
• Developed BI analytics QlikView dashboards to maintain user security in SAP BO using Java
• Used RESTful web API services to retrieve XML data from the healthcare data warehouse
• Executed SQL queries in QlikView to extract and load data from approximately 20,000 datasets into the QlikView dashboard
• Performed API testing with Postman to validate XML and JSON data retrieval from a Tomcat server
Automation Tool:
• Developed a Batch Script tool to automate the migration of data from Development to Production through the SDLC
• Tool usage resulted in a 60% reduction in operational workload across development and admin teams
Google Analytics:
• Helped the Business Ops team develop a Google Analytics dashboard in Qlik to view real-time data without manually logging in
ETL Scheduler:
• Used workload automation software to execute job chains in ETL for Informatica PowerCenter
Technologies: R, HDFS, MapReduce, Hive
• MapReduce: Developed and executed a word-count program using Hadoop and MapReduce
• Hive: Consolidated enterprise data and migrated it to Cloudera through FTP and SFTP servers using Hive commands
• R and Text Mining: Analyzed approximately 1,000 words across multiple text files of unstructured data using R. Developed and executed text-mining programs to compute word similarity and cosine similarity across multiple files. Computed term frequency-inverse document frequency (TF-IDF) using Natural Language Processing (NLP)
• Machine Learning: Analyzed customer search data using a confusion matrix and user/item-based collaborative filtering