Junior Data Engineer

GEP Worldwide

São Paulo

Description

Welcome to GEP

Who We Are

GEP, with over 30 offices internationally, is the fastest growing procurement and supply chain solutions firm – consulting, software and managed services. GEP has succeeded by being smart and creative; solving complex problems and finding opportunities for the world’s largest organizations.

How You Grow At GEP

We recognize people who make a genuine difference, work hard and drive achievements. Results are noticed and rewarded. It’s how you will grow a career at GEP, and in a much shorter time frame than at other firms.

Celebrating Everyone

GEP succeeds through the ideas and creativity of our team members so we embrace people of all experiences, nationalities, abilities, cultures, races, gender identities, sexual orientations and ages. What makes you unique and different is celebrated and will help GEP stand out even more. And we are a women-founded and -owned company so our foundation is making GEP a great place to work for women, a place where women can learn, advance and give back.

What You Will Do

As a Junior Data Engineer at GEP, you will play a role in supporting our data infrastructure and ensuring the smooth flow of information across systems. You will work closely with senior engineers and cross-functional teams to build and maintain data pipelines, contributing to data-driven decision-making processes. You will:

  • Assist in designing and developing scalable web scrapers to support business analytics and reporting needs.
  • Collaborate with analysts and teams to ensure quality and availability of scraped data for various projects.
  • Perform data extraction from websites under guidance, handling basic dynamic content and anti-scraping measures.
  • Support the integration of new web data sources into existing systems, ensuring compatibility and accuracy.
  • Contribute to troubleshooting scraping-related issues and provide timely resolutions.
  • Document scraping processes and workflows to ensure transparency and repeatability.
  • Develop data engineering projects on Databricks

What You Should Bring

  • Education and Experience

oBachelor’s degree in Engineering, Computer Science, Information Technology, or a related field, or equivalent experience.

o0-2 years of experience in data-related roles (internships or academic projects are considered).

  • Languages

oFluent in Portuguese. Proficiency in English (spoken and written) is desirable due to potential collaboration with global teams. Spanish is a plus.

Other Skills

As GEP experiences active and rapid growth, we’re looking for passionate individuals who want to flourish along with us — helping to pave the way to a brighter future.

  • Knowledge of Python programming language focused on data (Pandas, Plotly, Streamlit).
  • Familiarity with web scraping tools (e.g., BeautifulSoup, Scrapy) and HTML/CSS parsing.
  • Understanding of web scraping concepts, including handling proxies and basic anti-scraping techniques.
  • Understanding of data engineering concepts and tools like Airflow and Databricks
  • Strong problem-solving skills and attention to detail.
  • Ability to work collaboratively in a team environment and adapt to feedback.
  • Closing Statement

Are you one of us?

GEP is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, ethnicity, color, national origin, religion, sex, protected veteran status, disability status, or any other characteristics protected by federal, state or local law. We are committed to hiring and valuing a global diverse work team. GEP is proud to be an EEO/AA employer M/F/D/V.