Internship: Data engineering: Schema evolution - Ieper, Belgium
Internship: Data engineering: Schema evolution
Your future job
Data engineering: schema evolution handling on incoming data flow.There's a constant increase (in volumes and ingestion rates) on incoming dataflows from various sources towards our data analytics and reporting platforms. Over time, the fields and/or their definitions within those flows can change.
Next to the development of schema evolution detection and handling scripts, these setups need to be embedded within the data lake environment in order to be used in an operational environment.
The goal of the project is to identify schema evolution on incoming data flows, to 'productize' this mechanism and to incorporate it on the data lake intake side.
- Schema evolution detection and handling on different types of data flows
- Productization of the solution (setting up data pipelines, versioning, pipelines for deploying different versions,...)
- Implement the mechanism in the data lake setup
(From 4 weeks up to 6 months)
- Implement the mechanism in the data lake setup
(From 4 weeks up to 6 months)
Your profile
- Student in Bachelor or Master in IT
IT software analysis, design and development practices
Minimal: Python & SQL development
Preferably: Git/Gitlab CI, Docker, Kubernetes
Data engineering knowledge
Main technologies used:
- Metadata solutions - catalogs, ...
- Databricks / spark
- Python, SQL
- Continuous delivery (GIT, gitlab CI, Docker, Kubernetes,..)
- Datalake concepts knowledge
Competencies the student could develop
- Working with state-of-the-art enterprise applications frameworks used to develop and deploy applications to be used worldwide
- Analysis and development methodologies like Domain Driven Design and continuous integration/ deployment
- Data engineering tasks like ETL, Python
We offer
- a challenging job in a dynamic high-tech international environment
- the opportunity to take ownership of your professional passion in order to contribute to the success of the company
- an enjoyable, team-oriented and professional atmosphere in a flat-structured organization
- versatile development opportunities