Yassir is the leading super app in the Maghreb region, set to change the way daily services are provided. It currently operates in 45 cities across Algeria, Morocco and Tunisia, and has recently expanded into France, Canada and Sub-Saharan Africa. Backed by $200M in funding from venture capitalists in Silicon Valley, Europe and beyond, we offer on-demand services such as ride-hailing and last-mile delivery. Building on this infrastructure, we are now introducing financial services to help our users pay, save and borrow digitally. Our mission is to usher the continent into a digital economy era, creating a marketplace that brings people what they need while infusing social values.

Responsibilities
- Build a centralized data lake on GCP data services by integrating diverse data sources throughout the enterprise
- Develop, maintain, and optimize Spark-powered batch and streaming data processing pipelines
- Leverage GCP data services for complex data engineering tasks and ensure smooth integration with other platform components
- Design and implement data validation and quality checks to ensure the data's accuracy, completeness, and consistency as it flows through the pipelines
- Work with Data Science and Machine Learning teams to engage in advanced analytics
- Collaborate with cross-functional teams, including data analysts, business users, and operational and marketing teams, to extract insights and value from data
- Collaborate with the product team to design, implement, and maintain data models for analytical use cases
- Design, develop, and maintain data dashboards for various teams using Looker Studio
- Engage in technology explorations, research and development, and POCs, and conduct deep investigations and troubleshooting
- Design and manage ETL/ELT processes, ensuring data integrity, availability and performance
- Troubleshoot data issues and conduct root-cause analysis when reporting data is in question

Required Technical Skills
- PySpark (batch and streaming)
- GCP: Dataproc, Dataflow, Datastream, Dataplex, Pub/Sub, BigQuery and Cloud Storage
- NoSQL (preferably MongoDB)
- Programming languages: Scala / Python
- Great Expectations, or a similar data quality (DQ) framework
- Familiarity with workflow management tools like Airflow, Prefect or Luigi
- Understanding of Data Governance, Data Warehousing and Data Modelling
- Good SQL knowledge

Business
- Able to communicate effectively and distill technical knowledge into digestible messages in a succinct, visual way
- Proactively identify and contribute to team development initiatives, and support junior members

Good To Have Skills
- Infrastructure-as-Code, preferably Terraform
- Docker and Kubernetes
- Looker
- AI / ML engineering knowledge
- Data lineage or related tools, e.g. Atlan
- dbt

At Yassir, we believe in the power of diversity and an inclusive culture. If you're ready to bring your unique perspective and experiences to the table, we're excited to listen.

We look forward to receiving your application. Best of luck!

We may use artificial intelligence tools to support parts of the hiring process, such as reviewing applications, analyzing resumes or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Mid/Senior Data Engineer
YASSIR
Johannesburg, Johannesburg
Published 7 days ago