Location: Cape Town, Western Cape, South Africa Permanent Hybrid role Main Purpose of Role The Data & ML Engineer is responsible for executing the technical implementation of the Data Transformation. The role focuses on the data development journey through data ingestion, processing, automated data pipelines, and the establishment of cross-functional system integrations and AI/ML-ready data structures. The Data & ML Engineer must translate strategic direction into concrete technical solutions, make sound architectural recommendations, and deliver scalable, robust, production-grade data capabilities. Key Responsibilities Design, build and maintain end-to-end automated data pipelines from internal and external sources: Platform maintenance and development including calculation development, on-boarding assets and equipment and continuous improvement. Develop, implement and extend the integrated SSOT database, consolidating data from CAMs, ERP, OT/IoT, SharePoint and other platforms. Develop modular and scalable data ingestion using API integrations, SQL Stored Procedures and ETL frameworks, maintaining reliable, automated data ingestion between internal and external platforms to the SSOT database. Support and provide data development solutions. Be a practical driver and enforcer of the Data Transformation. Design and develop scalable data models (staging, core, marts, feature sets) that support strategic reporting, advanced analytics and ML. Develop scalable data models across clearly defined business layers, including staging (raw landed data), core (cleaned and standardised single source of truth), marts (business-ready views for specific domains), and feature sets (model-ready tables for Machine Learning and advanced analytics). Develop and support workflow automation and lightweight data applications using tools such as Power Apps and Power Automate, integrating these solutions with the core data platform. Develop ML algorithms, dedicated toward anomaly detection, prediction and neural networks. Ensuring all data models, pipelines and storage approaches are AI/ML-ready, including feature-ready datasets for pattern recognition, prediction and anomaly detection. Own end-to-end development and processing (data algorithms and ML solutions) Technical platform development, data orchestration and data management Adhere to governing data management, security and governance standards in line with the Data Governance Policy (Data Engineering, Audit and Risk, Cyber Security and IT requirements). Maintain, document and track comprehensive technical documentation and change management records for architectures, pipelines, automations, environments and access. Work with divisional data owners to reduce data silos, standardise data flows and ensure adherence to agreed standards and timelines. Skills and Competencies ETL/ELT orchestration and job scheduling (Automated workflows) Data modelling (staging, core, marts, feature sets); production operations (monitoring, alerting, incident response) Strong SQL; proficiency in Python for data engineering and ML-enabling tasks; and solid programming foundations in Python, SQL and/or C# Ability to make scalable architectural decisions and prepare data for ML and model integration into workflows. Solid ML foundations (feature engineering, evaluation, overfitting, drift) and ability to design data pipelines that are fit for ML. Automated workflow development (Power Automate and PowerApps) Strong, hands-on engineering mindset; comfortable taking technical ownership of assigned work: Proven ability to design and lead data platform or data product builds. Identifies problems, proposes solutions and drives implementation. Clear, structured communication skills; and can explain technical aspects to non-technical stakeholders and leadership. Strong systems thinking and architecture skills: designs for scalability, maintainability and AI/ML-readiness from the outset. Strong engineering discipline: version control, testing, deployment processes, documentation and incident handling. Enjoys building automation, integrations and ML-ready datasets. Experience, Knowledge and Qualifications Degree in Computer Science, Information Systems, Engineering, Mathematics or a related field. Proficient in data engineering. 2-4 years in data engineering/ data platform development or similar technical role, with hands on responsibility and clear evidence of data engineering development. Programming foundations in Python, SQL and/or C#, with clear evidence of making scalable architectural decisions. Hands-on responsibility for API integration (REST/JSON, auth, pagination, error handling); ETL/ ELT orchestration and job scheduling; data modelling (staging, core, marts/feature sets); and production operations (monitoring, alerting, incident response). Strong SQL skills (DDL/DML, performance tuning, stored procedures, views, functions). Proficiency in Python for data engineering and ML-enabling tasks, plus experience with ETL tooling and automation frameworks. Strong foundations in ML concepts and practical experience preparing data for ML and integrating models into data workflows (even if not a pure data scientist). Advantageous: Hands-on experience with ML models in production (e.g. forecasting, classification, anomaly detection) and associated MLOps tooling. Exposure to neural networks/ deep learning (e.g. TensorFlow, PyTorch) and modern ML pipelines. Experience with data lakes/ big data architectures and orchestration tools (e.g. Airflow, Prefect, Azure Data Factory or similar). Familiarity with AI/ML governance, model risk and secure data handling. Experience working with industrial/IoT. This role offers a total CTC of ZAR 397K - ZAR 529K #J-18808-Ljbffr