Senior Data Engineer: Pipelines Cloud

ai71

Job Description: Senior Data Engineer (Real-Time Pipelines)

AI71, Abu Dhabi’s pioneering AI venture focused on decentralized data and sovereign AI, is seeking a Senior Data Engineer specializing in Real-Time Pipelines. This role is central to the firm’s mission of transforming large-scale, high-velocity data into the “fuel” for advanced Large Language Models (LLMs) and generative AI applications.

As a Senior Data Engineer, you will be the architect of the data flow. You aren’t just moving data from A to B; you are building low-latency, fault-tolerant systems that can process millions of events per second in a cloud-native environment. In an AI-first organization like AI71, your work directly impacts the training efficiency and inference quality of cutting-edge AI models. If you have a deep pedigree in distributed systems and thrive at the intersection of Data Engineering and DevOps, this role offers a chance to build the infrastructure of the future.


Key Responsibilities

  • Architecture Design: Design and implement scalable, real-time data ingestion and processing pipelines using a “Lambda” or “Kappa” architecture.

  • Cloud Infrastructure: Build and manage data infrastructure on AWS or GCP, leveraging services like Amazon Kinesis, Managed Kafka, or Google Pub/Sub.

  • Pipeline Optimization: Refine ETL/ELT processes to ensure high data quality, minimal latency, and cost-efficient resource utilization.

  • Orchestration & Containerization: Deploy and manage data workloads using Docker and Kubernetes, ensuring high availability and automated scaling.

  • Data Governance: Implement robust data schemas and versioning controls to support AI researchers and data scientists.

  • Performance Tuning: Optimize SQL queries and distributed compute jobs (e.g., Spark/Flink) to handle petabyte-scale datasets.


Qualifications & Requirements

  • Experience: At least 8 years of professional experience in data engineering or backend systems.

  • Programming Mastery: Expert-level proficiency in Python (specifically for data processing) and SQL.

  • Cloud & DevOps: Hands-on experience with AWS or GCP and proficiency in Kubernetes and Docker.

  • Streaming Technologies: Proven experience with real-time tools such as Apache Kafka, Flink, or Spark Streaming.

  • Education: Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field.

  • Mindset: Ability to operate in a fast-paced, mission-driven environment with a focus on impactful AI applications.


Job Data Summary

Category: Details
Company Name: AI71
Position Title: Senior Data Engineer: Real-Time Pipelines
Location: Abu Dhabi, United Arab Emirates
Salary Range: AED 120,000 – AED 200,000 per year
Experience Level: Senior (8+ Years)
Core Stack: Python, SQL, K8s, Kafka, AWS/GCP
Industry: Artificial Intelligence / Machine Learning

To apply for this job, please visit www.learn4good.com.