Principal Data Platform Architect
We're seeking a highly skilled and experienced data platform architect to lead the design, development, and optimization of our data ecosystem. This role is ideal for someone who thrives at the intersection of cloud architecture, big data engineering, and enabling AI/ML capabilities at scale.
Your day-to-day will involve:
- Architecting and building scalable data platforms using AWS services such as S3, Glue, Lambda, Redshift, EMR, and CloudWatch.
- Designing and optimizing end-to-end ETL/ELT pipelines using Databricks, PySpark, Python, and SQL to support batch and real-time data workflows.
- Defining, building, and maintaining data models and warehouse structures optimized for analytics and ML workloads.
- Implementing and maintaining CI/CD pipelines for data workflows and ML models using Jenkins, Git, and other DevOps tools.
- Building and supporting real-time data pipelines using tools such as Kafka, Kinesis, or Structured Streaming.
- Driving the adoption of DataOps and MLOps best practices, ensuring robust testing, observability, monitoring, and rollback strategies.
- Partnering with machine learning engineers to enable scalable model training, deployment, and monitoring pipelines.
- Establishing and enforcing data quality, governance, security, and cataloging standards.
- Evaluating and recommending new tools and frameworks that enhance the scalability and reliability of the data ecosystem.
- Mentoring junior engineers, promoting engineering excellence, and participating in architectural decision-making.
Required skills and qualifications include:
- 8+ years of experience in data engineering, with at least 3 years in a principal or lead-level role.
- Strong experience with AWS data services (e.g., S3, Glue, Lambda, Redshift, EMR).
- Deep expertise in Databricks (clusters, jobs, Delta Lake, Unity Catalog, notebooks).
- Proficiency in Python and PySpark for developing large-scale data processing jobs.
- Advanced SQL skills, including complex joins, window functions, CTEs, and performance tuning.
- Hands-on experience with Jenkins for CI/CD automation in data and ML workflows.
- Solid understanding of DataOps and MLOps practices, including version control, testing, monitoring, and deployment of data pipelines and models.
- Experience with orchestration tools such as Airflow, dbt, or similar.
- Familiarity with data security, compliance, and governance frameworks.
Benefits of this role include:
- A dynamic and innovative work environment that fosters open communication and collaboration across all levels.
- Opportunities for professional growth and development, including mentorship and participation in architectural decision-making.
- The chance to make a meaningful impact on the company's success and contribute to exciting projects that transform the in-venue entertainment industry.
This role requires strong problem-solving and communication skills, with the ability to lead cross-functional technical discussions and drive results-oriented solutions.