Job Details

View jobs in our app

Learn more about the app. Workinapps.com

Data Engineer Data Pipelines and ETL

2026-06-12 Paramount Global San Francisco,CA

Description:

Data Engineer Data Pipelines and ETL

45938 Burbank, CA, US, 91505 San Francisco, CA, US, 94107 Research Burbank Full-Time On-Site

On a mission to unleash the power of content you in? We've got the brands, we've got the stars, we've got the power to achieve our mission to entertain the planet now all we're missing is YOU! Becoming a part of Paramount means joining a team of passionate people who not only recognize the power of content but also enjoy a touch of fun and uniqueness. Together, we co-create moments that matter both for our audiences and our employees and aim to leave a positive mark on culture.

Summary

The Data Engineering team is hiring a Data Engineer Data Pipeline & ETL. You will help build and maintain scalable data platforms and ETL/ELT pipelines in a fast-moving environment. In this role, you will build and support batch and real-time data systems powering analytics, ML, and AI applications. You will also grow your expertise in modern data architecture and cloud-native best practices.

Key Responsibilities

Build and Maintain Scalable Data Pipelines
Design, develop, and maintain scalable batch and streaming data pipelines for large-scale structured and unstructured datasets.
Build robust ETL/ELT frameworks supporting analytics, BI, experimentation, and machine learning use cases.
Optimize pipelines for performance, reliability, scalability, and cost efficiency.
Implement advanced ingestion patterns including CDC, incremental loads, and event-driven processing.
Data Modeling & Data Warehouse Architecture
Design scalable, dimensional, and hybrid data models optimized for analytics and ML use cases.
Develop reusable transformation layers (semantic layers) that serve BI, ML, and AI applications.
Write optimized, production-grade SQL for large-scale analytics workloads.
Contribute to query optimization, indexing, partitioning, and performance tuning across distributed systems and cloud warehouses.
Modern Data Pipeline Development
Build and maintain modular data components following established framework patterns.
Contribute to architectural decisions across streaming systems, data lakes, and warehouses.
Data Quality, Governance & Observability
Implement automated data validation, anomaly detection, and monitoring frameworks.
Establish data lineage and metadata standards to support reproducibility in ML workflows.
Enforce governance, privacy, and security best practices, particularly for sensitive AI datasets.
Ensure responsible AI data usage and compliance standards.

Required Technical Skills

Advanced Data Pipeline & ETL/ELT Expertise
24+ years of experience building and scaling ETL/ELT pipelines in production environments.
Experience with workflow orchestration tools such as Airflow, Composer, or similar platforms.
Strong understanding of distributed data processing concepts.
SQL & Data Modeling for Analytics & ML
Expert-level SQL skills for large-scale transformation and analytics.
Experience designing scalable warehouse schemas and ML-ready data layers.
Strong experience optimizing complex queries across multi-terabyte datasets.
Programming & ML Data Integration
Proficiency in Python (or similar language) for data processing and ML pipeline integration.
Experience with distributed processing frameworks such as Spark.
Familiarity integrating data pipelines with ML platforms such as Vertex AI (preferred), Databricks ML, or equivalent.
Streaming & Event-Driven Systems
Experience building real-time data pipelines using Kafka, Pub/Sub, or similar technologies.
Understanding of feature streaming, low-latency data processing, and event-driven architectures.
Ability to architect and build real-time dashboards using Superset.
Cloud & Modern AI Data Platforms
Experience designing cloud-native data architectures (GCP preferred).
Experience with lakehouse architectures and cloud data warehouses.
Familiarity with vector databases, embeddings pipelines, and AI-serving infrastructure is a plus.

Basic Qualifications

Bachelor's or Master's degree in Computer Science, Engineering, or related field (or equivalent experience). 24+ years of experience in data engineering, data pipeline development, or related fields. Strong foundation in modern data engineering principles, distributed systems design, and cloud-native architectures. Demonstrated ability to design and operate large-scale production data systems. Proven track record of technical leadership and cross-functional collaboration. Strong problem-solving skills and ability to thrive in complex, fast-paced environments. Detail-oriented and committed to engineering excellence and continuous improvement.

Job Details

View jobs in our app

Data Engineer Data Pipelines and ETL

Data Engineer Data Pipelines and ETL

Summary

Key Responsibilities

Required Technical Skills

Basic Qualifications

Apply for this Job

Registration Required

Login to Apply

You are leaving our site

Registration Required

Email this job to a friend

Job: Data Engineer Data Pipelines and ETL

Job Alert Sign Up

Add To Job Alert

Job Alert Updated

Email Customer Care