Senior Data Engineer - Dbt, CI/CD
About the role
At ClickHouse, we believe in making product decisions grounded in data, not gut feelings. To support that, we've built a comprehensive internal data warehouse — read more about it on our blog. It powers analytics, forecasting, and decision-making across every team in the company.
Our stack centers on ClickHouse as the core storage and processing engine, complemented by dbt, Airflow, S3, AWS, Superset, and GitHub Actions. We ingest data from 30+ distinct external sources, process over 2.5 PB of data, and deliver most data marts to business consumers with a 1-hour end-to-end latency - a standard that demands highly optimized, rock-solid transformation pipelines.
We're looking for a Senior Data Engineer to help evolve the DWH platform, push its technical boundaries, and deliver reliable, high-performance data models to business users across the organization.
Key Responsibilities
DWH Platform Development
Design and develop reusable Airflow components - operators, connectors, and hooks - along with custom integrations tailored to our data architecture.
Build reusable dbt macros and abstractions: incremental strategies, ETL building blocks, and generic data quality tests, all optimized for ClickHouse-specific behavior and our data patterns.
Develop DataOps tooling - CI pipelines, data migration frameworks, and security controls - with a strong focus on reliability, self-service usability, and scale.
Team contributions
Conduct thorough code reviews for team members and other team contributors.
Actively participate in technical and architectural design discussions.
Drive technical leadership in specific areas of our stack.
Continuously improve the team's development environment, with a focus on automation - including AI-assisted workflows.
Contribute to platform stability through proactive troubleshooting and timely incident resolution.
Data modeling
Develop, refactor, and optimize business data models in line with our modeling standards and ClickHouse best practices.
Mentor and support contributors through code reviews and technical guidance.
Lead the design and implementation of the most technically demanding data marts - particularly those involving high data velocity, large volumes, or complex business logic.
Qualifications
Required:
Exceptional SQL skills - this is non-negotiable.
Strong hands-on experience with Airflow, dbt, and Python.
Proven track record building and optimizing large-scale, high-throughput data pipelines.
Solid understanding of data warehousing fundamentals: ETL/ELT, dimensional modeling, and data quality.
Bachelor's degree in Computer Science or a related field.
Comfortable thriving in a fast-paced startup environment — exciting and dynamic, but sometimes wonderfully chaotic.
Preferred:
Hands-on experience with ClickHouse - a significant advantage.
Familiarity with AWS or other cloud platforms.
Experience with CI/CD development, especially using GitHub Actions.
Key information
Share job
Craft your data & AI talent profile!
Join Dataaxy, be seen by top recruiters, and amplify your career opportunities.
Sign upTalent marketplace
Data & AI profiles
Maya Chen
Senior Data Scientist
Noah Martin
Machine Learning Engineer
Ava Wilson
Analytics Engineer
428
Profiles
82%
Matched
24h
Response