S
source · wttj·req · jb_0957069494·listed 1d ago
Senior Data Engineer (Infra)
Solidus Labs·London, United Kingdom·Hybrid·Full-time
Sourced listing · wttjNo salary disclosed
compensation · not disclosed
Salary not shared
Sign up to see our estimate based on role, location, and seniority.
source · estimate pending
Summary
the pitchJoin our ambitious start-up as a Senior Data Engineer. You will be responsible for designing and optimizing the ClickHouse data layer, driving data reliability and deduplication strategies, and establishing monitoring and observability for the ClickHouse layer. You will also serve as the internal ClickHouse authority, collaborate with downstream consumers, and define and enforce schema versioning and governance standards. The ideal candidate will have at least 8 years of experience in data engineering, deep hands-on ClickHouse expertise, and excellent communication skills.
Role
posted by company- Proficiency across the broader data engineering stack: Apache Kafka, Spark, Airflow, Kubernetes, Redis, Snowflake, and caching technologies
- Experience working in low-latency, real-time systems processing billions of events a day
- Strong background as a software engineer with at least 5+ years of hands-on experience with Java, Rust, or Python
- Expert-level SQL and query optimization skills, with a strong emphasis on ClickHouse-specific patterns - materialized views, projections, TTLs, and merge tree tuning
- Experience with monitoring and observability tools (Prometheus, Grafana, or similar), with the ability to define and own operational health metrics for a ClickHouse deployment
- BSc. in Computer Sciences
- 8+ years in data engineering and data pipeline development on high-volume, low-latency production environments
- Deep, hands-on ClickHouse expertise - including cluster architecture, table engine selection, replication, sharding, and query optimization. Experience engaging with the ClickHouse vendor team or community is a strong plus
- Curiosity, ability to work independently, and a track record of proactively identifying and driving solutions
- Excellent verbal and written communication skills, including the ability to coach and influence engineers across teams in a remote environment
Key responsibilities
- Design and optimize the ClickHouse data layer, including table engines, partition strategies, materialized views, and storage policies, to ensure high performance at billions-of-events scale.
- Drive data reliability and deduplication strategies within ClickHouse, leveraging engine-level features and pipeline-level controls to guarantee data completeness and consistency.
- Establish and continuously improve monitoring, alerting, and observability for the ClickHouse layer, covering replication health, merge performance, query latency, and resource utilization.