Data Engineer

Remote, Oregon, United States24 Jun 20261HWDAT

emb-global/data-engineer

Data Engineer

Job Description

Position: Senior Data Engineer — Analytics Platform Engagement Mode: Not specified Location: Not specified Position Overview: We are building an analytics platform on top of a set of operational source systems and are seeking a Senior Data Engineer to architect and build the ingestion layer that pulls event and reference data from these systems into a clean, queryable analytics platform. This is a hands-on, autonomous role where, in the initial phase, you will act as the sole architect — choosing ingestion strategies for each source, designing the event schema and data model, and proving the pipeline end to end on one source before scaling to the rest. As the platform matures, you will play a key role in shaping the team that builds on top of it. The role demands strong technical ownership, pragmatic delivery, and the ability to work across complex, on-prem operational environments with segregated network zones. Key Responsibilities: • Design and build the ingestion layer from on-prem operational systems into Kafka using CDC (Debezium) off SQL Server backends and vendor REST/SOAP APIs where appropriate • Stand up the streaming backbone including Kafka, Kafka Connect, and Schema Registry (Avro/Protobuf) with schemas that survive dedup, replays, and out-of-order delivery • Orchestrate batch loads, backfills, dimension refreshes, and reconciliation jobs in Airflow • Model core domain entities and the event streams they generate with correct semantics for downstream analytics • Land data in the warehouse/lakehouse and build reconciliation checks that prove completeness and correctness • Own documentation, infrastructure-as-code, and runbooks from day one as a core deliverable • Work with the security/network team to move data out of segregated network zones safely • Architect ingestion strategies for each source system before scaling across the platform • Prove the pipeline end to end on one source before fanning out to additional sources • Shape and grow the data engineering team as the platform matures Required Skills: • 5+ years in data engineering with real ownership of an ingestion architecture end to end • Strong Kafka expertise beyond produce/consume: Kafka Connect, Schema Registry, and change data capture (Debezium or equivalent) • Deep SQL Server / T-SQL skills: reading undocumented schemas, enabling CDC, working against production databases without disruption • Airflow or comparable orchestrator for production batch pipelines
API/SDK integration experience with messy, proprietary, or poorly documented sources • Solid data modeling, especially event-heavy data with slowly-changing dimensions • Pragmatic delivery approach: ability to land one source cleanly before scaling • Experience with REST and SOAP API integrations • Avro and Protobuf schema design • Infrastructure-as-code practices • Experience designing and managing event schemas for deduplication, replays, and out-of-order delivery Preferred (Bonus) Skills: • Experience working in on-prem, secure, or air-gapped network environments (VPNs, jump hosts, hybrid architectures) • Familiarity with data privacy and compliance standards such as GDPR, data retention, and residency requirements for sensitive PII • Lakehouse and/or warehouse experience with platforms such as Snowflake, BigQuery, or Databricks

Quick Apply

~2 min

Apply through whichever channel suits you best.

CompanyEMB Global

Departmentrecruitment

Posted24 Jun 2026