Senior Data Engineer - Forward deployed
Software Engineering, Data Science
United States · Canada · Mexico
About Andela:
At Andela, we know brilliance is evenly distributed around the world, but opportunity is not. For over 10 years, Andela has connected its customers with top global, remote technical talent from over 135 countries with the majority residing in emerging markets like Africa and Latin America.
As one of the world’s largest talent marketplaces, Andela gives companies greater flexibility to quickly deploy qualified technologists. With talent highly skilled in advanced technologies to support Application Development, Artificial Intelligence, Cloud & DevOps, Data Engineering, and much more, customers experience 33% faster project delivery. The company’s exclusive AI-powered platform, Andela Talent Cloud, is the industry’s only unified platform managing the complete global talent lifecycle and enables customers to fill individual roles or engage fully managed teams up to 66% faster.
Andela is on the precipice of two breakout industry transformations: one in staffing/hiring and the other in software development, both accelerated by generative AI.
Are you an exceptional, hungry leader seasoned in scaling businesses through transformation and growth? Join us and change the world.
Job Summary:
Andela is transitioning from a world-class talent marketplace into a high-scale, AI-integrated Talent Cloud.
In this Senior Data Engineer role, you will be forward deployed with an enterprise-scale organization in the automotive industry undergoing a significant transformation in how it leverages data and analytics to drive commercial and operational decisions. They are investing in modern data infrastructure, advanced analytics, and AI/ML capabilities to improve performance across key business areas. The environment emphasizes strategic thinking, cross-functional collaboration, and the ability to translate complex data into actionable insights.
This role requires someone who enjoys working with customers, thinks about data as a product, ensures data is reliable, accessible, and structured to enable self-service analytics across the organization.
Exceptional Leadership:
As an Andelan, you’ll serve as a role model for the rest of the company. Think about the feedback your peers typically give you – if it usually sounds like the below, we want to hear from you.
Low ego, low drama, servant leader: You share credit, take blame. You like being wrong because it means someone else had an even better idea.
One team mentality: You break silos across teams. You put the company and mission first above your team alone.
Great listener, hungry for feedback: You’re always seeking to improve – our product, our business, yourself. You solicit diverse opinions and deeply listen.
Owner, not renter: You see a problem, you fix it or find someone who will. The buck stops with you.
Player-coach: You fly high (create strategy) AND low (know the details that matter). You roll up your sleeves and get scrappy. You do this proactively collaborating with your team while actively engaging in important details.
Business problem solver: You’re not just a functional expert; you consistently get praise for approaching your function through the lens of solving business problems.
Key Responsibilities:
Build and maintain Snowflake data pipelines for Dealer 360 and Aftermarket workstreams respectively
Design and implement the dealer and aftermarket feature stores (Layer 1–2)
Build ingestion pipelines for all external data sources (JD Power, PIN, S&P, Vehicle Registration, competitive scraping + 2 TBC)
Write and maintain dbt models for data transformation, cleaning, and normalisation
Enforce schema validation, data quality checks, and freshness SLAs across all feeds
Collaborate with the Data Architect to implement the unified data model
Produce documented data lineage for every pipeline before any model is trained against it
Qualifications:
8+ years in data engineering on cloud platforms
Snowflake — data modelling, query optimisation, staging environments
Python — pandas, PySpark, data pipeline scripting
Experience building feature stores for ML consumption
Strong understanding of schema design and dimensional modelling
Nice to have
Experience in automotive, retail, or dealer network data
Familiarity with CRM data structures (for Aftermarket hire)
Azure — Data Factory, Blob Storage, or Synapse
Apache Airflow or similar orchestration tooling
Azure DevOps for pipeline CI/CD
#LI-REMOTE
#LI-RDR