Kendrick Horeftis

Data Engineer, Solution Architect, Problem Solver

Available Now

19 years of building pipelines that don't wake you at 2am

From startup co-founder to Fortune 5 engineer - crafting the solutions and systems teams rely on.

Work with me

Experience

19 years building data platforms across startups and enterprise.

Present|Feb 2022

Principal Data Engineer / Solutions Architect

SELVR·Remote

Financial Services Platform

  • Architected a real-time reporting platform that unified 5 siloed payment processors, streaming 10M+ daily records through Kafka into Snowflake and eliminating $30k/month in revenue losses from delayed visibility.
  • Engineered a Redshift-to-Snowflake migration with full data model re-architecture, integrating ERP and Salesforce feeds to deliver self-service analytics across multiple business units.

Live Commerce Platform

  • Rescued an abandoned data environment with a 65% daily pipeline success rate, consolidating 430 Airflow DAGs to 12 through a YAML-driven factory pattern, saving ~$40k/month in compute and eliminating daily operational triage.
  • Modernized a 15k-tenant platform from daily batch processing to near-real-time by migrating to Snowflake with hourly CDC ingestion, introducing dbt to compress total pipeline runtime from 20+ hours to 90 minutes.

Healthcare Data Platform

  • Deployed a hybrid data lake and warehouse on GCS and BigQuery meeting HIPAA, PCI, CCPA, and GDPR requirements, migrating from Oracle RDS on fully serverless compute for elastic scaling and cost predictability.
AWS
GCP
Snowflake
Redshift
BigQuery
dbt
+6
Airflow
Kafka
Kubernetes
Terraform
GitHub Actions
Oracle
Feb 2022|Jun 2019

Principal Data Engineer / Co-Founder

OverlayAnalytics·Remote

  • Built the entire data platform solo from first line of code to production, engineering a multi-tenant financial reporting system across 50+ isolated client environments on serverless AWS and Snowflake that won the inaugural Snowflake Startup Challenge and secured a $3.3M seed round.
  • Compressed financial reporting from six-week manual cycles to four-hour automated delivery for mid-market finance teams, scaling to $1.25M MRR before acquisition through near-zero marginal onboarding cost and custom multi-tenant dbt tooling that eliminated per-client engineering effort.
AWS
Docker
Kubernetes
Snowflake
dbt
Airflow
+5
Kafka
Terraform
React
Cube.js
GitLab CI
Jun 2019|Aug 2018

Staff Data Engineer

Cigna·Remote

  • Spearheaded the migration of a 100TB enterprise data warehouse from on-prem Teradata and Hadoop to GCP, architecting a data lake on GCS with BigQuery as the new EDW and eliminating over $800k in annual licensing costs.
  • Pioneered enterprise adoption of dbt in its earliest releases, establishing transformation standards that reduced model development cycles by ~50% across the broader data engineering organization.
  • Built a custom metadata and governance platform on GCP serving a 10,000-user data organization, enabling data contracts and SLAs that reduced operational costs across the migrated cloud infrastructure.
GCP
BigQuery
OpenShift
Teradata
Hadoop
dbt
+4
Airflow
Kafka
Terraform
Tableau
Aug 2018|Oct 2016

Staff Data Engineer

UnitedHealth Group·Remote

  • Engineered a real-time platform compressing 3B daily call records from 30-minute batch delays to 15-second freshness, powering mission-critical routing decisions where minutes of downtime carried nine-figure financial exposure.
  • Delivered 8-figure annual savings by engineering individual-level performance analytics across a 150k-employee contact operation, replacing opaque aggregate reporting with actionable optimization signals.
AWS
SQL Server
Redshift
Snowflake
Tableau
React
+3
Java
Python
PySpark
Oct 2016|Oct 2014

Senior Data Engineer

Elevance Health (formerly Anthem, Inc.)·Remote

  • Compressed M&A data integration from 6 months to 2 weeks by engineering templated models and automated migrations, turning a bespoke manual process into a repeatable playbook for a Fortune 33 health insurer.
  • Drove 6-figure annual savings per acquisition by retiring legacy licensing and infrastructure across 15 integrations, standardizing heterogeneous source systems into Anthem's unified data platform.
AWS
Python
SQL Server
Oracle
DB2
Impala
+1
Hive
Oct 2014|Jun 2014

Senior Data Engineer (Contract)

Intelemedia Communications·Plano, TX

  • Migrated 100 on-prem SSIS pipelines to serverless Kafka and Lambda on AWS, shifting call routing and lead scoring from stale batch feeds to real-time streams for enterprise client operations.
AWS
Python
SQL Server
SSIS
Lambda
S3
Jun 2014|Oct 2012

Data Engineer

Santander Consumer USA, Inc.·Lewisville, TX

  • Slashed daily ETL runtimes by 87% by refactoring 1,400 stored procs into 300 on top of newly formed data lake.
  • Integrated predictive default scoring directly into dialer routing logic, intercepting at-risk borrowers at loan inception and cutting repossessions by 12%.
SQL Server
Control-M
Bash
R
Oct 2012|May 2007

Data Analyst

Mouser Electronics·Mansfield, TX

  • Automated parametric data extraction across millions of component SKUs through pattern recognition algorithms, eliminating 4 planned hires and saving $80k annually.
  • Halved daily pipeline runtimes from 6 hours to 3 by overhauling legacy SQL Server ETL into Python, accelerating supplier ingestion and catalog reporting.
Python
SQL Server
SSIS
SSRS
Bash
Tableau

Tech Stack