Saif Shaikh
University of Ottawa, Tabaret Hall, Ottawa, Canada

About

I’m a data engineer focused on building large-scale pipelines, orchestration systems, and ML infrastructure. I care about reliability, lineage, and quality across the entire data lifecycle.

Recent work includes lakehouse migrations, feature platform development, and backend services that support production ML.

Reach me: sshai072@uottawa.ca, @saifshai, saifullah-shaikh

Work

Atlas Data Systems

Senior Data Engineer

2022 — Present
  • Led migration to a lakehouse architecture with CDC ingestion and quality checks.
  • Built multi-tenant orchestration for 200+ pipelines with SLA monitoring.

Vector Labs

Backend / ML Engineer

2019 — 2022
  • Shipped a model-serving platform with canary releases and auto-scaling.
  • Implemented feature store pipelines for real-time personalization.

Selected Projects

Lakehouse Metrics Pipeline

Batch + streaming ingestion with quality gates and cost-aware storage.

Delta Lake, Spark, Data Quality

Orchestrated Feature Store

Workflow-driven feature computation with lineage, backfills, and monitoring.

Airflow, Feature Store, Monitoring

ML Inference Platform

Low-latency serving with autoscaling, canary deploys, and observability.

Kubernetes, ML Ops, SLOs

Blog

Building reliable DAGs: patterns that prevent silent data loss

Aug 15, 2024

Serving ML at scale without burning the budget

Jul 12, 2024

A practical guide to data observability baselines

Jun 3, 2024