Data Intelligence · Infrastructure·Architect & Developer·2024 — PresentProduction

DATA WAREHOUSE

DATA WAREHOUSE cover

Data used to live in 5 disconnected platforms. Every report was manual, hours of work, days late. This warehouse unified Calendly, Valley, ClickUp, GoHighLevel, and Clay into one PostgreSQL brain, with AI-powered enrichment pipelines and confidence scoring — so insights are real-time and leadership has visibility without asking.

Scale
48,000+
Prospects processed
99.9%
Uptime
Self-hosted bare metal
Hosting
5 platforms
Sources unified
Stack
PostgreSQLn8n (self-hosted)ClaudeGPTMetabaseHetznerWebhooksJSON Schema
Highlights
01

Multi-source Ingestion

Calendly, Valley (LinkedIn), ClickUp, GoHighLevel, and Clay all funnel into a unified PostgreSQL warehouse via webhook-driven pipelines. One schema, one query language, one place to look.

02

Version-controlled n8n Workflows

Self-hosted n8n with all workflows version-controlled in Git. Automated retries, error alerting, and deployment pipelines — so workflow changes are reviewable and rollbackable, not magic boxes.

03

AI Enrichment Pipelines

Claude and GPT power enrichment pipelines that classify, categorize, and augment raw prospect data. JSON Schema validation at every stage ensures downstream consumers always get clean, typed records.

04

Confidence Scoring & Human-in-the-loop

Uncertain enrichment matches get flagged with a confidence score below threshold. Low-confidence records route to a human review queue rather than silently polluting the warehouse with wrong data.

© 2026 Subin Joshua Sunil · Built in the dark.

← Back to portfolio