NEW Medallion Lakehouse Pipeline — dbt, DuckDB, BigQuery

Garrett Schumacher

Data Analyst • Data Storytelling • Human-in-the-Loop ML • Automation • Business & Operations Management

Download Resume • Email • GitHub • LinkedIn • Book a Meeting

I am an operations leader moving into data analytics and engineering. This portfolio is a hands-on collection of projects and case studies in which I have learned, practiced, and applied these skills on realistic business challenges, showing how I connect operations expertise with technical solutions.

Open to remote | contract | internships | hybrid | in-person

Analyst Resource Hub

A curated, modular knowledge base of checklists, decision cards, guidebooks, and reusable scripts across Python, SQL, and analytics workflows. Originally built in Obsidian and published for quick, skimmable reference.

QuickRefs • Checklists • Guidebooks • Decision Cards • Templates

Visit the Resource Hub ↗  •  View Repository

SQL Stories Ecosystem

Narrative-Driven Sandbox for Analytics

I built this ecosystem to bridge the gap between theory and practice. It’s an end-to-end sandbox for exploring how data is generated, connected, and transformed into action. It serves as both my personal learning lab and a resource for others—a practical roadmap from raw data to compelling stories.

NEW! Google Cloud Storage Extension: Ships generator output to Google Cloud Storage as partitioned Parquet files with lineage metadata, ready to load into a data lake.
Transformation Layer: The ecom-datalake-pipelines project extends this ecosystem with production-grade Bronze → Silver → Gold transformations using dbt, DuckDB, and BigQuery.
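
For readers new to the Bronze → Silver → Gold pattern, here is a minimal sketch of the idea in plain Python. It is not code from ecom-datalake-pipelines and it does not use dbt, DuckDB, or BigQuery; the sample rows, column names, and the `to_silver` and `to_gold` helpers are all hypothetical and exist only to show what each layer is responsible for.

```python
from collections import defaultdict
from datetime import date

# Bronze: raw rows as landed (hypothetical sample), untyped and possibly
# duplicated or incomplete.
BRONZE_ORDERS = [
    {"order_id": "1001", "order_date": "2024-01-05", "amount": "19.99"},
    {"order_id": "1001", "order_date": "2024-01-05", "amount": "19.99"},  # duplicate
    {"order_id": "1002", "order_date": "2024-01-05", "amount": "5.01"},
    {"order_id": "1003", "order_date": "2024-01-06", "amount": ""},  # missing amount
    {"order_id": "1004", "order_date": "2024-01-06", "amount": "40.00"},
]


def to_silver(bronze_rows):
    """Silver: cast types, drop rows with no amount, keep the first row per order_id."""
    seen = set()
    silver = []
    for row in bronze_rows:
        if not row["amount"] or row["order_id"] in seen:
            continue
        seen.add(row["order_id"])
        silver.append({
            "order_id": int(row["order_id"]),
            "order_date": date.fromisoformat(row["order_date"]),
            # Store money as integer cents to avoid float drift when summing.
            "amount_cents": round(float(row["amount"]) * 100),
        })
    return silver


def to_gold(silver_rows):
    """Gold: aggregate cleaned orders into daily revenue (in cents)."""
    daily = defaultdict(int)
    for row in silver_rows:
        daily[row["order_date"]] += row["amount_cents"]
    return dict(sorted(daily.items()))


silver_orders = to_silver(BRONZE_ORDERS)
gold_daily_revenue = to_gold(silver_orders)
```

In a dbt project each of these steps would typically be a SQL model rather than a Python function, but the division of labor between layers is the same.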

Featured Projects

Case Studies

Request from the VP of Sales

End-to-end SQL workflow with time series analysis of synthetic e-commerce data, highlighting revenue leakage, churn patterns, and return-related risks.

SQL • Looker • Data Analysis

SQL Stories: Inventory Audit

Investigates inventory efficiency challenges for a simulated e-commerce retailer, addressing locked capital, problematic returns, and under-utilization.

SQL • Python • Data Storytelling

SQL Stories: Customer Retention

Investigates customer retention dynamics, focusing on early churn, repeat purchase conversion, loyalty program gaps, and marketing channel effectiveness.
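
As a rough illustration of the cohort idea behind repeat purchase conversion, the sketch below groups customers by the month of their first order and measures the share who order again in a later month. The `ORDERS` sample and the `repeat_rate_by_cohort` helper are hypothetical and are not taken from the case study, which may define retention differently.

```python
from collections import defaultdict

# Hypothetical sample: (customer_id, order_month) pairs with ISO "YYYY-MM" months.
ORDERS = [
    ("a", "2024-01"), ("a", "2024-02"),
    ("b", "2024-01"),
    ("c", "2024-02"), ("c", "2024-03"),
    ("d", "2024-02"), ("d", "2024-02"),  # two orders, but in the same month
]


def repeat_rate_by_cohort(orders):
    """Return {cohort_month: share of customers who ordered in a later month}."""
    months_by_customer = defaultdict(set)
    for customer_id, order_month in orders:
        months_by_customer[customer_id].add(order_month)

    cohort_size = defaultdict(int)
    cohort_repeaters = defaultdict(int)
    for months in months_by_customer.values():
        cohort = min(months)  # ISO month strings sort chronologically
        cohort_size[cohort] += 1
        if len(months) > 1:  # active in at least one month after the first
            cohort_repeaters[cohort] += 1

    return {c: cohort_repeaters[c] / cohort_size[c] for c in sorted(cohort_size)}


repeat_rates = repeat_rate_by_cohort(ORDERS)
```

Customer "d" shows one design choice worth making explicit: two orders inside the first month do not count as a repeat here, because this sketch defines a repeat as activity in a later month.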

SQL • Python • Cohort Analysis

Data Generators

Synthetic data generators for testing, learning, and portfolio development. These tools create realistic datasets with controlled complexity and messiness profiles for hands-on practice with SQL, Python, and analytics workflows.

Ecom Sales Data Generator

Generates realistic, relational e-commerce datasets with configurable volumes, seasonality, and messiness levels. The engine behind the SQL Stories ecosystem.
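
To give a sense of what configurable volume and messiness can mean in practice, here is a small standard-library sketch. It is not the generator's actual code or configuration format; the `generate_orders` function, its parameters, and the column names are hypothetical, and real messiness profiles would likely cover more than missing values.

```python
import random


def generate_orders(n_rows, messiness=0.0, seed=42):
    """Generate n_rows synthetic orders; blank out the amount with probability `messiness`.

    A seeded random.Random instance keeps the output reproducible between runs.
    """
    rng = random.Random(seed)
    rows = []
    for order_id in range(1, n_rows + 1):
        row = {
            "order_id": order_id,
            "customer_id": rng.randint(1, 50),
            "amount": round(rng.uniform(5, 200), 2),
        }
        # Always draw the messiness roll so the clean values stay identical
        # across messiness levels for the same seed.
        if rng.random() < messiness:
            row["amount"] = None  # injected missing value
        rows.append(row)
    return rows


clean_orders = generate_orders(20, messiness=0.0)
messy_orders = generate_orders(20, messiness=0.5)
```

Because the random draws happen in the same order regardless of the messiness setting, a messy dataset can be compared row by row against its clean counterpart, which is handy when practicing data cleaning.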

Python • Pandas • SQLite • YAML

Dirty Birds Data Generator

Simulates penguin tagging data for ecological analysis, QA, and model prototyping. Inspired by Palmer Penguins, it adds custom randomness, messiness injection, and resight logic to reflect longitudinal field studies.

Python • Pandas • YAML • CSV

Certifications & Credentials

Google Data Analytics Certification

Professional certificate demonstrating proficiency in data cleaning, analysis, visualization, SQL, and R programming.

Google Advanced Data Analytics Certification

Advanced training in statistical analysis, regression models, and machine learning with Python.

Skills & Tools

Languages & Databases

Python • SQL • HTML/CSS • JavaScript • Google Apps Script • YAML • SQLite • BigQuery

Analysis & Visualization

Data Storytelling • Data Visualization • ETL & Data Quality • Looker / Tableau

Machine Learning & MLOps

Machine Learning • scikit-learn • Pydantic • MLflow • Jupyter / Colab

Cloud, Tools & Business

Google Cloud Platform • Google Workspace • Git / GitHub • Business & Operations • Process Optimization • Stakeholder Engagement

Thanks for Viewing

Thank you for exploring my work. I’m open to discussing entry‑level roles, apprenticeships, or contract opportunities where I can contribute, learn, and grow.

I’m actively continuing my education in data analytics and engineering, and I’m excited to keep building, collaborating, and improving every day.

Let's Connect

I'm always open to discussing new projects, creative ideas, or opportunities to be part of an amazing team. The best way to reach me is through email or LinkedIn.