About Me

A fresh look brought to you by DALL-E

Professional

I’m a data scientist and AI leader with 20 years of hands-on and executive experience at EY, PG&E, KPMG, Bloomreach, and Centene. My teams turn raw data into production-grade AI systems, deploying end-to-end ETL and inference pipelines, simulations and predictive models, interactive web apps, and low-latency APIs that steer strategy for Medicare, Medicaid, and Marketplace programs.

Beyond the day job, I chair the Southern California R Users Group (SoCal RUG), a nonprofit that hosts monthly meetups on R, Python, Julia, and all things data science. SoCal RUG also partners with the University of California and Chapman University to deliver SQL/R/Python bootcamps, git workshops, and hackathon support to data science graduate programs.

Data Science & MLOps Toolkit

  • GenAI & LLMs: Databricks, HuggingFace, Ollama (local development), Cline (open-source autonomous AI coding agent), vector DB + RAG (with LangChain / LangGraph, ChromaDB, DuckDB, Quiver, ragnar), MCP server and agent development, Microsoft’s MarkItDown

  • ML & Analytics:

    • R: tidyverse, tidymodels, XGBoost, Torch, TensorFlow, Keras, Shiny, leaflet, plotly, Quarto, ellmer, dbplyr, devtools, webR, DuckDB, and more
    • Python: pandas, scikit-learn, PyTorch, Streamlit, PySpark, MLflow, Polars, PyArrow, Shiny for Python, plotly, narwhals, Anyscale Ray, and more

  • Data Engineering: Databricks + Unity Catalog, Snowflake, Redshift, Google BigQuery, Teradata, Apache Arrow & Parquet & ADBC, DuckDB, Iceberg, Polars, AWS S3

  • DevOps + MLOps: git, CI/CD workflows (GitLab / GitHub), MLflow, Docker, Kubernetes, Posit Connect, Rancher

  • Other: JupyterLab, Agile Scrum (and supporting tools such as Jira and Miro), ServiceNow, Netlify, Alation (data lineage), Excel (still a secret weapon)

Media + Presentations