About Me

Professional

I’m a data scientist and AI leader with 20 years of hands-on and executive experience at EY, PG&E, KPMG, Centene, and Jeppesen ForeFlight. My teams turn raw data into production-grade AI systems, deploying end-to-end ETL and inference pipelines, simulations and predictive models, interactive web apps, and low-latency APIs that steer strategy for internal business partners and executive leadership.

Beyond the day job, I chair the Southern California R Users Group (SoCal RUG), a nonprofit that hosts monthly meetups on R, Python, Julia, and all things data science. SoCal RUG also partners with the University of California and Chapman University to deliver SQL/R/Python bootcamps, git workshops, and hackathon support to data science graduate programs.

Data Science & MLOps Toolkit

GenAI & LLMs: Databricks, Hugging Face, Ollama (local development), Cline (open-source autonomous AI coding agent), vector DB + RAG (with LangChain / LangGraph, ChromaDB, DuckDB, Quiver, ragnar), MCP server and agent development, Microsoft’s MarkItDown
ML & Analytics:
- R: tidyverse, tidymodels, XGBoost, Torch, TensorFlow, Keras, Shiny, leaflet, plotly, Quarto, ellmer, dbplyr, devtools, webR, DuckDB, and more
- Python: pandas, scikit-learn, PyTorch, Streamlit, PySpark, MLflow, Polars, PyArrow, Shiny for Python, plotly, narwhals, Anyscale Ray, and more
Data Engineering: Databricks + Unity Catalog, Snowflake, Redshift, Google BigQuery, Teradata, Apache Arrow & Parquet & ADBC, DuckDB, Iceberg, Polars, AWS S3
DevOps + MLOps: git, CI/CD workflows (GitLab / GitHub), MLflow, Docker, Kubernetes, Posit Connect, Rancher
Other: JupyterLab, Agile Scrum (and the tools surrounding it, e.g., Jira and Miro), ServiceNow, Netlify, Alation (data lineage), Excel (still a secret weapon)

Media + Presentations

[2025-03-18] Loyola presentation: Tidymodels: A Brief Intro to Modern Modeling with R
[2025-01-29] CSU Long Beach presentation: Big Data, Tiny Laptop
[2024-04-27] UC Irvine hackathon workshop: ETL with Arrow & DuckDB
[2023-09-26] SoCal RUG presentation: Highlights from Posit’s 2023 Annual Conference
[2023-09-25] R Consortium interview: Empowering Healthcare with R
[2023-03-21] SoCal RUG presentation: Build a Shiny App Demo as a Cover Letter Accessory
[2022-12-08] Posit’s Data Science Hangout: From Excel to Machine Learning
[2021-10-15] UC Irvine panel discussion: Latinx Initiative Conference
[2021-06-09] Grid Dynamics Data Points Conference: Healthcare Finance Beyond Excel
[2019-06-13] UC Irvine’s Newsroom: Scatter Podcast launched by Merage Student
[2019-05-20] Orange County RUG: About the Predictive Modeling Hackathon Winners
[2019-04-14] Forbes mention: Scatter Podcast debut