Data Science Bytes
  • Blog
  • Python & R
  • About
Categories
All (43)
arrow (3)
blogdown (2)
bslib (2)
chromadb (1)
cyphr (1)
docker (1)
dplyr (4)
dt (1)
duckdb (3)
duckplyr (1)
ggplot2 (4)
github (1)
huggingface (1)
langchain (1)
linux (2)
machine learning (4)
packages (3)
podcast (1)
polars (1)
positron (2)
privacy (1)
probably (1)
python (1)
quarto (1)
quick tips (3)
r6 (1)
rayshader (1)
renv (1)
rmarkdown (3)
shiny (5)
social good (1)
sodium (1)
statistics (5)
tibble (1)
tidymodels (6)
tidyverse (5)
visualizations (5)
webR (1)

Blog

Intro to Tidymodels

tidymodels
positron
Presentation materials from a recent talk I gave at Loyola Marymount University
Mar 24, 2025
Javier Orraca-Deatcu

Vector DB + RAG Maker

python
langchain
huggingface
chromadb
Introducing a vector database and Retrieval-Augmented Generation (‘RAG’) system for coding in R, designed to provide bleeding-edge responses from curated documentation.
Mar 7, 2025
Javier Orraca-Deatcu

Big Data, Tiny Laptop

positron
arrow
duckdb
polars
quarto
Presentation materials from last night’s Tech by the Beach x SoCal RUG x CSULB meetup
Jan 30, 2025
Javier Orraca-Deatcu

ETL with Arrow & DuckDB

dplyr
arrow
duckdb
duckplyr
Presentation materials from this weekend’s SoCal RUG x UC Irvine hackathon️
Apr 29, 2024
Javier Orraca-Deatcu

You Don’t Need OOP for Data Science

r6
tidyverse
tidymodels
For R users, the R6 package - among several object-oriented systems for R - brings forth the elegance of object-oriented programming (OOP) to a traditionally ‘stats first’ language. That said, do you need to know OOP to be a successful data scientist? No, probably not.
Nov 23, 2023
Javier Orraca-Deatcu

webR via Quarto Extensions

webR
dplyr
webR has been on my mind recently… With the recent release of webR 0.2.0, it’s time to dive in!
Aug 23, 2023
Javier Orraca-Deatcu

Impressing Hiring Teams with a Shiny App Demo

tidyverse
shiny
renv
bslib
Last week, I presented to SoCal RUG on how to build a Shiny app demo as a resume or cover letter accessory. This ‘accessory’ will impress hiring managers with whom you’re interviewing and help differentiate you from other data science applicants.
Mar 28, 2023
Javier Orraca-Deatcu

R Goodies for 2023

tidymodels
probably
arrow
duckdb
sodium
cyphr
Despite all of the noise of AI-generated art and ChatGPT, 2022 was a great year for data science. These are some of my favorite tricks that I will continue to explore in 2023.
Jan 15, 2023
Javier Orraca-Deatcu

Data Science Hangout

tidyverse
shiny
machine learning
tidymodels
I was recently a guest on Posit’s Data Science Hangout, hosted by Rachael Dempsey, and I had a blast. This is the recording where I talked about my transition from Excel-based financial modeling to machine learning at scale with R.
Jan 1, 2023
Javier Orraca-Deatcu

Build a Shiny App Demo

shiny
github
bslib
docker
During my interview process with Bloomreach, I developed an interactive Shiny app styled with the Bloomreach branding and color scheme. This tutorial will help you do the same to demo the power of Shiny to prospective employers.
Nov 15, 2022
Javier Orraca-Deatcu

Torch for R + luz

machine learning
The ‘torch for R’ ecosystem is a collection of extensions for torch, an R framework for machine learning and artificial intelligence based on PyTorch.
Oct 22, 2021
Javier Orraca-Deatcu

Beautiful Maps with Rayshader

rayshader
ggplot2
visualizations
If you love beautiful 2D and 3D maps, now you can create your own with elevation data, the rayshader package, OpenStreetMap, and ggplot2.
Sep 7, 2021
Javier Orraca-Deatcu

A Brief History of the Dataframe

tibble
Insightful read sourced from Towards Data Science on the history of the dataframe - From its origins in the S programming language, to R, to pandas for Python (to the tibble for R).
May 24, 2021
Javier Orraca-Deatcu

DT: An R Interface to the JavaScript library DataTables

dt
The R package DT provides an R interface to the JavaScript library DataTables. R data objects (matrices or data frames) can be displayed as tables on HTML pages, and DataTables provides filtering, pagination, sorting, interactivity with Shiny, and many other features.
May 3, 2021
Javier Orraca-Deatcu

rstudio::global tips, tricks, and more

tidymodels
RStudio’s annual conference saw roughly 17,000 attendees for their first global, all-virtual, 24-hour event. I attended several sessions throughout the day and I’ll highlight my favorite data bytes learned that day. I’ll also share relatable content for better modeling with R.
Jan 24, 2021
Javier Orraca-Deatcu

rsthemes: Customizing your RStudio IDE

quick tips
Change up your editor theme with {rsthemes}, a collection of themes to freshen up the RStudio IDE aesthetics.
Jan 19, 2021
Javier Orraca-Deatcu

Preparing for 2021 Goals with a Raspberry Pi

linux
I have been planning my 2021 goals and the Raspberry Pi 4 will help me kill a few birds with one stone.
Dec 31, 2020
Javier Orraca-Deatcu

One-Hour R Package Development Tutorial by Shannon Pileggi, PhD

packages
I followed this tutorial and created a package successfully. It took 45 minutes. Follow, this, guide, my fellow R friends!
Dec 19, 2020
Javier Orraca-Deatcu

What’s New in dbplyr 2.0.0

tidyverse
dplyr
dbplyr, a database backend for dplyr, just released v2.0.0 today… Awesome stuff here!
Nov 4, 2020
Javier Orraca-Deatcu

Detailed R Package Development Tutorial from Method Bites

packages
This is one of the best, most detailed, how-to guides for developing your own R package from A-to-Z
Sep 13, 2020
Javier Orraca-Deatcu

Shiny Voice-Activated input

shiny
Want to make your Shiny apps voice-interactive? Now it’s possible.
Jul 3, 2020
Javier Orraca-Deatcu

Testing dplyr 1.0.0

tidyverse
dplyr
Finally getting around to trying out dplyr 1.0.0… Love it!
Jun 20, 2020
Javier Orraca-Deatcu

Lying with Statistics

statistics
ggplot2
visualizations
It’s quite easy to manipulate raw data in a manner that ‘proves’ your point. For the sake of exploring this topic further, I’ll analyze police killing data and present it in three different ways.
Apr 19, 2020
Javier Orraca-Deatcu

D&D Adventures + blogdown

rmarkdown
blogdown
It’s an incredible time to be creative! I’ve recently been re-learning Dungeons & Dragons with a group of friends and having so much fun in the process.
Apr 5, 2020
Javier Orraca-Deatcu

themis: Extra Steps for tidymodels + recipes

tidymodels
themis contain extra steps for the recipes package for dealing with unbalanced data. The name themis is that of the ancient Greek goddess who is typically depicted with a balance.
Feb 20, 2020
Javier Orraca-Deatcu

Copy R Objects to Clipboard

quick tips
I was today years old when I learned that you could easily copy/paste R objects to your clipboard.
Feb 8, 2020
Javier Orraca-Deatcu

Hadley Wickham on SuperDataScience Podcast

podcast
My eyes were opened to the world of analytics and data science in part through Kirill Eremenko and his amazing SuperDataScience podcast.
Feb 4, 2020
Javier Orraca-Deatcu

Resampling with k-fold Cross Validation

statistics
machine learning
Click for a high-level recap of k-fold cross validation as a resampling method.
Feb 1, 2020
Javier Orraca-Deatcu

An Algorithm for Better Espresso

statistics
Coffee lovers… You might find this study fascinating. Finally, a model for the perfect cup of espresso!
Jan 22, 2020
Javier Orraca-Deatcu

Analyzing Earthquakes in Puerto Rico

statistics
ggplot2
visualizations
Puerto Rico has been experiencing atypical seismic activity since November 2019.
Jan 20, 2020
Javier Orraca-Deatcu

Create Your Own R Package

packages
If you are interested in developing your own R packages, this thorough A-to-Z tutorial resource is one to bookmark.
Jan 18, 2020
Javier Orraca-Deatcu

Writing to Excel from R

quick tips
Spending your days R but working closely with analysts and leadership that live in Excel? Get familiar with openxlsx.
Jan 9, 2020
Javier Orraca-Deatcu

California Consumer Privacy Act

privacy
The California Consumer Privacy Act is set to go into effect on January 1, 2020. The CCPA, similar in nature to GDPR, will provide California residents with new consumer data rights.
Jan 4, 2020
Javier Orraca-Deatcu

Holiday Expectations vs Reality

Holiday expectations…
Jan 1, 2020
Javier Orraca-Deatcu

Ubuntu Update to 19.10

linux
Things not to do with Linux, learned the hard way, as we approach the end of 2019…
Dec 31, 2019
Javier Orraca-Deatcu

Book Rec: Hands-On Machine Learning with R

machine learning
Digging into Hands-On Machine Learning with R by Brad Boehmke, PhD, and Brandon Greenwell.
Dec 17, 2019
Javier Orraca-Deatcu

Spreading Joy with Java Joy

social good
Bay Area friends… Next time your organization needs coffee for an event, quarterly meeting, all-hands, or lunch and learn, reach out to Java Joy!
Dec 15, 2019
Javier Orraca-Deatcu

Education Reform and Lessons from Finland

statistics
Quartz and Google’s Avinash Kaushik share insights comparing national education systems, test performance, and time spent on learning.
Dec 4, 2019
Javier Orraca-Deatcu

Building a Website with Blogdown

rmarkdown
blogdown
Learn how to build and deploy a website with R, blogdown, GitHub, Hugo, and Netlify.
Nov 18, 2019
Javier Orraca-Deatcu

15 Tips for Making Better Use of R

rmarkdown
If you are an R user not using R Markdown, these tricks could be helpful for you.
Nov 11, 2019
Javier Orraca-Deatcu

Shiny Developer Expert Course

shiny
Matt Dancho and Business Science introduce a new R Shiny Expert Developer course.
Nov 5, 2019
Javier Orraca-Deatcu

Visual & Data Journalism at BBC

ggplot2
visualizations
The BBC News data science and visualization team published this great overview on their analytics and visualizations lessons learned over the course of 1.5 years.
Oct 22, 2019
Javier Orraca-Deatcu

Information is Beautiful 2019 Awards

visualizations
Visualization professionals present their best at the annual Information is Beautiful Awards show.
Oct 19, 2019
Javier Orraca-Deatcu
No matching items
     
    Cookie Preferences