Clustering & Classification (2 of 11): Centroid-Based Models

The Advanced R Series: Clustering & Classification

Data rarely comes neatly labeled or structured, yet patterns still exist—even when they are not immediately obvious. Clustering and classification methods allow researchers to uncover structure in their data, group similar observations, and reduce dimensionality without imposing rigid assumptions about the underlying relationships.

This series introduces researchers to statistical and machine-learning methods for grouping, modeling, and interpreting high-dimensional data. Participants will learn a broad range of approaches—from hierarchical and centroid-based models to probabilistic, fuzzy, density-based, graph-based, and mixed-type clustering techniques—along with strategies for dimensionality reduction and fairness considerations. Emphasis is placed on understanding model assumptions, evaluating model performance, and selecting methods that align with the characteristics of the data rather than forcing data to fit inappropriate models.

All workshops use R and RStudio, so some experience with R or another programming language is encouraged but not required. See R Fundamentals for Data Analysis for an introduction to R and RStudio; attendees without prior experience are encouraged to review this content beforehand.

Centroid-Based Models (workshop 2 of 11): This session focuses on centroid-based approaches such as k-means and k-medoids. We examine initialization strategies, distance metrics, convergence behaviour, and model diagnostics. Participants will learn to fit these models in R, compare clustering quality, and understand when centroid-based methods are most appropriate.

Application: Customer Segmentation Data

Questions? Please reach out to the Centre for Scholarly Communication at csc.ok@ubc.ca.

A full schedule of workshops can be found at csc.ok.ubc.ca/workshops/

Date: Monday, January 19, 2026
Time: 3:30pm - 4:30pm
Room: LIB 111
Location: Okanagan - Centre for Scholarly Communication
Audience: Faculty, Graduate, Post-Doc, Staff, Undergraduate
Categories: Data
Presenter(s): Jesse Ghashti

Registration is required. There are 30 in-person seats and 50 online seats available.

