Explain and visualize how kmeans and kmeans++ work.
I am a Master’s student at the Northeastern University. I am currently researching on algorithms to speed up the computation of Influence Functions used for deep learning model interpretation and debugging. I am also interested in working on projects in the domain of NLP and Computer Vision.
Master's in Computer Science, 2024
Northeastern University
Undergraduate in Statistical Practice, 2018
University of Minnesota Twin Cities
Undergraduate in Psychology, 2018
University of Minnesota Twin Cities
Explain and visualize how kmeans and kmeans++ work.
Self-define a cross validation function for classifier using R.
Create a KNN classifier using R from scratch.
Basic theories, simulation and visualizations of Multiple Linear Regression.
Use machine learning technique to find business strategies in telemarketing campaigns.
Use the famous dimension reduction technique, PCA, to compress image size.
tidy dataset allows R’s vectorized nature to shine. Most built-in R functions work with vectors of values. That makes transforming tidy data feel particularly natural.
Importing data is an essential step in data modelling. This post discusses about readr package and ways to use functions from it to import data quickly and reproducibly. Writing data to a file is also mentioned.
Tibbles are tweaked data frames but it works better than the old data.frame. The tibble package will be discussed at here.
Learn how to use various plots to unearth the hidden pattern of data during Exploratory Data Analysis.
Learn how to combine WHERE clauses to create sophisticated search conditions. Also learn how to use the NOT and IN operators.
Learn what wildcards are, how they are used, and how to perform wildcard searches using the LIKE operator for sophisticated filtering.
Learn what SQL is and review some basic database terminology.
Learn how to use the SELECT statement to retrieve one or more columns of data from a table. Code commenting and returning distinct values are introduced too.
Learn how to use the SELECT statement’s ORDER BY clause to sort retrieved data as needed.
Learn how to use the SELECT statement’s WHERE clause to specify search conditions.
related to statistics and machine learnings
These are some of the classes that I took :
My CV is available in PDF form.