Jeremy Jordan
  • Home
  • About
  • Data Science
  • Reading List
  • Quotes
  • Life
  • Favorite Talks
  • Materials Science
Sign in

Data Science

Cool things I've learned about data science.
Data Science

Decision trees for classification.

In this third post on supervised machine learning classifiers, I'll be talking about one of the oldest and most widely used techniques - decision trees. Decision trees work well with noisy or missing data and are incredibly fast at runtime. They're additionally nice because you can
14 Mar 2017 5 min read
Data Science

Grouping data points with k-means clustering.

K-means clustering is a simple method for partitioning $n$ data points in $k$ groups, or clusters. Essentially, the process goes as follows: 1. Select $k$ centroids. These will be the center point for each segment. 2. Assign data points to nearest centroid. 3. Reassign centroid value to be the calculated
15 Dec 2016 7 min read
Data Science

Identifying related bodies of text using TF-IDF vectorization.

Term frequency-inverse document frequency (TF-IDF) vectorization is a mouthful to say, but it's also a simple and convenient way to characterize bodies of text. Due to its simplicity, this method scales better than some other topic modeling techniques (latent dirichlet allocation, probabilistic latent semantic indexing) when dealing with
07 Oct 2016 4 min read
← Newer Posts Page 3 of 3
Jeremy Jordan © 2025
Powered by Ghost