A Collection of Bad Ideas
  • Blog
  • Projects
  • About Me

Hanchung Lee


A Collection of Bad Ideas

Managing Multiple AWS and Git profiles

Setting Proper Config To Avoid Pushing Code to the Wrong Place

Posted on April 15, 2021

It is common to have hundreds to thousands of features available for a given machine learning problem. As a data scientist and machine learning practitioner, the first step we have to do after defining the problem is to select the optimal subset of features. Or, in another sense, we will... [Read More]
Tags: Software Engineering AWS Git

Feature Selection and Dimensionality Reduction

Does this feature spark joy using using Scikit Learn and Pandas

Posted on April 12, 2021

It is common to have hundreds to thousands of features available for a given machine learning problem. As a data scientist and machine learning practitioner, the first step we have to do after defining the problem is to select the optimal subset of features. Or, in another sense, we will... [Read More]
Tags: Machine Learning Feature Engineering

Evaluation of Clustering Algorithms for Information Retrieval

Using F-measure to evaluate clustering over pairs of points

Posted on November 17, 2020

A common question for clustering is that, once we cluster documents (e.g., articles, images, etc) together, how do we determine how good is the clustering results given ground truth clusters? [Read More]
Tags: Evaluation Clustering Machine Learning

Detecting Election Irregularities

Using Benford's Law for irregularity detection in natural numbers

Posted on November 6, 2020

As 2020 General Election draws to a conclusion, the losing side is, as usual, raising questions about potential election fraud. So we believe it would be interesting to see if we can use Benford’s law to detect irregularities. [Read More]
Tags: Fraud Detection Machine Learning

Reformer Presentation at Weights and Biases Deep Learning Salon

Weights and Biases is awesome

Posted on April 26, 2020

Recently I had the opportunity to give a talk at Weights and Biases’ Deep Learning Salon. I find Reformers to be an very interesting paper where it combines a lot of computer science techniques to deep neural networks. The talk has been recorded and published on Youtube. Please enjoy the... [Read More]
Tags: Presentations Machine Learning
  • Older Posts →
  • RSS
  • Email me
  • GitHub
  • Twitter
  • LinkedIn

Hanchung Lee  •  2021  •  leehanchung.github.io

Theme by beautiful-jekyll