Writing
Flaregun - A Tiny PyTorch Helper Library
Notes for Harvard's CS 287 NLP Course
NOTE: These are all taken from Chris Tanner’s great NLP course CS 287 (taught at Harvard). None of this is my own work. This is just a collection of screenshots and notes from my own reading of the course’s lecture slides for my own reference and understanding. I would highly recommend reading the full lecture slides available here.
Helpful Linux Commands
How to Analyze Memory Usage of Folders/Files on your Mac
Three ways to view memory usage (i.e. “disk usage”) on your Mac, in descending order of preference.
Walkthrough of the OMOP CDM (Part 1)
In Part 1 of this series on the OMOP CDM, we explain what the OMOP CDM is, define key terms such as “concept”, “source value”, “vocabulary”, and “domain”, and describe how they related to each other.
Python Line-by-Line Profiling of a Program's Speed
How long does each line in your Python program take to run?
How are Stanford's STARR database,the OMOP Common Data Model, and Epic's EHR Related?
In this post, I explain what Epic (Chronicles, Clarity, etc.), OMOP CDM, and Stanford STARR are, how they are all related, and what the benefits/features of each are.
How to connect VSCode to your remote server via SSH
If you’ve ever had to ssh
into a server to run programs, you may be taking an unnecessary productivity hit each time you relegate yourself to coding in a Jupyter notebook on localhost:8000
.
Plotting the Distribution of MLB Batting Statistics Over Time
Most people know that a batting average over .300 is the mark of a great hitter, and that hitting .200 will land you on the bench.
How to Publish Jupyter .ipynb Notebooks to a Jekyll Static Blog
Goal: Publish a .ipynb
on my Jekyll static site as painlessly as possible.
Matplotlib Tips + Tricks
These are all taken from this great 3-hr YouTube tutorial by Ben Root from SciPy 2018. I’ve condensed the main takeaways of the talk into the following list of key concepts / tricks that I hadn’t previously been aware of.
Combining ROC Curves with Indifference Curves to Measure an ML Model's Utility
For the below examples, assume we have a binary classification task where the class label is $y \in {0, 1}$, and the model’s predictions are $\hat{y} \in {0, 1}$
Sensitivity + Specificity + PPV + TP/FP/TN/FN Formulas
A brief cheat sheet / reference guide containing the definitions, formulas, and explanations of the most commonly used model evaluation metrics for binary classification tasks.
Notes on Vim Tutor
I’ve always been intimidated by vim
. After taking MIT’s Missing Semester Course (a free online course that I’d highly recommend), I learned about a built-in utility called vimtutor
that automatically comes installed with vim
.
Tips to Free Up Mac Storage
tl;dr – If you have XCode installed, then you could easily be wasting up to 10% of your disk space.
Malcolm Gladwell v. Chess
In Season 4, Episode 3 of his podcast Revisionist History, Malcolm Gladwell argues against the use of the LSAT in law school admissions. A full transcript of the episode can be read here.