Lessons Learned from using HuggingFace for LLM Inference in Google Colab

December 8, 2024

My original goal was to set up an llm quickly in a google colab python notebook (because I didn’t want to execute locally + I wanted access to an nvidia gpu quickly for free). I originally was looking at Ollama but its client/server architecture didn’t seem to elegantly nicely with google colab. Side note, see this great blog post for an understanding of Ollama’s architecture.

Read More

Intro to Tokenizers

October 5, 2024

Notes on the tokenizer space, what they are, what problems they solve, some example code, and some areas to explore in the future.

Read More

May 2024 Summary of Resources

June 1, 2024

Summary of resources I read/watched in May. The focus of this month was to get familiar with terms and concepts underlying Generative AI.

Read More

Computer Hardware and Architecture - A Bird’s Eye View

June 13, 2021

In this blog post we take a step lower from our previous blog posts on compilers/virtual machines and operating systems. We will string together how physics and hardware are built and used to execute our abstract thoughts and ideas illustrated in higher level languages.

Read More

The Producer Consumer Problem in C++

May 30, 2021

We will go over a solution to the Producer Consumer problem in concurrency with multiple producers and consumers in a buffer of bounded size. The solution is written in C++ and uses mutexes and condition variables. This post is based off of the blog post here by Baptiste Wicht.

Read More

Operating Systems in a Nutshell

May 8, 2021

This blog post introduces Operating Systems and the pieces involved, specifically: processes and threads, drivers, file systems, and virtual memory.

Read More

Defusing CMU's Bomb Lab using GDB

August 28, 2020

This post walks through CMU’s ‘bomb’ lab, which involves defusing a ‘bomb’ by finding the correct inputs to successive phases in a binary executable using GDB.

Read More

SIR Model From Scratch in Python

August 6, 2020

This post explains the SIR model and includes a Python implementation that generates a graphic describing a population’s infectious status over time.

Read More