Posts by Year

2024

Serving Fastchat - Personal Journey

Serving fastchat for people to experiment with various LLMs. This guide also incluides setting up Vllm to serve multiple models on a single GPU.

Back to Top ↑

2023

Back to Top ↑

2022

Mixed Precision Training

Less memory more speeeeedddd. Training Models with mixed precision for lower memory footprint, and faster training.

LazyPredict

I am just too lazy to compare multiple Machine Learning algorithms.

Back to Top ↑