Recent Posts

Serving Fastchat - Personal Journey

Serving fastchat for people to experiment with various LLMs. This guide also incluides setting up Vllm to serve multiple models on a single GPU.