Serving Fastchat - Personal Journey
Serving fastchat for people to experiment with various LLMs. This guide also incluides setting up Vllm to serve multiple models on a single GPU.
Serving fastchat for people to experiment with various LLMs. This guide also incluides setting up Vllm to serve multiple models on a single GPU.
A take on trying to help understand LLMs and Transformers - Now the dataset!
A take on trying to help understand LLMs and Transformers - In a code first approach!
Arch linux makes it better to manage deep learning system, and understand the system better.
Combining Keras and JAX as a backend, makes JAX to be meant for Humans