Tuning for Efficient Inferencing with vLLM on MI300X
Over the past couple of weeks I’ve been testing on an 8 x AMD MI300X node provided by Hot Aisle. I’ll have an article on some of my experiments training with MI300s…
Oct 24, 2024
Leonard Lin
An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct
All models have biases and most Instruct/Chat models are aligned for “safety”, with Western moral biases, etc. There’s spirited debate on when and where those lines should…
Jun 11, 2024
Leonard Lin
Evaling llm-jp-eval (evals are hard)
With training of shisa-v2 starting in earnest, I’ve been digging a bit more into llm-jp-eval, which I used as a quick and simple benchmark to help track shisa-v1…
May 24, 2024
Leonard Lin
Shisa 7B
Shisa 7B was the original Japanese-English bilingual model that kicked everything off. The model card is posted here for posterity.
Dec 6, 2023
Leonard Lin