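Shisa V2 Series Officially Released: Next-Generation Bilingual LLMs Specialized for Japanese Tasks, Available for Free
This page is a local copy of our official "Shisa V2" press release. The official version is available here.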
Apr 22, 2025
Shisa.AI
Shisa V2
We’re proud to announce Shisa V2, the latest generation of our bilingual Japanese-English language models from Shisa.AI. Over the past few months, our team has been pushing…
Apr 14, 2025
Leonard Lin
Llama 4 Japanese Performance
Last weekend Meta launched Llama 4, starting with two models: Scout - a 17B active parameter, 16 expert (109B total parameter) model, and Maverick - a 17B active parameter…
Apr 11, 2025
Leonard Lin
1 Million Downloads of shisa-gamma-7b-v1!
Exactly one year ago, Sakana AI first published Evolving New Foundation Models: Unleashing the Power of Automating Model Development, which used our shisa-gamma-7b-v1 as…
Mar 21, 2025
Leonard Lin
Tuning for Efficient Inferencing with vLLM on MI300X
Over the past couple of weeks I’ve been doing testing on an 8 x AMD MI300X node provided by Hot Aisle. I’ll have an article on some of my experiments training with MI300s…
Oct 24, 2024
Leonard Lin
An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct
All models have biases and most Instruct/Chat models are aligned for “safety”, with Western moral biases, etc. There’s spirited debate on when and where those lines should…
Jun 11, 2024
Leonard Lin
Copyright and AI Training Data in Japan
Currently, per Japanese copyright law (PDF), re-affirmed as current policy in April 2023 by Keiko Nagaoka, the Japanese Minister of Education, Culture, Sports, Science, and…
May 25, 2024
Leonard Lin
Evaling llm-jp-eval (evals are hard)
With training of shisa-v2 starting in earnest, I’ve been digging a bit more into llm-jp-eval, which I used as a quick and simple benchmark to help track shisa-v1…
May 24, 2024
Leonard Lin
Sakana AI Evolves Models with shisa-gamma-7b-v1
Sakana AI just published some exciting new work on Evolutionary Model Merges of LLMs, applying evolutionary techniques to discover optimal ways of combining different models.…
Mar 21, 2024
Leonard Lin
Shisa 7B
Shisa 7B was the original Japanese-English bilingual model that kicked everything off. The model card is posted here for posterity.
Dec 6, 2023
Leonard Lin