Shisa.AI
  • About
  • 🤗

Shisa V2 405B: Japan’s Highest Performing LLM

We are incredibly excited to announce one more addition to the Shisa V2 family of open-source, SOTA JA/EN bilingual models: Shisa V2 405B.

Jun 3, 2025
Leonard Lin

Shisa.AI、国産モデルで最高性能を誇る多言語対応LLMを開発

〜GPT-4を超える日本語性能を実現、本日モデルをオープンソースで公開〜

Jun 3, 2025
Shisa.AI

Qwen 3 Japanese Performance

Similar to our previous Llama 4 Japanese Performance review, here’s an initial one for Alibaba’s latest Qwen 3 release. This is going to be more of a first look/preview, and…
May 1, 2025
Leonard Lin

Shisa V2シリーズ正式リリース:日本語タスクに特化した次世代バイリンガルLLMを無料公開

~複数モデルクラスにおいて日本語ベンチマーク最高スコアを達成~

Apr 22, 2025
Shisa.AI

Shisa V2

We’re proud to announce Shisa V2, the latest generation of our bilingual Japanese-English language models from Shisa.AI.

Apr 14, 2025
Leonard Lin

Llama 4 Japanese Performance

Last weekend Meta launched Llama 4, starting with two models: Scout - a 17B active parameter, 16 expert (109B total parameter) model, and Maverick - a 17B active parameter…
Apr 11, 2025
Leonard Lin

1 Million Downloads of shisa-gamma-7b-v1!

Exactly, one year ago, Sakana AI first published Evolving New Foundation Models: Unleashing the Power of Automating Model Development, which used our shisa-gamma-7b-v1 as…
Mar 21, 2025
Leonard Lin

Tuning for Efficient Inferencing with vLLM on MI300X

Over the past couple weeks I’ve been doing testing on an 8 x AMD MI300X node provided by Hot Aisle. I’ll have an article on some of my experiments training with MI300’s…
Oct 24, 2024
Leonard Lin

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

All models have biases and most Instruct/Chat models are aligned for “safety”, with Western moral biases, etc. There’s spirited debate on when and where those lines should…
Jun 11, 2024
Leonard Lin
 

Copyright and AI Training Data in Japan

Currently, per Japanese copyright law (PDF), re-affirmed as current policy in April 2023 by Keiko Nagaoka, the Japanese Minister of Education, Culture, Sports, Science, and…
May 25, 2024
Leonard Lin

Evaling llm-jp-eval (evals are hard)

With training of shisa-v2 starting in earnest, I’ve been digging a bit more into llm-jp-eval, which I used as a quick and simple benchmark to help to track shisa-v1…
May 24, 2024
Leonard Lin

Sakana AI Evolves Models with shisa-gamma-7b-v1

Sakana AI just published some exciting new work on Evolutionary Model Merges of LLMs, applying evolutionary techniques to dicover optimal ways of combining different models.…
Mar 21, 2024
Leonard Lin

Shisa 7B

Shisa 7B was the original Japanese-English bilingual model that kicked everything off. The model card is posted here for posterity.
Dec 6, 2023
Leonard Lin
No matching items