Llana 3.2 is out — calmer, more careful, slightly slower
Llana 3.2 is a refresh focused on refusal calibration, long-form coherence, and code review quality. We are slightly slower than 3.1 in raw tokens-per-second; we think the trade is right. Notes inside.
Read the post →