We tried out Google’s new family of multi-modal models with variants compact enough to work on local devices. They work well.
I put GPT-5.5 through a 10-round test: It scored 93/100, losing points only for exuberance ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results