Meta’s benchmarks for its new AI fashions are a bit deceptive | TechCrunch

One of many new flagship AI fashions Meta launched on Saturday, Maverick, ranks second on LM…

Anthropic used Pokémon to benchmark its latest AI mannequin | TechCrunch

Anthropic used Pokémon to benchmark its latest AI mannequin. Sure, actually. In a weblog post printed…

These researchers used NPR Sunday Puzzle inquiries to benchmark AI ‘reasoning’ fashions | TechCrunch

Each Sunday, NPR host Will Shortz, The New York Occasions’ crossword puzzle guru, will get to…

Meta launches new program to enhance speech and translation AI | TechCrunch

Meta is launching a brand new program in partnership with UNESCO to gather speech recordings and…