Can a 4B Model Beat Claude in a Mech Arena? (BIF Ep 9)

Josh came in with one goal: resist showing us the 3D demo he spent two weeks building. He succeeded. Barely.

Instead we talked about the actual interesting stuff — why he's building systems for AI agents instead of humans, how a mech arena became his favorite benchmarking tool, and what happens when you let Claude playtest your game engine for you.

Also: two Blackwell GPUs, a Glicko rating system, and the most cursed MMORPG premise we've ever heard.

  • 00:00 Intro — Mike's late, just Joe and Josh
  • 01:04 Six weeks later: redefining success
  • 02:34 Enter Matt, the PM partner
  • 04:35 Federated agents and the mech arena
  • 07:44 BattleBots but autonomous
  • 09:20 Why he killed the 3D demo
  • 17:32 Mike finally shows up
  • 20:58 How the arena actually works
  • 24:55 Josh's model stack and hardware ceiling
  • 35:48 Monday idea, Friday artifact
  • 38:23 BYO LLM: prompt it, throw it in, watch it lose
  • 43:15 Tournaments, ratings, Steam for agents
  • 51:28 Token fight economics
  • 54:16 Three-day POC rule and wrapping up