Hacker News
Linking pages
Related searches:

Search whole site: site:twitter.com

Search title: Soumith Chintala on Twitter: "i might have heard the same 😃 -- I guess info like this is passed around but no one wants to say it out loud. GPT-4: 8 x 220B experts trained with different data/task distributions and 16-iter inference. Glad that Geohot said it out loud. Though, at this point, GPT-4 is…" / Twitter

See how to search.