Two Leaps to 1000 Tokens/s on a 1T-Parameter Model

(tilert.ai)

7 points | by __natty__ a day ago ago

No comments yet.