FlexGen: Running large language models on a single GPU
13 by behnamoh | 0 comments on Hacker News.
0 https://ift.tt/9u3PtY8 13 FlexGen: Running large language models on a single GPU
13 by behnamoh | 0 comments on Hacker News.
0 https://ift.tt/9u3PtY8 13 FlexGen: Running large language models on a single GPU
Comments
Post a Comment