New top story on Hacker News: Show HN: sllm – Split a GPU node with other developers, unlimited tokens

Show HN: sllm – Split a GPU node with other developers, unlimited tokens
13 by jrandolf | 1 comments on Hacker News.
Running DeepSeek V3 (685B) requires 8×H100 GPUs which is about $14k/month. Most developers only need 15-25 tok/s. sllm lets you join a cohort of developers sharing a dedicated node. You reserve a spot with your card, and nobody is charged until the cohort fills. Prices start at $5/mo for smaller models. The LLMs are completely private (we don't log any traffic). The API is OpenAI-compatible (we run vLLM), so you just swap the base URL. Currently offering a few models.

Running DeepSeek V3 (685B) requires 8×H100 GPUs which is about $14k/month. Most developers only need 15-25 tok/s. sllm lets you join a cohort of developers sharing a dedicated node. You reserve a spot with your card, and nobody is charged until the cohort fills. Prices start at $5/mo for smaller models. The LLMs are completely private (we don't log any traffic). The API is OpenAI-compatible (we run vLLM), so you just swap the base URL. Currently offering a few models. 1 https://ift.tt/29zrWCE 13 Show HN: sllm – Split a GPU node with other developers, unlimited tokens

weightlohealt

Search This Blog

New top story on Hacker News: Show HN: sllm – Split a GPU node with other developers, unlimited tokens

Comments

Post a Comment

diet weight loss

helth