slothfulhamster
2d ago
MoonshotAI unveils Kimi's large-scale LLM serving architecture
arxiv.org
18
1
ervinxie
I have been wondering how online generative AI services can serve so many requests. This really gives me an explanation.