New Server Hopes to Break Through AI’s “Memory Wall”

# A New Server Could Speed Up AI Chatbots by Solving a Hidden Bottleneck

AI chatbots are currently slowed down by memory constraints: the limiting factor is the speed at which they can retrieve and process information, not the size of their "brain." A startup called Majestic Labs is building a server with 60 times more memory than Nvidia's current best option, betting that this will let AI models generate responses faster and more efficiently. The company believes this approach will be cheaper than the way competitors such as Nvidia are currently scaling up their AI systems.

Memory is arguably the most serious constraint on modern AI large language models (LLMs). According to one influential paper, LLM token generation is an inherently memory-bound task, meaning the rate at which models output text is limited by how quickly data can be read in from memory. The severity
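
To make the memory-bound claim concrete, here is a rough back-of-envelope sketch. It assumes a simplified model in which every model weight must be read from memory once per generated token, so the output rate can be no higher than memory bandwidth divided by model size in bytes. The function name and all of the numbers (parameter count, bytes per parameter, bandwidth) are illustrative assumptions, not figures from the paper or from Majestic Labs, and the estimate ignores batching, caching, and compute time.

```python
def max_tokens_per_second(param_count, bytes_per_param, bandwidth_bytes_per_s):
    """Rough upper bound on single-stream token generation rate.

    Simplifying assumption: each generated token requires reading every
    model weight from memory exactly once, and nothing else limits speed.
    """
    if param_count <= 0 or bytes_per_param <= 0 or bandwidth_bytes_per_s <= 0:
        raise ValueError("all inputs must be positive")
    bytes_read_per_token = param_count * bytes_per_param
    return bandwidth_bytes_per_s / bytes_read_per_token


# Hypothetical example values, chosen only for illustration.
params = 70e9        # a 70-billion-parameter model
bytes_each = 2       # 16-bit weights
bandwidth = 3.0e12   # 3 TB/s of memory bandwidth

rate = max_tokens_per_second(params, bytes_each, bandwidth)
print(f"{rate:.1f} tokens/s")  # prints "21.4 tokens/s"
```

Under this simplified model, doubling memory bandwidth doubles the ceiling on output speed, while adding more arithmetic units changes nothing, which is the sense in which token generation is described as memory-bound.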



