Nvidia and DataStax just made generative AI smarter and leaner

Nvidia and DataStax launched new technology today that dramatically reduces storage requirements for companies deploying generative AI systems, while enabling faster and more accurate information retrieval across multiple languages.

The new Nvidia NeMo Retriever microservices, integrated with DataStax’s AI platform, cuts data storage volume by 35 times compared to traditional approaches — a crucial capability as enterprise data is projected to reach more than 20 zettabytes by 2027.

“Today’s enterprise unstructured data is at 11 zettabytes, roughly equal to 800,000 copies of the Library of Congress, and 83% of that is unstructured with 50% being audio and video,” said Kari Briski, VP of product management for AI at Nvidia, in an interview with VentureBeat. “Significantly reducing these storage costs while enabling companies to effectively embed and retrieve information becomes a game changer.”

The technology is already proving transformative for Wikimedia Foundation, which used the integrated solution to reduce processing time for 10 million Wikipedia entries from 30 days to under three days. The system handles real-time updates across hundreds of thousands of entries being edited daily by 24,000 global volunteers.

“You can’t just rely on large language models for content – you need context from your existing enterprise data,” explained Chet Kapoor, CEO of DataStax. “This is where our hybrid search capability comes in, combining both semantic search and traditional text search, then using Nvidia’s re-ranker technology to deliver the most relevant results in real-time at global scale.”

Enterprise data security meets AI accessibility

The partnership addresses a critical challenge facing enterprises: how to make their vast stores of private data accessible to AI systems without exposing sensitive information to external language models.

“Take FedEx — 60% of their data sits in our products, including all package delivery information for the past 20 years with personal details. That’s not going to Gemini or OpenAI anytime soon, or ever,” Kapoor explained.

The technology is finding early adoption across industries, with financial services firms leading the charge despite regulatory constraints. “I’ve been blown away by how far ahead financial services firms are now,” said Kapoor, citing Commonwealth Bank of Australia and Capital One as examples.

The next frontier for AI: Multimodal document processing

Looking ahead, Nvidia plans to expand the technology’s capabilities to handle more complex document formats. “We’re seeing great results with multimodal PDF processing — understanding tables, graphs, charts and images and how they relate across pages,” Briski revealed. “It’s a really hard problem that we’re excited to tackle.”

For enterprises drowning in unstructured data while trying to deploy AI responsibly, the new offering provides a path to make their information assets AI-ready without compromising security or breaking the bank on storage costs. The solution is available immediately through the Nvidia API catalog with a 90-day free trial license.

The announcement underscores the growing focus on enterprise AI infrastructure as companies move beyond experimentation to large-scale deployment, with data management and cost efficiency becoming critical success factors.

The post Nvidia and DataStax just made generative AI smarter and leaner — here’s how appeared first on Venture Beat.

Nvidia and DataStax just made generative AI smarter and leaner — here’s how

Arte y básquetbol sin fronteras: Abby Aceves y Estefania Ajcip son voces de la migración y la resiliencia

Why I stopped arguing with my Trump-hating relatives

I moved from Chicago to San Diego for love. My friends were jealous, but I couldn’t leave the California city fast enough.

Elon Musk’s DOGE overdrive, Trump bashes Boeing, and supersonic planes: Business news roundup

‘Severance’s Tramell Tillman And Patricia Arquette Unpack Milchick’s Haunting Performance Review: “The Rules Just Keep Changing”

The Art of Splitting Up

France still looking to block EU-Mercosur trade deal, Macron says

Why ‘Stranger Things’ star Matthew Modine didn’t buy Millie Bobby Brown a wedding present

Officials Fired at Traffic Safety Agency Investigating Musk’s Company

Trump Fires Chairman of the Joint Chiefs of Staff and Two Other Military Officers

Middle East: Hamas releases 6 Israeli hostages from Gaza

World’s smallest pacemaker treats baby’s dangerous heart condition

Nvidia and DataStax just made generative AI smarter and leaner — here’s how

NASCAR Driver Shares Heartwarming Message About First Victory

JONATHAN TURLEY: Why defamation suit against Whoopi Goldberg could be piece of cake

John Clements, Whose Research Saved Thousands of Babies, Dies at 101

Trending Posts

Dogs With ‘Beautiful Friendship’ Despite Age Gap Win Pet of the Week

Sherman Oaks Notre Dame upsets Harvard-Westlake in Open Division playoffs

‘The Sticky’ Canceled After One Season At Amazon

We got new carpet in over 50% of our house. Here are 8 things I wish I’d known before we started.

Meghan Markle fights to show she really is authentic in new Netflix series: ‘Nobody knows who she is’

Site Navigation

Nvidia and DataStax just made generative AI smarter and leaner — here’s how

Enterprise data security meets AI accessibility

The next frontier for AI: Multimodal document processing

Trending Posts

Site Navigation

Follow Us