Nvidia’s Grace Hopper Superchips for generative AI enter full production

May 29, 2023

Nvidia announced that the Nvidia GH200 Grace Hopper Superchip is in full production, set to power systems that run complex AI programs.

Also targeted at high-performance computing (HPC) workloads, the GH200-powered systems join more than 400 system configurations based on Nvidia’s latest CPU and GPU architectures, including Nvidia Grace, Nvidia Hopper and Nvidia Ada Lovelace, created to help meet the surging demand for generative AI.

At the Computex trade show in Taiwan, Nvidia CEO Jensen Huang revealed new systems, partners and additional details surrounding the GH200 Grace Hopper Superchip, which brings together the Arm-based Nvidia Grace CPU and Hopper GPU architectures using Nvidia NVLink-C2C interconnect technology.

This delivers up to 900GB/s total bandwidth — or seven times higher bandwidth than the standard PCIe Gen5 lanes found in traditional accelerated systems, providing incredible compute capability to address the most demanding generative AI and HPC applications.
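The "seven times" figure can be sanity-checked with a rough calculation. The sketch below assumes, since the article does not say, that the baseline is a PCIe 5.0 x16 link counted in both directions with 128b/130b encoding; the constant and function names are illustrative only.

```python
# Back-of-the-envelope check of the "seven times PCIe Gen5" comparison.
# Assumption (not stated in the article): the baseline is a PCIe 5.0 x16 link,
# counted in both directions, with 128b/130b line encoding.

NVLINK_C2C_GBPS = 900.0          # total bandwidth quoted in the article, GB/s

PCIE5_GT_PER_LANE = 32.0         # PCIe 5.0 raw rate per lane, GT/s
PCIE5_ENCODING = 128.0 / 130.0   # 128b/130b encoding efficiency
LANES = 16                       # assumed x16 link


def pcie5_bidirectional_gbps(lanes: int = LANES) -> float:
    """Approximate usable PCIe 5.0 bandwidth in GB/s, both directions combined."""
    per_lane_one_way = PCIE5_GT_PER_LANE * PCIE5_ENCODING / 8.0  # bits -> bytes
    return per_lane_one_way * lanes * 2


pcie_gbps = pcie5_bidirectional_gbps()
ratio = NVLINK_C2C_GBPS / pcie_gbps
print(f"PCIe 5.0 x16 (bidirectional): {pcie_gbps:.1f} GB/s")
print(f"NVLink-C2C / PCIe ratio: {ratio:.1f}x")
```

Under these assumptions the PCIe link comes out at roughly 126 GB/s, which puts the ratio at about seven, consistent with the quoted claim.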

“Generative AI is rapidly transforming businesses, unlocking new opportunities and accelerating discovery in healthcare, finance, business services and many more industries,” said Ian Buck, vice president of accelerated computing at Nvidia, in a statement. “With Grace Hopper Superchips in full production, manufacturers worldwide will soon provide the accelerated infrastructure enterprises need to build and deploy generative AI applications that leverage their unique proprietary data.”

Global hyperscalers and supercomputing centers in Europe and the U.S. are among several customers that will have access to GH200-powered systems.

“We’re all experiencing the joy of what giant AI models can do,” Buck said in a press briefing.

Hundreds of accelerated systems and cloud instances

Taiwan manufacturers are among the many system manufacturers worldwide introducing systems powered by the latest Nvidia technology, including Aaeon, Advantech, Aetina, ASRock Rack, Asus, Gigabyte, Ingrasys, Inventec, Pegatron, QCT, Tyan, Wistron and Wiwynn.

Additionally, global server manufacturers Cisco, Dell Technologies, Hewlett Packard Enterprise, Lenovo, Supermicro, and Eviden, an Atos company, offer a broad array of Nvidia-accelerated systems.

Cloud partners for Nvidia H100 include Amazon Web Services (AWS), Cirrascale, CoreWeave, Google Cloud, Lambda, Microsoft Azure, Oracle Cloud Infrastructure, Paperspace and Vultr.

Nvidia AI Enterprise, the software layer of the Nvidia AI platform, offers over 100 frameworks, pretrained models and development tools to streamline development and deployment of production AI, including generative AI, computer vision and speech AI.

Systems with GH200 Superchips are expected to be available beginning later this year.

Nvidia unveils MGX server specification

To meet the diverse accelerated computing needs of data centers, Nvidia today unveiled the Nvidia MGX server specification, which provides system manufacturers with a modular reference architecture to quickly and cost-effectively build more than 100 server variations to suit a wide range of AI, high performance computing and Omniverse applications.

ASRock Rack, ASUS, GIGABYTE, Pegatron, QCT and Supermicro will adopt MGX, which can slash development costs by up to three-quarters and reduce development time by two-thirds to just six months.
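Taken at face value, those savings imply a baseline that the article does not state outright. The short sketch below works it out from the quoted fractions; the variable names are illustrative, and the result is an inference from the article's numbers rather than a figure published by Nvidia.

```python
from fractions import Fraction

# Figures quoted in the article.
TIME_REDUCTION = Fraction(2, 3)      # development time cut by two-thirds
TIME_AFTER_MONTHS = 6                # ...to just six months
COST_REDUCTION_MAX = Fraction(3, 4)  # costs cut by up to three-quarters

# If six months is what remains after removing two-thirds,
# the implied baseline is 6 / (1 - 2/3) months.
baseline_months = TIME_AFTER_MONTHS / (1 - TIME_REDUCTION)

# Share of the original development cost that remains in the best case.
remaining_cost_share = 1 - COST_REDUCTION_MAX

print(f"Implied baseline development time: {baseline_months} months")
print(f"Remaining cost share (best case): {remaining_cost_share}")
```

Exact fractions are used instead of floats so the implied baseline comes out as a whole number of months.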

“Enterprises are seeking more accelerated computing options when architecting data centers that meet their specific business and application needs,” said Kaustubh Sanghani, vice president of GPU products at Nvidia, in a statement. “We created MGX to help organizations bootstrap enterprise AI, while saving them significant amounts of time and money.”

With MGX, manufacturers start with a basic system architecture optimized for accelerated computing for their server chassis, and then select their GPU, DPU and CPU. Design variations can address unique workloads, such as HPC, data science, large language models, edge computing, graphics and video, enterprise AI, and design and simulation.

Multiple tasks like AI training and 5G can be handled on a single machine, while upgrades to future hardware generations can be frictionless. MGX can also be easily integrated into cloud and enterprise data centers, Nvidia said.

QCT and Supermicro will be the first to market, with MGX designs appearing in August. Supermicro’s ARS-221GL-NR system, announced today, will include the Nvidia Grace CPU Superchip, while QCT’s S74G-2U system, also announced today, will use the Nvidia GH200 Grace Hopper Superchip.

Additionally, SoftBank plans to roll out multiple hyperscale data centers across Japan and use MGX to dynamically allocate GPU resources between generative AI and 5G applications.

“As generative AI permeates across business and consumer lifestyles, building the right infrastructure for the right cost is one of network operators’ greatest challenges,” said Junichi Miyakawa, CEO at SoftBank, in a statement. “We expect that Nvidia MGX can tackle such challenges and allow for multi-use AI, 5G and more depending on real-time workload requirements.”

MGX differs from Nvidia HGX in that it offers flexible, multi-generational compatibility with Nvidia products to ensure that system builders can reuse existing designs and easily adopt next-generation products without expensive redesigns. In contrast, HGX is based on an NVLink-connected multi-GPU baseboard tailored to scale to create the ultimate in AI and HPC systems.

Nvidia announces DGX GH200 AI Supercomputer

Nvidia also announced a new class of large-memory AI supercomputer — an Nvidia DGX supercomputer powered by Nvidia GH200 Grace Hopper Superchips and the Nvidia NVLink Switch System — created to enable the development of giant, next-generation models for generative AI language applications, recommender systems and data analytics workloads.

The Nvidia DGX GH200’s shared memory space uses NVLink interconnect technology with the NVLink Switch System to combine 256 GH200 Superchips, allowing them to perform as a single GPU. This provides 1 exaflop of performance and 144 terabytes of shared memory — nearly 500x more memory than in a single Nvidia DGX A100 system.
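The memory figures can be cross-checked with simple arithmetic. The sketch below assumes binary units (1 TB = 1024 GB) and, for the comparison, a DGX A100 configuration with 320 GB of GPU memory; neither assumption is stated in the article, and the names are illustrative.

```python
# Figures quoted in the article.
TOTAL_SHARED_TB = 144   # shared memory across the DGX GH200
SUPERCHIPS = 256        # GH200 Superchips combined over NVLink

# Assumptions (not from the article): binary units and a 320 GB DGX A100.
GB_PER_TB = 1024
DGX_A100_GB = 320

total_gb = TOTAL_SHARED_TB * GB_PER_TB

# Memory contributed by each superchip.
per_superchip_gb = total_gb / SUPERCHIPS

# How many times more memory than the assumed DGX A100 configuration.
memory_ratio = total_gb / DGX_A100_GB

print(f"Per superchip: {per_superchip_gb:.0f} GB")
print(f"Versus DGX A100 (assumed 320 GB): {memory_ratio:.0f}x")
```

Under these assumptions each superchip contributes 576 GB, and the ratio lands a little above 460, which is the order of magnitude behind the "nearly 500x" claim.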

“Generative AI, large language models and recommender systems are the digital engines of the modern economy,” said Huang. “DGX GH200 AI supercomputers integrate Nvidia’s most advanced accelerated computing and networking technologies to expand the frontier of AI.”

GH200 superchips eliminate the need for a traditional CPU-to-GPU PCIe connection by combining an Arm-based Nvidia Grace CPU with an Nvidia H100 Tensor Core GPU in the same package, using Nvidia NVLink-C2C chip interconnects. This increases the bandwidth between GPU and CPU by 7x compared with the latest PCIe technology, slashes interconnect power consumption by more than 5x, and provides a 600GB Hopper architecture GPU building block for DGX GH200 supercomputers.

DGX GH200 is the first supercomputer to pair Grace Hopper Superchips with the Nvidia NVLink Switch System, a new interconnect that enables all GPUs in a DGX GH200 system to work together as one. The previous generation system only provided for eight GPUs to be combined with NVLink as one GPU without compromising performance.

The DGX GH200 architecture provides 10 times more bandwidth than the previous generation, delivering the power of a massive AI supercomputer with the simplicity of programming a single GPU.

Google Cloud, Meta and Microsoft are among the first expected to gain access to the DGX GH200 to explore its capabilities for generative AI workloads. Nvidia also intends to provide the DGX GH200 design as a blueprint to cloud service providers and other hyperscalers so they can further customize it for their infrastructure.

“Building advanced generative models requires innovative approaches to AI infrastructure,” said Mark Lohmeyer, vice president of Compute at Google Cloud, in a statement. “The new NVLink scale and shared memory of Grace Hopper Superchips address key bottlenecks in large-scale AI and we look forward to exploring its capabilities for Google Cloud and our generative AI initiatives.”

Nvidia DGX GH200 supercomputers are expected to be available by the end of the year.

Lastly, Huang announced that a new supercomputer called Nvidia Taipei-1 will bring more accelerated computing resources to Asia to advance the development of AI and industrial metaverse applications.

Taipei-1 will expand the reach of the Nvidia DGX Cloud AI supercomputing service into the region with 64 DGX H100 AI supercomputers. The system will also include 64 Nvidia OVX systems to accelerate local research and development, and Nvidia networking to power efficient accelerated computing at any scale. Owned and operated by Nvidia, the system is expected to come online later this year.

Leading Taiwan education and research institutes will be among the first to access Taipei-1 to advance healthcare, large language models, climate science, robotics, smart manufacturing and industrial digital twins. National Taiwan University plans to study large language model speech learning as its initial Taipei-1 project.

“National Taiwan University researchers are dedicated to advancing science across a broad range of disciplines, a commitment that increasingly requires accelerated computing,” said Shao-Hua Sun, assistant professor, Electrical Engineering Department at National Taiwan University, in a statement. “The Nvidia Taipei-1 supercomputer will help our researchers, faculty and students leverage AI and digital twins to address complex challenges across many industries.”

The post Nvidia’s Grace Hopper Superchips for generative AI enter full production appeared first on Venture Beat.
