Tag: NVIDIAs

This is what Nvidia’s Exaflop supercomputer-in-a-rack looks like — the DGX GB200 NVL72 tower most likely uses 48V, 2.5kA to deliver a staggering 1,440 petaflops, could cost millions

Post author By lisa nichols
Post date March 23, 2024
No Comments on This is what Nvidia’s Exaflop supercomputer-in-a-rack looks like — the DGX GB200 NVL72 tower most likely uses 48V, 2.5kA to deliver a staggering 1,440 petaflops, could cost millions

[ad_1]

Nvidia recently unveiled its DGX GB200 NVL72 supercomputer-in-a-rack at Nvidia GTC 2024 and Patrick Kennedy at Serve The Home took a selection of great photos showcasing the impressive beast.

The name of the DGX GB200 NVL72 tells you much of what you need to know. The GB200 signifies the Grace Blackwell GB200 compute structure, while the NVL72 denotes there are 72 Blackwell GPUs connected by NVLink.

The Blackwell platform contains 208 billion transistors across its two GPU dies. These are connected by 10 TB/second chip-to-chip link into a single, unified GPU. Blackwell, set to ship later this year, will reportedly offer up to 20 petaflops of FP4 power and be up to 30x faster than Hopper for AI inference tasks.

TechRadar Pro also snapped our own picture of the DGX GB200 at Nvidia GTC 2024 (Image credit: Future / Mike Moore)

120kW power load

The rack scale system comprises ten compute nodes in the top stack, each featuring dual Infiniband ports, four E1.S drive trays, and management ports. Each node is powered by two Grace Arm CPUs connected to two Blackwell GPUs. Below these nodes are nine NVSwitch shelves, with gold handles for easy removal.

The rear of the rack reveals the power delivery system designed for blind-mate power via the bus bar, liquid cooling nozzles, and NVLink connections for each component. This setup allows for slight movement to ensure proper blind mating.

DGX GB200 NVL72 weighs 1.36 metric tons (3,000 lbs) and consumes a 120kW, a power load that Serve The Home points out, not all data centers will be able to handle. As many can only support a maximum of 60kW racks, a future half-stack system seems a possibility. The rack uses 2 miles (3.2 km) of copper cabling instead of optics to lower the system’s power draw by 20kW.

You can view the rest of the photos taken by Kennedy at GTC 2024 here.

This popular gaming PC vendor believes it can make AI training much more affordable for small businesses — Maingear partners with Phison to deliver a quad-GPU system using Nvidia’s RTX 6000 Ada

[ad_1]

The surge in demand for large-scale generative AI models has led to a significant increase in hardware requirements, making model training costly and inaccessible for many SMBs and educational establishments.

High-performance custom PC builder Maingear has partnered with storage giant Phison on a new range of Maingear Pro AI workstations that boast powerful Intel Xeon W7-3455 CPUs.

The new workstations can be configured with up to 1TB of DDR5 memory, and up to 4x RTX 5000 Ada or 4x RTX 6000 Ada GPUs. These GPUs are supported by Phison aiDAPTIV+ caching SSDs and software, to significantly lower the cost of LLM development and training.

Off-the-shelf components

Maingear Pro AI workstations fit in a standard desktop tower PC design so they can easily be stored under a desk, or placed anywhere in an office (a 4U rackmount chassis is also available). Maingear says these workstations have been designed with off-the-shelf components for easy upgrades and Noctua cooling components to manage heat and reduce noise when under load.

Maingear founder and CEO, Wallace Santos, stated, “Our dedication to crafting highly capable yet budget-friendly solutions guarantees SMBs, universities, and research facilities a competitive advantage in an industry formerly restricted by multimillion-dollar investments.”

There are three preconfigured systems available to order now:

PRO AI: SHODAN 64 (MSRP: $28,000)

Chassis: Fractal Design Define 7 XL
GPU: 2x NVIDIA RTX 5000 Ada
CPU: Intel Xeon W7-3455
Cooling: Noctua NH-U12S DX-4677, Noctua Fan Kit
Motherboard: Supermicro X13SWA-TF
Memory: 512GB Kingston Server Premier RAM (8×64)
OS: Linux – Ubuntu
OS Drive: 2TB Gen4 M.2 NVMe SSD
AI Kit: Phison AiDAPTIV+ Layer
Additional Storage: 16TB Seagate Barracuda HDD
Power: 1600W XPG Fusion 80+ Platinum
Warranty: 2-year MAINGEAR PRO Solutions WarrantyEmpty list

PRO AI: SHODAN 96 (MSRP: $37,000)

Chassis: Fractal Design Define 7 XL
GPU: 2x NVIDIA RTX 6000 Ada
CPU: Intel Xeon W7-3455
Cooling: Noctua NH-U12S DX-4677, Noctua Fan Kit
Motherboard: Supermicro X13SWA-TF
Memory: 512GB Kingston Server Premier RAM (8×64)
OS: Linux – Ubuntu
OS Drive: 2TB Gen4 M.2 NVMe SSD
AI Kit: Phison AiDAPTIV+
Additional Storage: 16TB Seagate IronWolf HDD
Power: 1600W XPG Fusion 80+ Platinum
Warranty: 2-year MAINGEAR PRO Solutions Warranty

PRO AI: SHODAN 192 (MSRP: $60,000)

Chassis: Fractal Design Define 7 XL
GPU: 4x NVIDIA RTX 6000 Ada
CPU: Intel Xeon W7-3455
Cooling: Noctua NH-U12S DX-4677, Noctua Fan Kit
Motherboard: Supermicro X13SWA-TF
Memory: 512GB Kingston Server Premier RAM (8×64)
OS: Linux – Ubuntu
OS Drive: 2TB Gen4 M.2 NVMe SSD
AI Kit: Phison AiDAPTIV+
Additional Storage: 16TB Seagate IronWolf HDD
Power: 1600W XPG Fusion 80+ Platinum
Warranty: 2-year MAINGEAR PRO Solutions Warranty

Nvidia’s Project GROOT brings the human-robot future a significant step closer

Post author By lisa nichols
Post date March 18, 2024
No Comments on Nvidia’s Project GROOT brings the human-robot future a significant step closer

[ad_1]

The age of humanoid robots could be a significant step closer thanks to a new release from Nvidia.

The computing giant has announced the launch of Project GROOT, its new foundational model aimed at helping the development of such robots in industrial use cases.

Revealed at Nvidia GTC 2024, the company says its new launch will enable robots to be smarter and more functional than ever before – and they’ll do so by watching humans.

We are GROOT?

Announcing the launch of Project GROOT (standing for “Generalist Robot 00 Technology”) on stage at Nvidia GTC 2024, Jensen Huang, company founder and CEO revealed robots powered by the platform will be designed to understand natural language and emulate movements by observing human actions.

This will allow them to quickly learn coordination, dexterity and other skills in order to navigate, adapt and interact with the real world – and definitely not lead to a robot uprising at all.

Huang went on to show off a number of demos which saw Project GROOT-powered robots carry out a number of tasks, from XXX, showing their possibility.

“Building foundation models for general humanoid robots is one of the most exciting problems to solve in AI today,” Huang said. “The enabling technologies are coming together for leading roboticists around the world to take giant leaps towards artificial general robotics.”

The scale of importance for Project GROOT was also highlighted by the fact Nvidia has built a new computing system, Jetson Thor, specifically designed for humanoid robots.

The SoC includes a GPU based on the latest Nvidia Blackwell architecture, which includes a transformer engine able to delivering 800 teraflops of AI performance, allowing them to run multimodal generative AI models like GR00T.

Nvidia Isaac Robotics

(Image credit: Nvidia)

The company also revealed upgrades to its Nvidia Isaac robotics platform, designed to make robotic arms smarter, more flexible and more efficient than ever – making them a much more appealing choice for factories and industrial use cases across the world.

This includes new collections of robotics pretrained models, libraries and reference hardware aimed at helping faster learning and better efficiency.

‘A single chip to outperform a small GPU data center’: Yet another AI chip firm wants to challenge Nvidia’s GPU-centric world — Taalas wants to have super specialized AI chips

[ad_1]

Toronto-based AI chip startup Taalas has emerged from stealth with $50 million in funding and the lofty aim of revolutionizing the GPU-centric world dominated by Nvidia.

Founded by Ljubisa Bajic, Lejla Bajic, and Drago Ignjatovic, all previously from Tenstorrent (the creator of Grayskull), Taalas is developing an automated flow for quickly turning any AI model – Transformers, SSMs, Diffusers, MoEs, etc. – into custom silicon. The company claims that the resulting Hardcore Models are 1000x more efficient than their software counterparts.

The startup also says that one of its chips can hold an entire large AI model without requiring external memory, and the efficiency of hard-wired computation enables a single chip to outperform a small GPU data center.

Casting intelligence directly into silicon

“Artificial intelligence is like electrical power – an essential good that will need to be made available to all. Commoditizing AI requires a 1000x improvement in computational power and efficiency, a goal that is unattainable via the current incremental approaches. The path forward is to realize that we should not be simulating intelligence on general purpose computers, but casting intelligence directly into silicon. Implementing deep learning models in silicon is the straightest path to sustainable AI,” said Ljubisa Bajic, Taalas’ CEO.

“We believe the Taalas ‘direct-to-silicon’ foundry unlocks three fundamental breakthroughs: dramatically resetting the cost structure of AI today, viably enabling the next 10-100x growth in model size, and efficiently running powerful models locally on any consumer device. This is perhaps the most important mission in computing today for the future scalability of AI. And we are proud to support this remarkable n-of-1 team as they do it,” said Matt Humphrey, Partner at Quiet Capital which led the two rounds of funding alongside Pierre Lamond, an advisor at Eclipse Ventures.

Taalas says it will be taping out its first large language model chip in the third quarter of 2024, and aiming to make its chips available to the first customers in Q1 2025.

NVIDIA’s AI personal assistant demo available for RTX GPU PCs

Post author By miranda cosgrove
Post date February 19, 2024
No Comments on NVIDIA’s AI personal assistant demo available for RTX GPU PCs

NVIDIA RTX AI PCs Chat Bot Demo

NVIDIA has recently unveiled a new technology demonstration that is set to enhance the way we interact with artificial intelligence. This new feature, known as “Chat With RTX,” is designed to work seamlessly on Windows RTX PCs, leveraging the power of NVIDIA RTX GPUs to deliver a personalized and efficient chatbot experience. The technology is aimed at providing users with quick, secure, and contextually relevant responses, drawing from their own documents and notes to ensure a private and customized interaction.

At the heart of “Chat With RTX” lies a sophisticated GPT large language model that is capable of tailoring conversations to the user’s specific needs. This is not your average chatbot; it’s an intelligent system that can process a variety of file types, including text documents, PDFs, Word documents, XML files, and even transcriptions from YouTube videos. This versatility allows the chatbot to provide assistance that is highly relevant to the user’s personal content.

NVIDIA RTX AI PCs Chat Bot Demo

One of the key features of NVIDIA’s new tech demo is the use of retrieval-augmented generation (RAG), which significantly enhances the quality of the chatbot’s responses. In addition, the demo incorporates TensorRT-LLM, a tool for optimizing large language models, ensuring that the chatbot operates at peak efficiency. Thanks to RTX acceleration, the chatbot is not only accurate but also incredibly fast, running directly on a user’s Windows RTX PC without the need for cloud processing.

Developers, in particular, may find “Chat With RTX” intriguing as it builds upon the TensorRT-LLM RAG developer reference project available on GitHub. This provides them with a valuable opportunity to explore advanced AI models and potentially integrate similar technologies into their own projects.

For those interested in experiencing “Chat With RTX,” there are certain system requirements that must be met. The user’s PC should be equipped with a GeForce RTX 30 Series GPU or a more advanced model, with a minimum of 8GB of VRAM. Additionally, the PC must be running either Windows 10 or 11 and have the latest NVIDIA drivers installed to ensure compatibility with the demo.

NVIDIA has acknowledged a current installation issue with “Chat With RTX” and has promised to resolve it in a forthcoming update. In the meantime, users are advised to install the application in the default directory to avoid any complications.

Furthermore, NVIDIA is encouraging developers to push the boundaries of generative AI by hosting a contest. Participants are invited to create innovative applications using NVIDIA RTX GPUs, with the chance to win prizes. This contest not only stimulates creativity within the developer community but also showcases the potential of NVIDIA’s technology in driving forward AI applications.

The introduction of “Chat With RTX” is a testament to NVIDIA’s ongoing efforts to advance AI and GPU technology. By focusing on high-performance computing and data privacy, NVIDIA is making it possible to integrate sophisticated AI capabilities into everyday tasks. This technology allows users to benefit from a smart, responsive, and personalized AI assistant, all while keeping their data securely processed on their local machine. As NVIDIA continues to innovate and address any initial teething problems, “Chat With RTX” is poised to become an essential tool for those seeking a more intelligent and responsive computing experience.

Filed Under: Technology News, Top News

Latest timeswonderful Deals

Disclosure: Some of our articles include affiliate links. If you buy something through one of these links, timeswonderful may earn an affiliate commission. Learn about our Disclosure Policy.

Tags Assistant, demo, GPU, NVIDIAs, PCs, Personal, RTX