Happening Now
Friday, November 7- NVIDIA: Jensen Huang and Bill Dally Awarded Prestigious Queen Elizabeth Prize for Engineering
- Nebius Launches Token Factory to Deliver Production AI Inference at Scale
- Hyperion Research Announces Availability of New AI in HPC ROI Study
- VAST Data Secures Commercial Partnership Deal with CoreWeave for $1.17B
- Nebius Deploys Blackwell Ultra AI Infrastructure in the UK
- Lenovo Highlights Need for Liquid-Cooled, AI-Ready Infrastructure in New Report
- Giga Computing Announces Worldwide Availability of Its NVIDIA RTX PRO Server
- BrainChip Unveils AKD1500 Edge AI Co-Processor at Embedded World North America
- NCSA Awards 38 Students Fiddler Innovation Fellowships
- ORNL, NVIDIA, HPE Advance Quantum Computing, AI and HPC for Science
- Ai2 Launches OlmoEarth: Foundation Models and Open Infrastructure to Tackle the Planet’s Biggest Problems
- RapidFire AI Launches Open Source Package to Accelerate Agentic RAG and Context Engineering Success
- Researchers Explore How Generative AI Tools Are Shaping Computer Science Education
- AWS and OpenAI Announce Multi-Year Strategic Partnership
- European Commission Launches ‘Resource for AI Science in Europe’
- SandboxAQ Launches Public Database Exposing Cryptographic Risks in Open-Source Software
- UCSD: Generative AI Can Help Athletes Avoid Injuries
- CoreWeave to Enter the US Federal Market
- Nscale and VAST Data Unite to Build Global Fabric for AI Accelerated by NVIDIA
-
-
Recent News
-
Contributors
Alex WoodieEditorial Director
and Contributing
Editor
Jaime HamptonManaging Editor
Ali AzharContributing Editor
Drew JollyContributing Editor
Author Archives: Doug Eadline
Doug Eadline
Nvidia Releasing Open-Source Optimized Tensor RT-LLM Runtime with Commercial Foundational AI Models to Follow Later This Year
September 14th, 2023 Comments Off on Nvidia Releasing Open-Source Optimized Tensor RT-LLM Runtime with Commercial Foundational AI Models to Follow Later This Year
Nvidia's large-language models will become generally available later this year, the company confirmed. Organizations widely rely on Nvidia's graphics processors to write AI applications. The company has also created proprietary pre-trained models similar to OpenAI's GPT-4 and Google's PaLM-2. ...
MLPerf Releases Latest Inference Results and New Storage Benchmark
September 14th, 2023 Comments Off on MLPerf Releases Latest Inference Results and New Storage Benchmark
MLCommons this week issued the results of its latest MLPerf Inference (v3.1) benchmark exercise. Nvidia was again the top performing accelerator, but Intel (Xeon CPU) and Habana (Gaudi1 and 2) performed well. Google provided a peak at its new ...
Nvidia H100: Are 550,000 GPUs Enough for This Year?
August 21st, 2023 Comments Off on Nvidia H100: Are 550,000 GPUs Enough for This Year?
The GPU Squeeze continues to place a premium on Nvidia H100 GPUs. In a recent Financial Times article, Nvidia reports that it expects to ship 550,000 of its latest H100 GPUs worldwide in 2023. The appetite for GPUs is obviously coming ...
GigaIO’s New SuperNode Takes-off with Record Breaking AMD GPU Performance
August 11th, 2023 Comments Off on GigaIO’s New SuperNode Takes-off with Record Breaking AMD GPU Performance
The HPC user's dream is to keep stuffing GPUs into a rack mount box and make everything go faster. There are some servers that offer up to eight GPUs, but the standard server usually offers four GPU slots. Fair ...


