inference

(IM Imagery/Shutterstock)

Qualcomm Debuts AI200 and AI250 Chips as It Moves Into AI Infrastructure

Qualcomm is expanding beyond its mobile roots with a new line of data center hardware designed for AI inference. The San Diego-based company introduced its new AI200 and AI250 ...Full Article

NextSilicon Says Maverick-2 Delivers 4x Performance-Per-Watt Vs. Blackwell GPU

NextSilicon has shared some internal benchmark results for its latest AI and HPC chip, the Maverick-2, and the claims are nothing if not bold. The company claims that its ...Full Article

AMD and OpenAI Unveil Massive Chip Deal for AI Inference

Another week, another massive investment in chips by an AI firm. This week’s edition features OpenAI committing to buying billions of dollars worth of AI accelerators from AMD, one ...Full Article

(Source: Pingingz/Shutterstock)

AI Lessons Learned from DeepSeek’s Meteoric Rise

The AI world is still buzzing from last week’s debut of DeepSeek’s reasoning model, which demonstrates category-leading performance at a bargain-basement price. While the details of the Chinese AI ...Full Article

Nvidia’s Speedy New Inference Engine Keeps BERT Latency Within a Millisecond

Disappointment abounds when your data scientists dial in the accuracy on deep learning models to a high degree but are then eventually forced to gut the model for inference ...Full Article

Source: IBM Research

IBM’s Latest Prototype Low-Power AI Chip Offers ‘Precision Scaling’

IBM has released details of a prototype AI chip geared toward low-precision training and inference across different AI model types while retaining model quality within AI applications. In a ...Full Article

Source: Nvidia

Nvidia Probes Accelerators, Photons, GPU Scaling

Nvidia spotlighted an AI inference accelerator, emerging optical interconnects and a new programming framework designed to scale GPU performance during this week’s GTC China virtual event. In a keynote, ...Full Article

Xilinx Keeps Pace in AI Accelerator Race

FPGAs are increasingly used to accelerate AI workloads in datacenters for tasks like machine learning inference. A growing list of FPGA accelerators are challenging datacenter GPU deployments, promising to ...Full Article

NeoML Released as TensorFlow Alternative

A new open source library for training machine learning models is billed as rivaling the performance of AI models trained with established libraries like TensorFlow, especially models running on ...Full Article

via Shutterstock

SiFive Adds Tools for Cloud-Based Chip Design

Chip designers are drawing on new cloud resources along with conventional electronic design automation (EDA) tools to accelerate IC templates from tape-out to custom silicon. Among the challengers to ...Full Article