AMD closed the performance gap with Nvidia's Blackwell accelerators with the launch of the MI355X this spring. Now the company just needs to overcome Nvidia's CUDA software advantage and make that ...
Qualcomm Incorporated QCOM recently announced the launch of AI200 and AI250 chip-based AI accelerator cards and racks. The leading-edge AI inference optimized solutions for data centers are powered by ...
Nvidia has been able to increase Blackwell GPU performance by up to 2.8x per GPU in a period of just three short months.
The big four cloud giants are turning to Nvidia's Dynamo to boost inference performance, with the chip designer's new Kubernetes-based API helping to further ease complex orchestration. According to a ...
Researchers propose low-latency topologies and processing-in-network as memory and interconnect bottlenecks threaten ...
Chipmakers Nvidia and Groq entered into a non-exclusive tech licensing agreement last week aimed at speeding up and lowering the cost of running pre-trained large language models. Why it matters: Groq ...
Inference Labs announces a $2.3 million pre-seed funding round to focus on developing a network for zero-knowledge verification with Proof-of-Inference for AI. As the global integration of artificial ...
NVIDIA BlueField-4 powers NVIDIA Inference Context Memory Storage Platform, a new kind of AI-native storage infrastructure ...
Groq has entered into a non-exclusive licensing agreement with Nvidia covering its inference technology. Groq reports the pact focuses on expanding access to high-performance, low-cost inference ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results