Sales of Intel's central processing units and custom AI processors are gaining traction as AI inference workloads grow.
Deepinfra lands $107M in funding to build out its dedicated inference cloud for open-source models - SiliconANGLE ...
As the world moves from AI training to AI inference, Nebius Group is moving aggressively to dominate the future ...
Silicom Ltd. (NASDAQ: SILC), a leading provider of networking and data infrastructure solutions, today announced that one of ...
Anthropic has held discussions with Fractile to buy inference chips from the UK-based startup when its hardware becomes ...
When OpenAI’s ChatGPT first exploded onto the scene in late 2022, it sparked a global obsession ...
A food fight erupted at the AI HW Summit earlier this year, where three companies all claimed to offer the fastest AI processing. All were faster than GPUs. Now Cerebras has claimed insanely fast AI ...
Google is packing ample amounts of static random access memory into a dedicated chip for running artificial intelligence models, following Nvidia's plans.
Zero Latency (formerly Hyphastructure) launched a closed beta for Zerogrid, a distributed AI inference platform designed to route workloads across edge infrastructure according to latency, data ...
DeepInfra raises $107M to expand global inference capacity, support new AI models, and enhance developer tooling across its ...
Viavi Solutions has unveiled the latest iteration of its CyberFlood testing platform.