SUNNYVALE, Calif.--(BUSINESS WIRE)--Today, Cerebras Systems, the pioneer in high performance AI compute, announced Cerebras Inference, the fastest AI inference solution in the world. Delivering 1,800 ...
SAN FRANCISCO--(BUSINESS WIRE)--Elastic (NYSE: ESTC), the Search AI Company, announced the Elasticsearch Open Inference API now supports Jina AI’s latest embedding models and reranking products.
Applications using Hugging Face embeddings on Elasticsearch now benefit from native chunking. “Developers are at the heart of our business, and extending more of our GenAI and search primitives to ...
Today, Continuum AI released OrcaRouter and OrcaRouter Lite — a unified inference layer that routes across 200+ frontier and open-source language models, with zero markup on BYOK traffic.
OpenRouter Inc., a startup working to ease the development of artificial intelligence applications, today announced that it has secured $40 million in funding. The company raised the capital over two ...
SINGAPORE, May 10, 2026 /EINPresswire.com/ -- Comprehensive analysis of 2.4 billion API calls ...
At LlamaCon, Meta launched the Llama API in a limited free preview, aiming to increase developer access to its Llama models. At the first-ever LlamaCon, Meta today made several announcements and ...
DigitalOcean (NYSE: DOCN) today announced the launch of its Inference Engine, a set of new production capabilities that give AI builders exceptional performance and unified control over how they run, ...
DeepInfra raises $107M to expand global inference capacity, support new AI models, and enhance developer tooling across its ...
Happy Horse, a new state-of-the-art video model, is now available April 27th via fal's generative media cloud for developers ...