As I highlighted in my last article, two decades after the DARPA Grand Challenge, the autonomous vehicle (AV) industry is still waiting for breakthroughs—particularly in addressing the “long tail ...
Vision language models (VLMs) have made impressive strides over the past year, but can they handle real-world enterprise challenges? All signs point to yes, with one caveat: They still need maturing ...
H2O.ai, a provider of open-source AI platforms, announced today two new vision-language models designed to improve document analysis and optical character recognition (OCR) tasks. The models, named ...
If India’s AI ambitions needed a pre-India AI Impact Summit flex, Sarvam AI delivered it loud and clear. Days before the India AI Impact Summit 2026 kicks off in New Delhi, the Bengaluru-based startup ...
Genesis AI has unveiled GENE-26.5, a vision-language-action model aimed at giving robots human-level dexterity, while Intel is positioning itself to dominate the growing AI inference market. The ...
Aquila improves remote sensing image comprehension through two linked innovations. First, it accepts image inputs up to 1,024 × 1,024 pixels, far higher than the 448 × 448 scale supported by many ...
In the race to develop AI that understands complex images like financial forecasts, medical diagrams and nutrition labels, closed-source systems like ChatGPT and Claude are currently setting the pace, ...