Image Captioning Models

Adobe Researchers Open-Source Image Captioning AI CLIP-S

Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Sepehr Khosravi discusses the current state ...

Ars Technica

Microsoft unveils AI model that understands image content, solves visual puzzles

On Monday, researchers from Microsoft introduced Kosmos-1, a multimodal model that can reportedly analyze images for content, solve visual puzzles, perform visual text recognition, pass visual IQ ...

ChatGPT Image 2.0 Signals Visual Reasoning To Solve Real-World Tasks

ChatGPT Image 2.0 suggests that AI image generation is evolving into visual reasoning and verifiable AI, with implications ...

Science Daily

Researchers teach an AI to write better chart captions

A new tool helps scientists develop machine-learning models that generate richer, more detailed captions for charts, and vary the level of complexity of a caption based on the needs of users. This ...

10d

Cross-Modal Data Understanding Advances Through Bukun Ren’s Review of Visual Language Models

A study on visual language models explores how shared semantic frameworks improve image–text understanding across multimodal tasks. By ...

VentureBeat

Meta reveals new AI image generation model CM3leon, touting greater efficiency

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Meta is continuing to push forward with its research into new forms of ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results