Why did it reach that result? Establishing Multimodal XAI Technology to Explain Reasoning Grounds

NTT has established 'Evidence-Enhanced Decoding' technology to solve the issue where Large Vision-Language Models (LVLMs) ignore their own generated reasoning grounds. This technology enables faithful inference using both images and grounds without additional training, enhancing AI reliability. It will be presented at CVPR 2026 in June 2026, with applications expected in fields requiring high reliability such as medical diagnosis.
techNQ 54/100出典:PR Times

📋 Article Processing Timeline

  • 📰 Published: June 1, 2026 at 15:00
  • 🔍 Collected: June 1, 2026 at 15:27 (27 min after Published)
  • 🤖 AI Analyzed: June 1, 2026 at 18:14 (2h 46m after Collected)
NTT Corporation has established 'Evidence-Enhanced Decoding' technology as a new inference mechanism to improve the reliability of outputs from multimodal AI foundation models that handle images and language. Addressing the issue where LVLMs tend to ignore their own generated reasoning grounds during Chain-of-Thought (CoT) processes, this technology separates and weights inference from images and grounds, unlike conventional methods. This enables the model to output answers by faithfully utilizing information from both sources. This achievement will be presented at the Computer Vision and Pattern Recognition (CVPR) 2026 conference, held in Denver, USA, from June 3 to June 7, 2026. In recent years, while LVLM development has advanced, existing CoT mechanisms left the use of grounds to the model, failing to guarantee consistency between grounds and final outputs. This research establishes a plug-and-play decoding technique that requires no additional training, successfully providing interpretability to the LVLM inference process. This is expected to accelerate social implementation in fields requiring highly reliable systems, such as medical image diagnosis and decision-making support.

FAQ

What is the significance of this technology for Taiwan's AI industry?

For Taiwan's integrated hardware-software firms, improving AI model reliability is crucial for enhancing competitiveness in edge AI and industrial AI applications.