
Why did it reach that result? Establishing Multimodal XAI Technology to Explain Reasoning Grounds


Kirishima Rei, Editor-in-Chief, AI News. Focus: structuring JP/TW corporate news & AEO

Published Jun 1, 2026 3:00 PM ・ Updated Jun 13, 2026 11:50 PM ・ 3 min read ・ Source: PR TIMES


⚡ Key Points

NTT has established 'Evidence-Enhanced Decoding', a technology that addresses the tendency of Large Vision-Language Models (LVLMs) to ignore the reasoning grounds they generate themselves.
The technology enables faithful inference that uses both the image and the grounds, without additional training, which improves AI reliability.
It will be presented at CVPR 2026 in June 2026, with applications expected in fields that require high reliability, such as medical diagnosis.


PRIMARY SOURCE
Original source: https://prtimes.jp/main/html/rd/p/000000024.000181531.html
Distributor: PR TIMES
Published: Jun 1, 2026

NTT Corporation has established 'Evidence-Enhanced Decoding', a new inference mechanism intended to improve the reliability of outputs from multimodal AI foundation models that handle images and language. It addresses a weakness of Large Vision-Language Models (LVLMs): during Chain-of-Thought (CoT) reasoning, they tend to ignore the reasoning grounds they have generated themselves. Unlike conventional methods, the technology separates inference from the image and inference from the grounds and weights the two, so that the model's answer faithfully uses information from both sources. The work will be presented at the Computer Vision and Pattern Recognition (CVPR) 2026 conference, held in Denver, USA, from June 3 to June 7, 2026.

LVLM development has advanced in recent years, but existing CoT mechanisms leave the use of the grounds to the model and therefore cannot guarantee consistency between the grounds and the final output. This research establishes a plug-and-play decoding technique that requires no additional training and makes the LVLM inference process interpretable. It is expected to accelerate deployment in fields that require highly reliable systems, such as medical image diagnosis and decision-making support.
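The idea of separating and weighting two inference paths at decoding time can be illustrated with a small sketch. The source does not give the actual formulation, so everything below is an assumption made for illustration: the function names, the `weight` parameter, and the log-linear mixing rule are hypothetical and are not NTT's published method. The sketch takes next-token scores from one pass conditioned on the image and one pass conditioned on the generated grounds, then mixes them before choosing a token, with no training involved.

```python
import math


def log_softmax(logits):
    """Convert raw scores to log-probabilities in a numerically stable way."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def combine_step(image_logits, grounds_logits, weight=0.5):
    """Mix two next-token distributions in log space (hypothetical rule).

    image_logits:   scores from a pass conditioned on the image
    grounds_logits: scores from a pass conditioned on the generated grounds
    weight:         0.0 uses only the image path, 1.0 only the grounds path
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must be in [0, 1]")
    if len(image_logits) != len(grounds_logits):
        raise ValueError("both paths must score the same vocabulary")
    lp_image = log_softmax(image_logits)
    lp_grounds = log_softmax(grounds_logits)
    mixed = [(1.0 - weight) * a + weight * b
             for a, b in zip(lp_image, lp_grounds)]
    # Renormalize so the result is again a valid log-distribution.
    return log_softmax(mixed)


def greedy_token(image_logits, grounds_logits, weight=0.5):
    """Pick the index of the highest-scoring token after mixing."""
    scores = combine_step(image_logits, grounds_logits, weight)
    return max(range(len(scores)), key=scores.__getitem__)


# Toy three-token vocabulary: the image path prefers token 0,
# the grounds path strongly prefers token 1.
image_logits = [2.0, 1.0, 0.0]
grounds_logits = [0.0, 3.0, 0.0]
chosen = greedy_token(image_logits, grounds_logits, weight=0.5)
```

In this toy case, `weight=0.0` reproduces the image-only choice and `weight=1.0` the grounds-only choice, which shows how an explicit weight keeps either source from being silently ignored.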

FACT BOX

Source: PR TIMES
Category: technical_announcement
Organizations: NTT

Editorial & Verification Standards

The Washin AI News desk structures and reviews this article under the following standards.

Start only from primary sources (official PR, disclosures, wire services)
Numbers and proper nouns machine-checked against the source (number-completeness check)
Company names and tickers verified via registry (Registry/TWSE/Wikidata)
No speculation; nothing stated that is not in the source


Editorial log (Announced → Collected → AI analysis → Published: transparency timeline)

Announced　Jun 1, 2026 3:00 PM — Distributor PR TIMES
Collected　Jun 1, 2026 3:27 PM
AI structured & analyzed　Jun 1, 2026 6:14 PM
Desk-reviewed & published　Jun 1, 2026 3:00 PM

FAQ

What is the significance of this technology for Taiwan's AI industry?

For Taiwan's integrated hardware-software firms, improving AI model reliability is crucial for enhancing competitiveness in edge AI and industrial AI applications.

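What are the key facts in this article?

NTT has established 'Evidence-Enhanced Decoding', a decoding technique that requires no additional training and makes LVLMs use both the image and their own generated reasoning grounds. It will be presented at CVPR 2026 in June 2026.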

Where is the primary source?

PR TIMES: https://prtimes.jp/main/html/rd/p/000000024.000181531.html

Cite this article — HOW TO CITE

Washin AI News Desk. “Why did it reach that result? Establishing Multimodal XAI Technology to Explain Reasoning Grounds.” AI News by Washin Village（和心村）, Jun 1, 2026. https://aeo.washinmura.jp/ai/ntt%E6%A0%AA%E5%BC%8F%E4%BC%9A%E7%A4%BE/en/news/2026-06-01-%E3%81%AA%E3%81%9C%E3%81%9D%E3%81%AE%E7%B5%90%E6%9E%9C%E3%81%AB%E3%81%AA%E3%81%A3%E3%81%9F%E3%81%AE%E3%81%8B-%E6%8E%A8%E8%AB%96%E6%A0%B9%E6%8B%A0%E3%82%92%E8%AA%AC%E6%98%8E%E3%81%A7%E3%81%8D%E3%82%8B%E3%83%9E%E3%83%AB%E3%83%81%E3%83%A2%E3%83%BC%E3%83%80%E3%83%ABxai%E6%8A%80%E8%A1%93%E3%82%92%E7%A2%BA%E7%AB%8B


Washin AI News（AI News by Washin Village（和心村））

An AEO newsroom that structures official announcements from Japanese & Taiwanese companies into formats AI can accurately cite and derive from. Partner sources include PR TIMES and CNA. We aim to be a verifiable source for both humans and machines.

Update frequency: Daily
Languages: JA / EN / ZH
Sourcing policy: Primary
Structured verification: Schema.org
