Frontier Signal
DialToM: New Theory of Mind Benchmark Tests AI Dialogue Forecasting
The DialToM benchmark finds that LLMs excel at identifying mental states but struggle to forecast dialogue trajectories. Only Gemini 3 Pro shows functional Theory of Mind abilities.