Transforming static chain-of-thought outputs into interactive interfaces that enhance verification accuracy and reduce cognitive load
As LLMs generate increasingly longer chain-of-thought reasoning (often thousands of tokens), users face a critical challenge: How can they efficiently comprehend LLM reasoning and detect errors or hallucinations when presented with walls of text?
Given a GSM8K question, LLMs typically provide step-by-step reasoning followed by the final answer. However, such output presentation is often static and long, posing higher cognitive load and leading to slower and more erroneous answer verification. In contrast, we prompt LLMs to generate interactive HTML/JavaScript applications with (a) navigation buttons (inspired by common IDEs) and (b) colored highlights. This interface enables users to verify reasoning more efficiently via tools that reduce cognitive load and improve verification efficiency.
Experience how our three interactive formats help you verify LLM reasoning
We introduce novel explanation formats that improve interpretability and usability of LLM reasoning
๐ก Hover over each card to see examples
Interactive Chain-of-Thought โ Navigate through reasoning steps with IDE-inspired controls, colored highlights, and structured presentation that reduces cognitive load.
Discrete blocks with color-coded variables
โ Visual segmentation
โ Color-coded tracking
โ Sequential playback
Interactive Program-of-Thought โ Program-based reasoning with interactive execution visualization, making computational logic transparent and verifiable.
Code-based reasoning with variable tracking
# Calculate bathroom areawidth_tiles = 10length_tiles = 20tile_size = 6 # incheswidth_inches = width_tiles ร tile_sizelength_inches = length_tiles ร tile_sizewidth_feet = width_inches / 12length_feet = length_inches / 12bathroom_area = width_feet ร length_feet
โ Systematic computation
โ Variable panel updates
โ Step-by-step execution
Interactive Graph โ Graph-based reasoning visualization that reveals structural relationships and dependencies in complex logical flows.
Interactive node-link graph
โ Draggable nodes
โ 85.6% accuracy
โ Clear dependencies
Transforming how users engage with AI reasoning
Watch a comprehensive walkthrough of our interactive explanation interfaces
User study with 125 participants demonstrates significant improvements
iGraph vs Traditional CoT
10.5% faster verification
Identifying exact error steps
iGraph rated highest overall