We are pleased to announce the first international Workshop on Reasoning, Agents, Retrieval, and Attribution (RARA) for grounding documents, to be held in conjunction with ICDM 2025 on November 12, 2025 in Washington DC.
Documents serve as the backbone of knowledge preservation across domains such as law, finance, medicine, and academia. As AI-driven document understanding advances, ensuring factual accuracy, reasoning over complex contexts, and attributing information to reliable sources becomes increasingly vital.
The RARA workshop explores frameworks that ground document understanding through four key pillars:
- Reasoning: Structured mechanisms to navigate complex inference chains
- Agents: Specialized components working together on complex tasks
- Retrieval: Methods for finding and integrating relevant information
- Attribution: Techniques to ensure AI-generated content remains traceable to reliable sources
Topics of Interest
We welcome original research papers on topics including but not limited to:
- Complex Reasoning: Multi-hop inference across document sections, logical consistency maintenance, ambiguity resolution in domain-specific texts
- Agent Architectures: Multi-agent coordination frameworks, tool-augmented document understanding, planning strategies for document analysis
- Document-Specific Agents: Citation verification, fact-checking, table extraction, chart interpretation, formula extraction, document summarization and comparison agents
- Domain-Specific Document Processing: Specialized techniques for legal, financial, healthcare, academic, technical, and government documents
- Advanced Retrieval: Dense/sparse retrieval for multi-modal documents, cross-document information synthesis, retrieval-augmented generation for document processing
- Attribution Mechanisms: Source tracing in AI-generated content, confidence calibration in document analysis, verification of AI-generated claims
- Multi-Modal Processing: Handling diverse document formats including charts, tables, infographics, diagrams, flowcharts, forms, and other visually rich elements
- Document Structure: Layout analysis, semantic segmentation, hierarchical document modeling
- Benchmarks & Evaluation: Novel datasets, evaluation frameworks, metrics for document reasoning, attribution quality assessment, agent performance measurement