Abstract
The deployment of large language models (LLMs) in financial verticals is frequently bottlenecked by computational constraints and the strict imperative for zero-defect accuracy. This paper challenges the prevailing scaling law paradigm by introducing HAA-Fin, a resource-efficient and high-precision framework for regulatory compliance. We construct a Hierarchical Agentic Architecture (HAA) that decomposes complex multimodal auditing tasks, such as detecting layout-dependent loan rate violations or identifying illegal advertising claims, into atomic sub-tasks orchestrated by specialized agents including TimeAgent and RuleAgent. By synergizing a retrieval-augmented generation (RAG) module with layout-aware hybrid search and re-ranking, we enable 8B-parameter models to rival or even surpass the performance of models with over 32B parameters. Validated on a challenging national-level financial benchmark, our method effectively handles multimodal inputs of text and visual layout. It achieves a significant 20.7% accuracy boost via re-ranking and reduces single-document review time to under 1.5 seconds, demonstrating that orchestrating specialized agents is a superior pathway to scalable industrial AI compared to merely increasing model size.
| Original language | English |
|---|---|
| Title of host publication | The IEEE International Conference on Multimedia & Expo 2026 |
| Subtitle of host publication | ICME 2026 |
| Publisher | IEEE Press |
| Chapter | 1 |
| Pages | 1-6 |
| Number of pages | 6 |
| Publication status | Published - 5 Jul 2026 |
| Event | The IEEE International Conference on Multimedia & Expo 2026: ICME 2026 - Bangkok, Thailand, Bangkok, Thailand Duration: 5 Jul 2026 → 9 Jul 2026 https://2026.ieeeicme.org/ |
Conference
| Conference | The IEEE International Conference on Multimedia & Expo 2026 |
|---|---|
| Country/Territory | Thailand |
| City | Bangkok |
| Period | 5/07/26 → 9/07/26 |
| Internet address |
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver