Staged Continual Adaptation of Multimodal Foundation Models for Japanese Financial Documents

Accepted at the CATS Workshop @ ICML 2026. We track an 8.4B-parameter multimodal model from an un-fine-tuned baseline (Phase 0) through three training phases on Japanese financial disclosures and find that each benchmark peaks at a different phase, so the best checkpoint is task-dependent. This is the latest version of the Compass project; an earlier write-up targeting the FT-LLM 2026 competition is preserved as a legacy post.

May 13, 2026 · Atsushi Yanagisawa