Continual Model Routing in Evolving Model Hubs

15d ago · Global · primary source: export.arxiv.org

Multi-source synthesis by The Embedding Report from 2 sources. Every numeric and quoted claim traces to a cited source body (see methodology).

Researchers have proposed two new models to improve efficiency in AI model routing and table reasoning, addressing challenges posed by the rapid growth of pre-trained models in AI model hubs.

The first model, CARvE, is a contrastive embedding approach for efficient continual model routing in evolving model hubs. It uses checkpoint-based anchoring and structured replay to outperform zero-shot retrieval, fine-tuning, and adapter-merging baselines in model, family, and domain-level accuracy[1]. AI model hubs provide access to a rapidly growing collection of pre-trained models, posing challenges in scaling model selection and updating routing mechanisms. To address this, researchers formalised the setting as Continual Model Routing (CMR) and proposed CMRBench, a large-scale benchmark simulating realistic hub expansion with over 2,000 candidate models[1]. Meanwhile, a separate research effort introduced EcoTab, a table-aware stepwise routing framework for efficient table reasoning. EcoTab estimates the uncertainties of table tokens and text tokens separately and combines them for routing, achieving a better balance between accuracy and efficiency. This is necessary as Large Reasoning Models (LRMs) incur substantial inference cost due to long reasoning traces, and existing methods fail to model separately the uncertainty of table tokens and text tokens[2].

research-paperbenchmark

Background sources we checked (4)
  • arxiv.org ↗ AI model hubs provide access to a rapidly growing collection of powerful pre-trained models, enabling off-the-shelf mixture-of-experts systems with different routing strategies. However, this rapid growth poses two fundamental challenges: scaling model selection across thousands …
  • en.wikipedia.org ↗ The OpenROAD Project (Open Realization of Autonomous Design) is a major open-source project that aims to provide a fully automated, end-to-end digital integrated circuit design flow (RTL-to-GDSII), thereby eliminating the need for human intervention. The project, led by UC San Di…
  • en.wikipedia.org ↗ Speech recognition (automatic speech recognition (ASR), computer speech recognition, or speech-to-text (STT)) is a sub-field of computational linguistics concerned with methods and technologies that translate spoken language into text or other interpretable forms. Speech recognit…
  • en.wikipedia.org ↗ Cro-Magnons or European early modern humans (EEMH) were the first early modern humans (Homo sapiens) found in Europe and Siberia, continuously occupying the continent possibly from as early as 56,800 years ago. They interacted and interbred with the indigenous Neanderthals (H. n…

Sources cited (2)

  1. arxiv.org ↗ E
  2. arxiv.org ↗ E
Spot something wrong? Report an issue