arXiv

KITE: Kernelized and Information Theoretic Exemplars for In-Context Learning

Title: KITE: Kernelized and Information Theoretic Exemplars for In-Context Learning

Abstract:

In-context learning (ICL) has established itself as a robust framework for tailoring large language models (LLMs) to novel and data-limited tasks by leveraging a small set of handpicked, task-specific examples within the prompt. Nevertheless, constrained by the limited context window of LLMs, a critical challenge persists: identifying which examples to choose to optimize performance for a particular user query. Although nearest-neighbor techniques such as KATE are commonly employed for this selection, they exhibit significant limitations in high-dimensional embedding spaces, notably struggling with poor generalization and insufficient diversity. Addressing this example selection dilemma, our research adopts a rigorous, information-theoretic approach. We conceptualize an LLM as a linear function acting on input embeddings and recast the selection process as a query-specific optimization task: choosing a subset of exemplars from a broader pool to minimize prediction error for a given query. This strategy diverges from conventional learning-theoretic methods that prioritize generalization, focusing instead on precise prediction for individual query instances. We develop a principled surrogate objective that is approximately submodular, allowing for the application of a greedy algorithm that offers an approximation guarantee. Our methodology is further refined through two key enhancements: (i) the integration of the kernel trick to facilitate operations in high-dimensional feature spaces without the need for explicit mappings, and (ii) the inclusion of a regularizer based on optimal design to foster diversity among the chosen examples. Our empirical results reveal substantial gains over conventional retrieval techniques across various classification benchmarks, underscoring the advantages of employing structure-aware and diverse example selection for ICL in practical, label-scarce environments.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

AI Concentration Risk Is the Problem: 3-Minutes MLIV
Bloomberg

AI Concentration Risk Is the Problem: 3-Minutes MLIV

The article argues that AI concentration risk, rather than the technology itself, is the primary concern. It highlights ...

Reuters

Foxconn announces strategic collaboration with Intel on next-gen AI infrastructure

Foxconn and Intel announced a strategic partnership to develop next-generation AI infrastructure. This collaboration aim...

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)
Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Broadcom AI Chip Outlook Disappoints Investors
Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Reuters

Europe's tech 'liberation day'? Computer says not yet

Europe’s expected tech breakthrough remains unrealized, as current systems indicate that a true "liberation day" has not...

Hiranandani Group CEO on Powering India's Digital Future
Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

Hiranandani Group CEO discusses driving India's digital transformation.