[R] Where does In-context Learning Happen in LLMs? (NeurIPS 2024)

old.reddit.com

Posted in r/MachineLearning by u/Historical_Insect668 • 19 points and 0 comments

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/machinelearning by /u/Historical_Insect668 on 2025-02-17 04:27:03+00:00.

Abstract: Self-supervised large language models have demonstrated the ability to perform various tasks via in-context learning, but little is known about where the model locates the task with respect to prompt instructions and demonstration examples.

In this work, we attempt to characterize the region where large language models transition from recognizing the task to performing the task. Through a series of layer-wise context-masking experiments on GPTNEO2.7B, BLOOM3B, STARCODER2-7B, LLAMA3.1-8B, and LLAMA3.1-8B-INSTRUCT, on machine translation and code generation, we demonstrate evidence of a “task recognition” point where the task is encoded into the input representations and attention to context is no longer necessary.
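To make the idea of layer-wise context masking concrete, here is a minimal sketch of how per-layer attention masks could be built so that, from a chosen layer onward, the tokens after the context can no longer attend to the instructions and demonstration examples. This is an illustration only, not the authors' implementation: the function name, the 0-based layer indexing, and the choice to let context tokens keep attending to each other are all assumptions.

```python
def layerwise_context_masks(num_layers, context_len, total_len, mask_from_layer):
    """Build one boolean attention mask per layer (hypothetical helper).

    masks[layer][q][k] is True when query position q may attend to key
    position k at that layer. Positions [0, context_len) hold the prompt
    context (instructions and examples); the rest hold the test input.
    """
    masks = []
    for layer in range(num_layers):
        # Assumption: layers are 0-indexed and masking applies from
        # mask_from_layer (inclusive) up to the last layer.
        hide_context = layer >= mask_from_layer
        mask = []
        for q in range(total_len):
            row = []
            for k in range(total_len):
                allowed = k <= q  # standard causal constraint
                # From the masking layer on, non-context queries lose
                # access to every context key.
                if hide_context and q >= context_len and k < context_len:
                    allowed = False
                row.append(allowed)
            mask.append(row)
        masks.append(mask)
    return masks


# Toy setup: 4 layers, 3 context tokens, 2 test-input tokens,
# context hidden from layer 2 onward.
masks = layerwise_context_masks(
    num_layers=4, context_len=3, total_len=5, mask_from_layer=2
)
```

If task performance is unchanged when the context is hidden from some layer onward, that layer is a candidate for the "task recognition" point described in the abstract.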

Taking advantage of this redundancy results in 45% computational savings when prompting with 5 examples, with task recognition achieved at layer 14 / 32, using machine translation as an example. Our findings also have implications for resource- and parameter-efficient fine-tuning; we observe a correspondence between fine-tuning performance of individual LoRA layers and the task recognition layers.
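A rough back-of-envelope check on the savings figure, under simplifying assumptions that are not taken from the paper: compute scales with (tokens processed) x (layers), every demonstration example is as long as the test input, and instruction and generated tokens are ignored. The function name and this cost model are hypothetical.

```python
def estimated_savings(num_layers, recognition_layer, num_examples):
    """Fraction of (token x layer) work skipped if context tokens are
    dropped after the task recognition layer (hypothetical cost model)."""
    # Assumption: each example has the same length as the test input,
    # so the context makes up k / (k + 1) of all prompt tokens.
    context_fraction = num_examples / (num_examples + 1)
    # Layers after the recognition layer no longer process the context.
    skipped_layer_fraction = (num_layers - recognition_layer) / num_layers
    return context_fraction * skipped_layer_fraction


savings = estimated_savings(num_layers=32, recognition_layer=14, num_examples=5)
```

With 32 layers, recognition at layer 14, and 5 examples, this crude model gives (5/6) x (18/32), or about 47%, which is in the same range as the 45% reported in the abstract. The paper's own accounting may differ.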

Paper Link, Code
