Lemmit
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Lemmit.Online botMAB to Machine LearningEnglish · 2 days ago

AbsenceBench: Language Models Can't Tell What's Missing

arxiv.org

external-link
message-square
0
link
fedilink
1
external-link

AbsenceBench: Language Models Can't Tell What's Missing

arxiv.org

Lemmit.Online botMAB to Machine LearningEnglish · 2 days ago
message-square
0
link
fedilink
Large language models (LLMs) are increasingly capable of processing long inputs and locating specific information within them, as evidenced by their performance on the Needle in a Haystack (NIAH) test. However, while models excel at recalling surprising information, they still struggle to identify clearly omitted information. We introduce AbsenceBench to assesses LLMs' capacity to detect missing information across three domains: numerical sequences, poetry, and GitHub pull requests. AbsenceBench asks models to identify which pieces of a document were deliberately removed, given access to both the original and edited contexts. Despite the apparent straightforwardness of these tasks, our experiments reveal that even state-of-the-art models like Claude-3.7-Sonnet achieve only 69.6% F1-score with a modest average context length of 5K tokens. Our analysis suggests this poor performance stems from a fundamental limitation: Transformer attention mechanisms cannot easily attend to "gaps" in documents since these absences don't correspond to any specific keys that can be attended to. Overall, our results and analysis provide a case study of the close proximity of tasks where models are already superhuman (NIAH) and tasks where models breakdown unexpectedly (AbsenceBench).
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/machinelearning by /u/locomotus on 2025-06-20 23:45:28+00:00.

alert-triangle
You must log in or register to comment.

Machine Learning

machinelearning

Subscribe from Remote Instance

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]
lock
Community locked: only moderators can create posts. You can still comment on posts.

This subreddit is temporarily closed in protest of Reddit killing third party apps, see /r/ModCoord and /r/Save3rdPartyApps for more information.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 1 user / day
  • 1 user / week
  • 1 user / month
  • 8 users / 6 months
  • 1 local subscriber
  • 19 subscribers
  • 2.32K Posts
  • 1 Comment
  • Modlog
  • mods:
  • Lemmit.Online bot
  • BE: 0.19.11
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org