This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/kecepa5669 on 2023-11-05 03:39:57.


I’m wondering if anyone has thought about what a benchmark test for autonomous agents might look like?

For example, I’m thinking about the following: Can I tell it want I’m thinking about purchasing, then have the agent go online, knowing my preferences, then research what’s available, shop for prices, then order it for me with my only involvement being an initial text prompt.

Does anyone else have any other ideas of their personal Turing Test for autonomous agents?