NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
Browser Agent Benchmark: Comparing LLM models for web automation (browser-use.com)
wiradikusuma 4 days ago [-]
Since we're in this topic, can anyone suggest good AI-based tool for exploratory (fuzzy?) web testing?
pixel_popping 4 days ago [-]
It's lacking the best model (Opus 4.5) on the benchmark tho.
djohnston 4 days ago [-]
Yeah but then their own product might not score the highest.
pixel_popping 3 days ago [-]
Exactly why I'm pointing it out, which feels a bit corrupt, but understandable.
djohnston 3 days ago [-]
tbh i was a bit cranky yesterday - even if they are #2 on a legit benchmark that would be impressive
MagMueller 5 days ago [-]
[dead]
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 04:54:11 GMT+0000 (Coordinated Universal Time) with Vercel.