NHacker Next
- new
- past
- show
- ask
- show
- jobs
- submit
login
Looks pretty cool. How does your agent understand plain english?
We have built a QA agent that can understand your plain english intent and uses vision to reason and navigate the app to test your intent. You can check our benchmark here
https://finalrun.app/benchmark/ and how we architected our agent for the benchmark https://github.com/final-run/finalrun-android-world-benchmar.... Its all open source
[dead]
Rendered at 16:24:58 GMT+0000 (Coordinated Universal Time) with Vercel.