Congrats on the launch. One complaint: RPA this, non-RPA that, but you never explain what it means. I would write down the acronym fully once at the first mention on the landing page.
fchishtie 3 hours ago [-]
Thank you!! Yeah that's a good point - it's been so engrained in our brains, appreciate the feedback
throw03172019 16 minutes ago [-]
Biggest question is how much of this can be stored / processed on our own infra and with our own lifecycle rules? For example, this can touch a lot of PHI. Screenshots, videos, JSON inputs/outputs etc.
fchishtie 13 minutes ago [-]
logs get written in our customers' premises on a bucket of their choosing (: so PHI doesn't leave their VPC
throw03172019 8 minutes ago [-]
That’s perfect. Is this documented somewhere? Would love a deep dive on security / setup tweaks for data.
throw03172019 49 minutes ago [-]
Does this only revert back to LLM Vision when it catches an error? I.e once the RPA / workflow is built once, it’s efficient for running multiple times (until it catches an error state)?
fchishtie 34 minutes ago [-]
yes effectively, but we use LLM vision in multiple places - for context, there are multiple ways an RPA can fail:
1. RPA code breaks (ex: throws an exception if a window does not exist)
2. RPA reports success but was clicking / typing in the wrong place
3. Underlying system breaks (virtual machine / legacy software)
the skill we have in our MCP is to build the RPA code to throw exceptions where possible so an LLM can understand the context and recover
to avoid false success states we add LLM vision steps in the workflow itself to error out if it sees that the system is in the wrong state
and for the underlying system breaking it can be as simple as having a CRON job that checks the status of the process / the health of the VM and running a script to reboot the system
it depends on the system but the pattern we've seen with RPAs is you can catch maybe 80% of the edge cases in the first week it's been rolled out
dragonsenseiguy 2 hours ago [-]
Small website nitpick: I feel like the "In production with" section's companies logos should be a bit darker, I could barely tell there was something there.
fchishtie 2 hours ago [-]
yes good call out - that customer wheel is so overdue for an update
ilundin 26 minutes ago [-]
Is the cloud LLM the judge based on screenshots with patient/customer data included ? That seems like a no-go for many countries given privacy concerns ?
fchishtie 19 minutes ago [-]
No patient/customer data included in the screenshots - in production we’d basically find some “region” of the screen to screenshot that would help an LLM say yes/no to - ex: “is the nav bar green and does it say Insert Note”
ilundin 6 minutes ago [-]
Ok, that seems doable if working perfect, but what is the tool output "i can see the patient list with search functionality and 30 patients" thing in the demo ? Is this not vision detected ? Or are you digging into windows api (making non standard windows components/widgets non working)?
a-dub 1 hours ago [-]
i'm curious: how does the steady state error rate of a stochastic automated system like this compare with the downtime and errors that come from a (brittle) deterministic bridge that can fail with upgrades? what does the observability look like? (i'm guessing one feature is that the execution log including images/screenshots for each transaction gets saved, which is probably a huge improvement.)
fchishtie 43 minutes ago [-]
it’s a good q - we experimented a lot with computer use / agentic automation and found that at scale a hybrid solution where the automations run as deterministic code with agents for recovery is the best - running automations as code is faster & cheaper & when you’re doing critical tasks (like updating patient records) you don’t want an agent to potentially mess something up.
previously writing RPA code used to take a long time - using AI (and its infinite patience) we can write more durable code that covers more edge cases
And since they’re code based it’s pretty straightforward to an agents monitor them and update their code when upgrades to the underlying system happen etc…
for observability - we have workflow execution logs that store text, videos and screenshots so an agent or a human can debug them - lots and lots of webhooks when things break ! (:
throw03172019 29 minutes ago [-]
How does this compare with CyberDesk (also YC)?
fchishtie 22 minutes ago [-]
We think CyberDesk is great - our main difference is I believe the primary driver for their automations is computer use / agent based - whereas our automations use agents to create/maintain/support python code
theaniketmaurya 2 hours ago [-]
Congrats on the launch! Legacy system users are also one of the slowest to adopt AI. How do you navigate that?
fchishtie 2 hours ago [-]
100% right - we support the AI companies who are selling to the legacy end users - for ex: we don’t sell directly to hospitals, but an AI scribe for doctors that already has a hospital as a customer, we help them integrate to the hospital’s EMR
debarshri 2 hours ago [-]
Legacy system users are also the one who pays the most for tools and services. We sell to enterprise, I can attest to that. If it is relevant usecase and positioning for the market, it should be fine.
fchishtie 2 hours ago [-]
yeah it’s been interesting to watch, we were surprised initially at how much legacy users actually wanted to adopt AI - I think it’s because of how awful the old software can be to interact with
throw03172019 2 hours ago [-]
So AI companies would install this on their customer (practices) computers?
fchishtie 2 hours ago [-]
Yes, more likely on a virtual machine running the legacy software
throw03172019 2 hours ago [-]
Thanks. Most practices are not tech savvy. So how would the VM setup work in their own network / machines?
fchishtie 2 hours ago [-]
Yeah that’s true - in those cases we’ve either worked with their outsourced IT provider to spin up VMs for us or have had to spin up our own VM and connect through a VPN - IT can be very fun…
mingabunga 3 hours ago [-]
Could you use this to test new releases of software for bugs? A bit like TDD but for GUI interactions
fchishtie 3 hours ago [-]
Yes! we have customers doing that
snozolli 1 hours ago [-]
Computer use agents that run on Windows VMs or in the browser. On-premise, cloud
I’ve never heard a customer say “on-premises” when talking about servers they run. On-premise is usually the term regardless if it is “correct”.
snozolli 36 minutes ago [-]
54% of Americans read below a 6th grade level. People word-mash "alot", "atleast", and now "eachother" all the time. Sports commentators use verse when they mean versus.
I'm not suggesting that you correct your customers, but there's no reason to sink to the lowest common denominator when writing.
fchishtie 1 hours ago [-]
thank you - good catch (:
throw03172019 2 hours ago [-]
Please make your trust center public if you are focusing on healthcare AI companies…the footer link is dead.
fchishtie 2 hours ago [-]
Thanks for flagging this!
Boxxed 3 hours ago [-]
What the deuce is an "RPA"?
saheed_laminar 3 hours ago [-]
Its an acronym for Robotic Process Automation.
It usually means triggering mouse clicks and key stokes to perform tasks
fchishtie 3 hours ago [-]
It's a script that simulates clicks/keystrokes on Desktop/Web
viveksingh_17 2 hours ago [-]
[flagged]
Rendered at 18:32:53 GMT+0000 (Coordinated Universal Time) with Vercel.
1. RPA code breaks (ex: throws an exception if a window does not exist) 2. RPA reports success but was clicking / typing in the wrong place 3. Underlying system breaks (virtual machine / legacy software)
the skill we have in our MCP is to build the RPA code to throw exceptions where possible so an LLM can understand the context and recover
to avoid false success states we add LLM vision steps in the workflow itself to error out if it sees that the system is in the wrong state
and for the underlying system breaking it can be as simple as having a CRON job that checks the status of the process / the health of the VM and running a script to reboot the system
it depends on the system but the pattern we've seen with RPAs is you can catch maybe 80% of the edge cases in the first week it's been rolled out
previously writing RPA code used to take a long time - using AI (and its infinite patience) we can write more durable code that covers more edge cases
And since they’re code based it’s pretty straightforward to an agents monitor them and update their code when upgrades to the underlying system happen etc…
for observability - we have workflow execution logs that store text, videos and screenshots so an agent or a human can debug them - lots and lots of webhooks when things break ! (:
I think you meant premises.
https://brians.wsu.edu/2016/05/30/premise-premises/
I'm not suggesting that you correct your customers, but there's no reason to sink to the lowest common denominator when writing.