The demonstration-based approach is interesting for the handoff problem. The hardest part of agentic automation isn't the first run -- it's making the agent robust to the cases the demonstrator never showed it. How do you handle edge cases or failures mid-task? Does it fall back to asking the user, or does it have some recovery heuristic? Asking because we found that the failure-mode surface matters more than happy-path coverage when you actually deploy these in production.
abraxas
One more tool targeting OSX only. That platform is overserved with desktop agents already while others are underserved, especially Linux.
jedreckoning
cool idea. good idea doing a demo as well.
sukhdeepprashut
2026 and we still pretend to not understand how LLMs work huh