I was reading the blog post about bot detection with browsers. The first layer being the IP address of the browser.
One rather unique scenario I've been trying to work out for a scraper is eliminating network latency. My use of the site is enhanced by the request from the browser having the lowest RTT latency to the webserver as possible. This means being in the same cloud provider.
To do this right now I manually navigate to the site and have a browser extension that clicks at just the right time.
I'd really like to eliminate that manual navigation but every time I've tried adding browser automation outside of the single click from the extension, I'm immediately met with bot detection.
Obviously adding a residential proxy step completely defeats the purpose of the RTT latency optimization.
Do modified browsers drive the overall bot detection heuristic low enough that the cloud IP address itself isn't a red flag? I've seen Camoufox and will try it at some point. What other options are available to drive down the overall "score" so I can still automate the browser but keep the latency low?
trollbridge
Biggest question I have is how this will overcome sites that implement aggressive anti-automation security. I can easily automate websites with existing tools until I slam into that wall.
show comments
Oras
Congrats on launch. I have experienced these issues first hand with `Open Finance` a few years ago.
I feel that you'll end up being an automation agency (you mentioned UiPath), companies who have the skills and capacity to build, will not need your service. But those who want the full service, you might fill a gap.
I wish you all the best.
show comments
jackienotchan
I'm always genuinely curious on how startups navigate the founder maze as it helps to break the myth of an overnight success story.
Based on your YC page, you went through a couple of pivots over the last years:
- 4 years ago: Intuned - The data assistant for engineering leaders [0]
- 2 years ago: Intuned - The browser automation platform for developers and product teams [1]
- 1 year ago: Intuned Auth Sessions - Build authenticated scrapers and RPA [2]
What was kind of the evolution from YC S22 4 years ago till you arrived at today's launch? How did you find your differentiation in a highly commoditized space? Even within YC, there are many competitors like Firecrawl, Reworkd, BrowserUse, NotteLabs, Browserbase, etc.
Another thing that might interest HN: AI crawlers come with negative side effects for website owners (costs, downtime, etc.), as repeatedly reported here on HN (and experienced myself).
Does Intuned respect robots.txt directives and do you disclose the identity of your crawlers via user-agent header?
Really cool!
I was reading the blog post about bot detection with browsers. The first layer being the IP address of the browser.
One rather unique scenario I've been trying to work out for a scraper is eliminating network latency. My use of the site is enhanced by the request from the browser having the lowest RTT latency to the webserver as possible. This means being in the same cloud provider.
To do this right now I manually navigate to the site and have a browser extension that clicks at just the right time.
I'd really like to eliminate that manual navigation but every time I've tried adding browser automation outside of the single click from the extension, I'm immediately met with bot detection.
Obviously adding a residential proxy step completely defeats the purpose of the RTT latency optimization.
Do modified browsers drive the overall bot detection heuristic low enough that the cloud IP address itself isn't a red flag? I've seen Camoufox and will try it at some point. What other options are available to drive down the overall "score" so I can still automate the browser but keep the latency low?
Biggest question I have is how this will overcome sites that implement aggressive anti-automation security. I can easily automate websites with existing tools until I slam into that wall.
Congrats on launch. I have experienced these issues first hand with `Open Finance` a few years ago.
I feel that you'll end up being an automation agency (you mentioned UiPath), companies who have the skills and capacity to build, will not need your service. But those who want the full service, you might fill a gap.
I wish you all the best.
I'm always genuinely curious on how startups navigate the founder maze as it helps to break the myth of an overnight success story.
Based on your YC page, you went through a couple of pivots over the last years:
- 4 years ago: Intuned - The data assistant for engineering leaders [0]
- 2 years ago: Intuned - The browser automation platform for developers and product teams [1]
- 1 year ago: Intuned Auth Sessions - Build authenticated scrapers and RPA [2]
What was kind of the evolution from YC S22 4 years ago till you arrived at today's launch? How did you find your differentiation in a highly commoditized space? Even within YC, there are many competitors like Firecrawl, Reworkd, BrowserUse, NotteLabs, Browserbase, etc.
Another thing that might interest HN: AI crawlers come with negative side effects for website owners (costs, downtime, etc.), as repeatedly reported here on HN (and experienced myself).
Does Intuned respect robots.txt directives and do you disclose the identity of your crawlers via user-agent header?
[0] https://www.ycombinator.com/launches/Gqr-intuned-the-data-as...
[1]https://www.ycombinator.com/launches/LGE-intuned-the-browser...
[2] https://www.ycombinator.com/launches/Lpq-intuned-auth-sessio...
I create scrapers on my own when I need one, but this is a cool idea. Best of luck with it.
This is a really neat idea. Congrats on the launch!
Is this a bet that Computer Use models don't get better and cheaper?
cool but we already pay $200/month to OpenAI to turn automation into code
Where have you found early traction with users? Why has your solution been useful for these users relevant to the other options?
How well would this work for a "go to hotel booking site, book 2 weeks in June for a family of 4" type of workflows?
this is actually really cool
What's the benefit of using playwright instead of an official webdriver directly?