by jasonwcfan on 9/17/24, 1:26 PM with 77 comments
This was our launch: https://news.ycombinator.com/item?id=36032081
We recently decided to revive and rebrand the project after seeing a sudden spike in interest from people who wanted to connect LLMs to data, specifically through browsers. It's also a problem we've experienced firsthand, having built scraping features into Psychic and previously worked on bot detection at Robinhood.
If you haven’t built a web scraper or browser automation before, you might assume it’s very straightforward. People have been building scrapers for as long as the internet has existed, so there must be many tools for the job.
The truth is that web scraping strategies need to constantly adapt as web standards change, and as companies that don’t want to be scraped adopt new technologies to try to block it. The old standards never completely go away, so the longer the internet exists, the more edge cases you’ll need to account for. This adds up to a LOT of infrastructure that needs to be set up and a lot of schlep developers have to go through to get up and running.
Scraping is no easier today than it was 10 years ago - the problems are just different.
Finic is an open source platform for building and deploying browser agents. Browser agents are bots deployed to the cloud that mimic the behaviour of humans, like web scrapers or remote process automation (RPA) jobs. Simple examples include scripts that scrape static websites like the SEC's EDGAR database. More complex use cases include integrating with legacy applications that don’t have public APIs, where the best way to automate data entry is to just manipulate HTML selectors (EHRs for example).
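To make the "simple" end of that spectrum concrete, here is a minimal sketch of static-page scraping using only Python's standard library. It is not Finic's API; the `LinkExtractor` class, the `extract_links` helper, and the sample HTML with its filing paths are all made up for illustration, and a real job would first fetch the page over HTTP.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect (href, text) pairs from anchor tags in static HTML."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> tag currently open, if any
        self._text = []     # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Parse an HTML string and return every link that has an href."""
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links


# Hypothetical sample markup standing in for a fetched static page.
SAMPLE_HTML = """
<table>
  <tr><td><a href="/filings/10-K.htm">Annual report</a></td></tr>
  <tr><td><a href="/filings/10-Q.htm">Quarterly report</a></td></tr>
</table>
"""

links = extract_links(SAMPLE_HTML)
```

This only works when the data is already present in the served HTML. Pages that render content with JavaScript, or that sit behind logins and bot detection, are where a real browser becomes necessary.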
Our goal is to make Finic the easiest way to deploy a Playwright-based browser automation. With this launch, you can already do so in just 4 steps. Check out our docs for more info: https://docs.finic.io/quickstart
by ghxst on 9/17/24, 3:17 PM
by suriya-ganesh on 9/17/24, 6:33 PM
If I remember correctly, Skyvern also has an implementation of scaling these browser tasks built in.
P.S. Isn't it called Robotic Process Automation? This is the first time I've heard it called Remote Process Automation.
by dataviz1000 on 9/17/24, 3:52 PM
If you don't already have this feature for your system, I would recommend it.
by Oras on 9/17/24, 2:48 PM
by mdaniel on 9/17/24, 4:53 PM
I have never, ever understood anyone who goes to the trouble of booting up a browser, and then uses a Python library to do static HTML parsing.
Anyway, I was surfing around the repo trying to find what, exactly, "Safely store and access credentials using Finic’s built-in secret manager" means
by krick on 9/17/24, 6:25 PM
by whatnotests2 on 9/17/24, 2:23 PM
I can see almost all web traffic being agents a few years from now.
by j0r0b0 on 9/17/24, 2:57 PM
Your sign-up flow might be broken. I tried creating an account (with my own email) and received the confirmation email, but couldn't get my account verified. I get "Email not confirmed" when I try to log in.
Also, the verification email was sent from accounts@godealwise.com, which is a bit confusing.
by skeptrune on 9/17/24, 3:37 PM
by computershit on 9/17/24, 3:03 PM
by ushakov on 9/17/24, 3:07 PM
Also, curious why your unstructured idea did not pan out?
by ilrwbwrkhv on 9/17/24, 3:02 PM
by slewis on 9/17/24, 3:30 PM
by sebmellen on 9/17/24, 3:42 PM