by felarof on 6/20/25, 4:35 PM with 206 comments
-- Why bother building a new browser? For the first time since Netscape was released in 1994, it feels like we can reimagine browsers from scratch for the age of AI agents. The web browser of tomorrow might not look like what we have today.
We saw how tools like Cursor gave developers a 10x productivity boost, yet the browser—where everyone else spends their entire workday—hasn't fundamentally changed.
And honestly, we feel like we're constantly fighting the browser we use every day. It's not one big thing, but a series of small, constant frustrations. I'll have 70+ tabs open from three different projects and completely lose my train of thought. And simple stuff like reordering tide pods from amazon or filling out forms shouldn't need our full attention anymore. AI can handle all of this, and that's exactly what we're building.
Here’s a demo of our early version https://dub.sh/nxtscape-demo
-- What makes us different We know others are exploring this space (Perplexity, Dia), but we want to build something open-source and community-driven. We're not a search or ads company, so we can focus on being privacy-first – Ollama integration, BYOK (Bring Your Own Keys), ad-blocker.
Btw we love what Brave started and stood for, but they've now spread themselves too thin across crypto, search, etc. We are laser-focused on one thing: making browsers work for YOU with AI. And unlike Arc (which we loved too but got abandoned), we're 100% open source. Fork us if you don't like our direction.
-- Our journey hacking a new browser To build this, we had to fork Chromium. Honestly, it feels like the only viable path today—we've seen others like Brave (started with electron) and Microsoft Edge learn this the hard way.
We also started with why not just build an extension. But realized we needed more control. Similar to the reason why Cursor forked VSCode. For example, Chrome has this thing called the Accessibility Tree - basically a cleaner, semantic version of the DOM that screen readers use. Perfect for AI agents to understand pages, but you can't use it through extension APIs.
That said, working with the 15M-line C++ chromium codebase has been an adventure. We've both worked on infra at Google and Meta, but Chromium is a different beast. Tools like Cursor's indexing completely break at this scale, so we've had to get really good with grep and vim. And the build times are brutal—even with our maxed-out M4 Max MacBook, a full build takes about 3 hours.
Full disclosure: we are still very early, but we have a working prototype on GitHub. It includes an early version of a "local Manus" style agent that can automate simple web tasks, plus an AI sidebar for questions, and other productivity features (grouping tabs, saving/resuming sessions, etc.).
Looking forward to any and all comments!
You can download the browser from our github page: https://github.com/nxtscape/nxtscape
by kevinsync on 6/20/25, 9:17 PM
Bookmarks don't cut it anymore when you've got 25 years of them saved.
Falling down deep rabbit holes because you landed on an attention-desperate website to check one single thing and immediately got distracted can be reduced by running a bodyguard bot to filter junk out. Those sites create deafening noise that you can squash by telling the bot to just let you know when somebody replies to your comment with something of substance that you might actually want to read.
If it truly works, I can imagine the digital equivalent of a personal assistant + tour manager + doorman + bodyguard + housekeeper + mechanic + etc, that could all be turned off and on with a switch.
Given that the browser is our main portal to the chaos that is internet in 2025, this is not a bad idea! Really depends on the execution, but yeah.. I'm very curious to see how this project (and projects like it) go.
by hannob on 6/20/25, 5:49 PM
Is this a common and well-defined term that people use? I've never heard it.
It would appear to me from the context that it means something like "web browser with AI stuff tackled on".
by mullingitover on 6/20/25, 10:01 PM
On the other hand: this has the potential to be an absolute security Chernobyl. A browser is likely to be logged into all your sensitive accounts. An agent in your browser is probably going to be exposed to untrusted inputs from the internet by its very nature.
You have the potential for prompt injection to turn your life upside down in a matter of seconds. I like the concept but I wouldn't touch this thing with a ten foot pole unless everyone in the supply chain was PCI/SOC2/ISO 27001 certified, the whole supply chain has been vetted, and I have blood oaths about its security from third party analysts.
by gtsop on 6/20/25, 4:51 PM
Appreciate the agplv3 licence, kudos on that.
by Tsarp on 6/21/25, 1:25 PM
Its kinda built really well without exposing webdriver etc and can comfortably run js and communicate with LLMs.Has full agentic capabilites.
Why a new browser instead of a robust extension?
by ppqqrr on 6/21/25, 5:26 PM
In the meantime, the bigger opportunity with relatively little competition at the moment is the Web itself, not which application to browse it with. The Web absolutely sucks, and that's most of the reason we even feel the need for an "elevated browser experience" in the first place (i.e. lifted trucks on the information highway).
The Web sucks, because it was built naively, then optimized for profitable friction. But all of it stood on the assumption that the cost of production on the web involves highly skilled human labor. LLMs have shattered that assumption, but the effects have not manifested. Which is to say that the entire existing Web is probably going to become a marginal, legacy corner of a much bigger base of LLM-driven hypermedia contents that is yet to come.
by arjunchint on 6/22/25, 8:56 AM
In fact, we built one, rtrvr.ai that has even better Web Agent performance than Open AI's Operator with human assistant and 7x faster than leading competitor: https://www.rtrvr.ai/blog/web-bench-results
Your Accessibility Tree requirement is a poor excuse, rather you should build up an agent from a first principles understanding of DOM interactions.
A browser is a SERIOUS security risk, you need a dedicated team to just pull in the latest security patches that Google pushes to Chromium or your users are sitting ducks to exploits and hacks...
by wanderingmind on 6/21/25, 4:16 AM
[1] https://data.sa.gov.au/data/dataset/reservoir-volumes-2018
by varenc on 6/21/25, 1:06 AM
some genuine feedback on a frustrating early experience:
- I ran the suggested "Group all my tabs by topic" in productivity agent mode. It worked great.
- I then asked it to remove all tab groups and reset things, but was told this:
This is a browser automation task. Please use **Agent Mode** for web interactions like clicking, filling forms, navigating sites, or extracting web content.
- Tried "agent mode" and was told: This is a productivity task. Please use **Chat Mode** for tab management, bookmarks, sessions, history, and content analysis.
- Basically was being sent back and forth. Went back to productivity mode and argued with it for a bit. The closest I could come to it removing all tabs groups was creating a new tab group encompassing all tabs, but couldn't get it to remove groups entirely. I'm guessing it might lack that API?Overall, it'd be nice if every browser level action it took had an undo button. Or at least if it was smart enough/able to remove the tab groups it just created.
Will keep playing with it more.
edit1: one more weird issue: While running the chat interface on chrome internal pages like chrome://extensions, it would randomly browse me to google.com for some reason.
edit2: confirmed that productivity mode lacks a tool to ungroup tabs, just a tool to create tab groups.
by lecro on 6/20/25, 10:21 PM
Great product though.
by blindriver on 6/21/25, 3:33 PM
by dvt on 6/20/25, 8:42 PM
A chat interface works for ChatGPT because most folks use it as a pseudo-search, but productivity tools are (broadly speaking) not generative, therefore shouldn't be using freeform inputs. I have many thoughts on fixing this, and it's a very hard problem, but simply slapping an LLM onto Chrome is just lazy. I don't mean to be overly negative, but it's kind of wild to see YC funding slop like this.
And that's exactly what this is: slop. There's no technical creativity here, this isn't a new product segment, it barely deserves the "hey bro, this might be a feature, not a product" startup 101 criticism. It's what ChatGPT would spit out if you asked it what a good startup idea would be in 2025. All we need to do, even if we were being as charitable as possible, is ask who's doing the heavy lifting here (hint: it's not in the Github repo).
by deepdarkforest on 6/20/25, 8:08 PM
Good luck, but in your place i would at least start with something that a certain ICP needs more. Many, many manhours have been wasted by ambitious technical founders on taking down Chrome. (many also starting from a chrome fork itself). But none of them succeeded. We only have limited energy
by gwhr on 6/24/25, 2:44 PM
by zahlman on 6/20/25, 10:04 PM
So your thesis is that an AI agent should decide what I pay attention to, rather than me?
What could possibly go wrong?
by symbolford on 6/22/25, 7:59 AM
https://chromium.googlesource.com/chromium/src/+/main/docs/l...
It seems like gemini 2.5 is pre-trained on chromium given it has a lot of context into the current state of the codebase. That may help a bit.
Also, I've not tried this yet, but maybe using an MCP server over a clangd language server is a solution to the context problem?
https://chromium.googlesource.com/chromium/src/+/master/docs...
by xena on 6/20/25, 5:08 PM
by joshstrange on 6/21/25, 3:55 PM
I asked it to find the best price for a new Seagate Exos 24TB drive on Amazon (GPT-4o):
> The best price found for a new Seagate EXOS 24TB enterprise hard drive on Amazon is $274.99 (model ST24000NM000C). This is a renewed/refurbished unit with 4.1/5 stars from 39 reviews. Note: The search results show mostly renewed units, and new units may be available at different prices on individual product pages.
Ok, fair, you couldn't find a new one on Amazon. I would have prefered a "Couldn't find one" instead of leading with the price and only in the second sentence mentioning it's a refurb. Note, this is mostly an LLM failing, not Nxtscape, that part seemed to work well overall.
I then followed up with "I only want a new drive, please look at other popular electronic retailers". It replied with (formatted slightly):
> Best Buy - WD Blue 2TB Internal Hard Drive ($59.99, on sale from $70)
> Newegg - Desktop Internal Hard Drives (500GB up to 8TB available) - Price range: $180-$299 - 4.5 store rating - Features: 30-day returns, 5-9 day delivery
> Amazon - Seagate EXOS Enterprise Hard Drives (up to 24TB)
> Additional options include SSDs from Newegg ($60-$201) if you're interested in faster storage solutions.
As almost always with LLMs, I see where it went off the beaten path. I didn't specify "a new one with the same specs I originally asked for", I shouldn't have to. This is probably mostly on the LLM, I don't know if Nxtscape could improve that with prompting (I don't know exactly what they are sending to judge either way). Also it got lazy with the Amazon response (no price).
One way that Nxtscape might be able to improve is to parse out what the user is asking for, creating a data structure to define a "result" (in my case: url, name, price, description?), use that to prompt the LLM to conform to that shape, then take the results and pass them all through a one-off LLM instance to summarize the data. I think that would help with the inconsistencies in the results. Then again, that's very "Data extraction"/"data lookup"-focused and I haven't even played with using it for input: Fill out this form, loop this process for input (mail merge), etc.
Really cool idea and I'll try throwing some other problems at it as I think of them, but mostly for fun/research, this doesn't seem like force-multiplier for my normal workflows (yet).
by admiralrohan on 6/21/25, 6:49 PM
by b0a04gl on 6/20/25, 5:04 PM
by awongh on 6/20/25, 6:21 PM
Sort of like a backwards perplexity search. (LLM context is from open tabs rather than the tool that brings you to those tabs)
I built a tab manager extension a long time ago that people used but ran into the same problem- the concept of tab management runs deeper than just the tabs themselves.
by lucasyvas on 6/21/25, 3:44 PM
Web browsers as applications are made completely useless by the AI wave and I fully expect only the webview portion to survive long run.
Microsoft, Apple, and Google are best positioned to capitalize on this. Meta is further behind but has decent glasses.
The pendulum is on its way back to native.
by ajb on 6/20/25, 10:04 PM
* Buying a sofa. You want to filter for sofas of a specific size, with certain features, marketing sites want to feed you a bunch of marketing slop for each sofa before giving you the details . This generalises to many domains.
* You have a few friends who are still stuck on Facebook, you want to be notified if they post anything and avoid other rubbish
* The local neighborhood is stuck organising in a Facebook group or even worse, nextdoor. You want to see any new posts except for those couple of guys who are always posting the same thing.
* A government consultation website has been put up, but as a hurdle the consultation document has been combinatorially expanded to 763 pages by bureaucratic authoring techniques. You want to undo the combinatorial expansion do you can identify things you actually care about.
by midenginedcoupe on 6/22/25, 11:22 AM
by ellisd on 6/20/25, 8:08 PM
by mahoro on 6/20/25, 6:01 PM
by anilgulecha on 6/20/25, 4:51 PM
edit: Just read about the accessibility thing, but that's thin. Is there any usecase in the future that a browser can, but an extension can't?
by lxe on 6/20/25, 4:53 PM
by kerv on 6/20/25, 8:12 PM
What is the tech around the thing that segments out DOM elements automatically and shows the visual representation. I think something like this would be great for automated UI testing agents?
by 8organicbits on 6/20/25, 11:11 PM
There's a straw man here. If you want to reorder an item on Amazon: click on 'order history', scroll, and click buy. This is a well-optimized path already and it doesn't require your full attention. I suspect the agent approach takes more effort as you need to type and then monitor what the AI is doing.
by b0ner_t0ner on 6/21/25, 3:10 AM
by OsrsNeedsf2P on 6/20/25, 5:31 PM
Also what's the business model?
by Babkock on 6/20/25, 5:35 PM
by _fw on 6/20/25, 5:14 PM
Instead of manually hunting across half a dozen different elements, then copy/paste and retype to put something into a format I want…
I can just get Dia do it. In fact, I can create a shortcut to get it to do it the same way every single time. It’s the first time I’ve used something that actually feels like an extension of the web, instead of a new way to simply act on it at the surface level.
I think the obvious extension of that is agentic browsers. I can’t wait for this to get built to a standard where I can use it every day… But how well is it going to run on my 16GB M1 Pro?
by zahirbmirza on 6/20/25, 5:04 PM
by finolex on 6/20/25, 6:01 PM
by pnw_throwaway on 6/21/25, 12:41 AM
All the same, looks like y’all are having fun working on it, and maybe some unforeseen usecase will bubble up.
by revskill on 6/20/25, 6:09 PM
by ugh123 on 6/20/25, 7:46 PM
by afeigenbaum on 6/20/25, 6:49 PM
by thisislife2 on 6/20/25, 5:39 PM
by bosky101 on 6/21/25, 5:53 PM
From a founder hat, I can see why the code base is a moat, hard problem. I hope the effort is worth the cost.
Good luck.
by rodolphoarruda on 6/20/25, 6:44 PM
by Lammy on 6/20/25, 5:35 PM
- https://tsdr.uspto.gov/#caseNumber=76017078&caseSearchType=U...
> PROVIDING MULTIPLE-USER ACCESS TO A GLOBAL COMPUTER INFORMATION NETWORK FOR THE TRANSFER AND DISSEMINATION OF A WIDE RANGE OF INFORMATION; ELECTRONIC TRANSMISSION OF DATA, IMAGES, AND DOCUMENTS VIA COMPUTER NETWORKS; [ELECTRONIC MAIL SERVICES; PROVIDING ON-LINE CHAT ROOMS FOR TRANSMISSION OF MESSAGES AMONG COMPUTER USERS CONCERNING A WIDE VARIETY OF FIELDS]
- https://tsdr.uspto.gov/#caseNumber=76017079&caseSearchType=U...
> PROVIDING INFORMATION IN THE FIELD OF COMPUTERS VIA A GLOBAL COMPUTER NETWORK; PROVIDING A WIDE RANGE OF GENERAL INTEREST INFORMATION VIA COMPUTER NETWORKS
- https://tsdr.uspto.gov/#caseNumber=74574057&caseSearchType=U...
> computer software for use in the transfer of information and the conduct of commercial transactions across local, national and world-wide information networks
by wongarsu on 6/20/25, 5:05 PM
by jklinger410 on 6/20/25, 4:55 PM
by taylorius on 6/20/25, 5:12 PM