by rand0mx1 on 7/15/23, 11:59 AM with 123 comments
by hartator on 7/15/23, 2:34 PM
Wait. The Brave browser sends data about your browsing back to the Brave Search engine? Not just your usage of other search engines, but it also crawls pages from your computer to help build their search index?
Ref: https://github.com/brave/web-discovery-project/blob/main/mod...
by 6gvONxR4sf7o on 7/15/23, 3:13 PM
> 1) The purpose and character of the use, including whether such use is of a commercial nature or is for nonprofit educational purposes
> 2) The nature of the copyrighted work
> 3) The amount and substantiality of the portion used in relation to the copyrighted work as a whole
> 4) The effect of the use upon the potential market for or value of the copyrighted work
[emphasis from TFA]
HN always talks about derivative work and transformativeness, but never about these. The fourth one especially seems clear in its implications for models.
Regardless, it makes it seem much less clear cut than people here often say.
by xp84 on 7/15/23, 2:17 PM
> without any worry for copyright infringement because Brave acts as a middleman.
This isn’t how the law works. Unless Brave is explicitly indemnifying all their customers (which their lawyers would have to be insane to let them do), any trouble you could get into is going to be 100% your problem. Pointing the finger at Brave could theoretically get them in trouble too, but would in no way let you off the hook.
by isodev on 7/15/23, 1:46 PM
by k__ on 7/15/23, 4:06 PM
That's genius!
by throwaway72762 on 7/15/23, 1:41 PM
by lern_too_spel on 7/15/23, 4:01 PM
> They don't mention their crawler anywhere in their docs, either. So, if you wanted to block Brave from crawling and indexing and ultimately selling your content to third parties, your only option for the time being would be to block all crawlers, which is how Brave would be able to "respect robots.txt".
by kodah on 7/15/23, 3:44 PM
by lopatin on 7/15/23, 2:24 PM
by ricardo81 on 7/15/23, 11:01 PM
by verisimi on 7/15/23, 1:40 PM
by niemandhier on 7/15/23, 4:07 PM
Articles 3 and 4 of the EU 'Copyright in the Digital Single Market' (CDSM) directive give data miners quite extensive rights.
Move operations to the EU, train a foundational model, then train a constitutional model based on that.
As much as I hate the upcoming AI regulation, the CDSM is solid.
https://academic.oup.com/grurint/article/71/8/685/6650009 https://eur-lex.europa.eu/eli/dir/2019/790/oj
Update: Fixed wrong link
by 411111111111111 on 7/15/23, 2:41 PM
It's also a for-profit company and you're not the customer, as you're not paying them money.
I'd be way more worried about how they're using the data they're collecting on you vs. Google or MS.