by peab on 2/19/25, 4:01 PM with 16 comments
by nico on 2/22/25, 5:23 PM
At best it’s just seo spam, at worst it’s collecting people’s emails for direct spam
by cmdtab on 2/22/25, 5:38 PM
by ryantj54 on 2/19/25, 6:02 PM
by ideashower on 2/22/25, 5:39 PM
by simonw on 2/22/25, 5:45 PM
> Estimates for GPT4, for example, give training data sizes of up to 1 petabyte of data.
I followed the provided link, which lead to an ad-laden https://seifeur.com/chat-gpt-4-data-size/ article which looks suspiciously like AI-generated slop. It ends with this set of Q&As which make no sense at all:
> How much data was used to train ChatGPT-4?
> ChatGPT-4 was trained on a dataset size of 570 GB.
> How does the size of GPT-4 compare to GPT-3 in terms of training data?
> GPT-4 has 45 gigabytes of training data, which is significantly larger than GPT-3’s 17 gigabytes.
> How many terabytes of text data does GPT-4 utilize compared to GPT-3?
> GPT-4 utilizes a dataset of 1 petabyte, which is notably larger than GPT-3’s 45 terabytes.
by gostsamo on 2/22/25, 5:29 PM
Not really impressed, tbh, but still fun.
by platelminto on 2/22/25, 5:20 PM
by eek2121 on 2/20/25, 6:45 AM
by Tepix on 2/22/25, 5:33 PM