by doe88 on 3/31/23, 8:10 PM with 1 comments
by a_vanderbilt on 3/31/23, 8:30 PM
I have had a theory that Google has started with a less-capable but easier to host LLM in order to obtain RLHF data. That is where I think they are actually behind. They have access to huge amounts of training data, but without the reinforcement feedback, it isn't going to scale in the ways they need it to right now.