by advaith08 on 9/9/23, 5:43 PM with 51 comments
I've been thinking about the idea of an LLM that's a clone of me: instead of generating replies to be a helpful assistant, it generates replies that are exactly like mine. The concept's appeared in fiction numerous times (the talking paintings in Harry Potter that mimic the person painted, the clones in The Prestige), and I think with LLMs, there might actually be a possibility of us doing something like this!
I've just released a fork of facebookresearch/llama-recipes that lets you fine-tune a Llama model on your personal WhatsApp conversations. This adaptation trains the model (using QLoRA) to respond in a way that's eerily similar to your own texting style.
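To make the data side of this concrete, here is a minimal sketch of turning an exported chat into prompt/response pairs for fine-tuning. It is not the repo's actual preprocessing. The line format (`9/9/23, 5:43 PM - Sender: text`), the function names `parse_chat` and `build_pairs`, and the pairing rule (each of my messages paired with the message right before it from someone else) are all assumptions for illustration; real exports differ by platform and locale.

```python
import re

# Assumed export line format: "9/9/23, 5:43 PM - Sender: message text".
# Real exports vary by platform and locale, so adjust the pattern as needed.
LINE_RE = re.compile(
    r"^(\d{1,2}/\d{1,2}/\d{2,4}), (\d{1,2}:\d{2}(?:\s?[AP]M)?) - ([^:]+): (.*)$"
)


def parse_chat(text):
    """Return a list of (sender, message) tuples from an exported chat."""
    messages = []
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if match:
            messages.append((match.group(3), match.group(4)))
        elif messages:
            # A line without a header is treated as a continuation
            # of the previous (multi-line) message.
            sender, body = messages[-1]
            messages[-1] = (sender, body + "\n" + line)
    return messages


def build_pairs(messages, me):
    """Pair each of my messages with the preceding message from someone else."""
    pairs = []
    for (prev_sender, prev_text), (sender, text) in zip(messages, messages[1:]):
        if sender == me and prev_sender != me:
            pairs.append({"prompt": prev_text, "response": text})
    return pairs
```

A real pipeline would probably want more context than a single preceding message, but the shape of the data is the same.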
What I've figured out so far:
Quick Learning: The model quickly adapts to personal nuances, emoji usage, and phrases that you use. I've trained just 1 epoch on a P100 GPU using QLoRA and 4-bit quantization, and it's already captured my mannerisms.
Turing Tests: As an experiment, I asked my friends to ask me 3 questions, and replied to each with 2 candidate responses (one from me and one from Llama). My friends then had to guess which candidate response was mine and which one was Llama's. Llama managed to fool 10% of my friends, but with more compute, I think it can do way better.
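For anyone who wants to repeat the blind comparison, a rough sketch of how it could be set up follows. The helper names `make_trial` and `fool_rate` are hypothetical, and the score here is computed per question (the share of questions where the guesser picked Llama's reply as mine), which may not be how the 10% figure above was counted.

```python
import random


def make_trial(question, my_reply, model_reply, rng=random):
    """Shuffle the two candidate replies so the guesser can't rely on order."""
    options = [("me", my_reply), ("model", model_reply)]
    rng.shuffle(options)
    labels = ["A", "B"]
    return {
        "question": question,
        "candidates": {label: reply for label, (_, reply) in zip(labels, options)},
        # Label of the reply that was actually written by me.
        "mine": next(label for label, (who, _) in zip(labels, options) if who == "me"),
    }


def fool_rate(trials, guesses):
    """Fraction of trials where the guess did not point at my real reply."""
    if not trials:
        raise ValueError("need at least one trial")
    fooled = sum(guess != trial["mine"] for trial, guess in zip(trials, guesses))
    return fooled / len(trials)
```

Passing a seeded `random.Random` as `rng` makes the ordering reproducible if you want to re-run the same test sheet.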
Here's the GitHub repository: https://github.com/Ads-cmu/WhatsApp-Llama/
Would love to hear feedback, suggestions, and any cool experiences if you decide to give it a try! I'd love to see how far we can push this by training bigger models for more epochs (I ran out of compute credits)
by brap on 9/9/23, 10:28 PM
by dools on 9/9/23, 8:15 PM
How is your most notable example not when Gilfoyle does exactly this so he doesn’t have to talk to Dinesh in Silicon Valley??
by olvy0 on 9/9/23, 9:23 PM
by cypress66 on 9/9/23, 9:03 PM
by tloriato on 9/9/23, 7:37 PM
I wonder how well ChatGPT or Llama 2 would do given just the last 5 messages from each person and a prompt asking it to generate the next reply while pretending to be you…
Somehow I don’t think it would be worse?
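A minimal sketch of that prompting baseline, with no fine-tuning involved. It reads "the last 5 messages of each" as the last 5 messages from each participant, which is an assumption, and the function names and instruction wording are made up for illustration. The resulting string would be sent to whichever chat model you want to compare against.

```python
def last_n_per_sender(history, n=5):
    """Keep the last n messages from each sender, in chronological order."""
    counts = {}
    kept = []
    for sender, text in reversed(history):
        if counts.get(sender, 0) < n:
            counts[sender] = counts.get(sender, 0) + 1
            kept.append((sender, text))
    kept.reverse()
    return kept


def build_prompt(history, me, n=5):
    """Build a prompt asking a model to write the next reply as `me`."""
    recent = last_n_per_sender(history, n)
    transcript = "\n".join(f"{sender}: {text}" for sender, text in recent)
    return (
        f"Below are recent messages from a chat. Write the next reply as {me}, "
        f"matching {me}'s texting style.\n\n"
        f"{transcript}\n{me}:"
    )
```

Running the same blind test against this baseline would show how much the fine-tuning actually adds.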
by oDot on 9/9/23, 8:03 PM
by porridgeraisin on 9/9/23, 7:37 PM
by alt-glitch on 9/9/23, 8:24 PM
edit: I actually started a little work on this. If you wanna export more messages than the limited 40k, you can use [0]. I did and I have every text I've ever sent since I had WhatsApp.
[0]: https://github.com/YuvrajRaghuvanshiS/WhatsApp-Key-Database-...
by rosslazer on 9/9/23, 8:20 PM
by andai on 9/9/23, 8:40 PM
So it would simulate conversations between us.
The result was hilarious yet at times uncomfortably accurate... like looking into a mirror...
by codetrotter on 9/9/23, 10:39 PM
I remember that the moving photos in the newspaper mimic the person.
But I thought the talking paintings were ghosts living in the paintings or something.
by jzemeocala on 9/9/23, 8:31 PM
The intent being to create digital avatars of lost loved ones to help people with the grieving process.
I know that there would be tremendous opportunity in such tech for malicious actors to do serious harm, but the stated goal is still a worthwhile endeavor.
by f0e4c2f7 on 9/9/23, 9:12 PM
by jmkni on 9/9/23, 7:32 PM
by lacrimacida on 9/9/23, 11:07 PM
by gojomo on 9/9/23, 8:15 PM
I expect there will be profitable businesses based on training LLMs to simulate eminent people & celebrities – on both their public utterances and their private correspondence – then charging for access to the best models.
by slmkbh on 9/10/23, 11:39 AM
by andai on 9/10/23, 12:35 AM
Does anyone know a convenient way to access the kind of GPUs required for this?
Should I just pay for Google Colab?
by andai on 9/10/23, 12:36 AM
by SubiculumCode on 9/9/23, 9:08 PM
by RockstarSprain on 9/9/23, 8:50 PM
by BasedAnon on 9/9/23, 9:16 PM