by johns on 4/25/22, 9:01 PM with 380 comments
by johns on 4/25/22, 9:17 PM
[0]: https://twitter.com/willnorris/status/1518694675909013504
by Irishsteve on 4/25/22, 9:12 PM
by jdrc on 4/25/22, 9:21 PM
by shrimpx on 4/25/22, 9:39 PM
First of all, "the algorithm" is probably hundreds of thousands of lines of code, including all the tedious boilerplate like cache policies and multi-AZ logic.
And second of all, doesn't the algorithm include machine learning components, which are trained on terabytes of data? That data will likely be impossible to open source. And open sourcing the neural nets without the training data is mostly meaningless from a transparency perspective?
by mrintellectual on 4/25/22, 9:14 PM
by axg11 on 4/25/22, 9:42 PM
Even if we were to open source all associated code and publish all related documents it would be very difficult to make sense of the entire system. That is precisely why companies such as Twitter A/B test the hell out of everything. What most people think of as "the algorithm" is a complex system that receives many inputs (maybe hundreds) and has dependencies on many other internal Twitter services. Tweets likely pass through multiple filtering steps as well as scoring before you ever see them. Each of these steps is highly contextual, depending on: location, past tweets, verification status, etc. You can attempt to predict the effect of a certain change, but you never know the actual outcome until you test it.
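To make the "multiple filtering steps as well as scoring" point concrete, here is a toy sketch of a filter-then-score pipeline. It is not Twitter's code: every name in it (`Context`, `Candidate`, `not_blocked`, `local_boost`, `rank`) is hypothetical, and the real system presumably has far more inputs and stages. The only thing it is meant to show is that the same candidate tweets can come out in a different order once a single contextual input changes.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Context:
    # Hypothetical per-viewer input; a real system would have many more.
    location: str


@dataclass
class Candidate:
    # Hypothetical candidate tweet with made-up fields.
    text: str
    author_blocked: bool
    base_score: float
    region: str


Filter = Callable[[Candidate, Context], bool]
Scorer = Callable[[Candidate, Context], float]


def not_blocked(c: Candidate, ctx: Context) -> bool:
    # Filtering step: drop tweets from authors the viewer has blocked.
    return not c.author_blocked


def base(c: Candidate, ctx: Context) -> float:
    # Scoring step: a context-free base score.
    return c.base_score


def local_boost(c: Candidate, ctx: Context) -> float:
    # Scoring step: a contextual bonus when tweet and viewer share a region.
    return 0.5 if c.region == ctx.location else 0.0


def rank(candidates: List[Candidate], ctx: Context,
         filters: List[Filter], scorers: List[Scorer]) -> List[Candidate]:
    # Keep candidates that pass every filter, then sort by summed score.
    kept = [c for c in candidates if all(f(c, ctx) for f in filters)]
    return sorted(kept, key=lambda c: sum(s(c, ctx) for s in scorers),
                  reverse=True)


candidates = [
    Candidate("x", author_blocked=True, base_score=0.9, region="US"),
    Candidate("y", author_blocked=False, base_score=0.4, region="IE"),
    Candidate("z", author_blocked=False, base_score=0.6, region="US"),
]
filters = [not_blocked]
scorers = [base, local_boost]

order_ie = [c.text for c in rank(candidates, Context("IE"), filters, scorers)]
order_us = [c.text for c in rank(candidates, Context("US"), filters, scorers)]
```

Even with one filter and two scorers, the ordering depends on who is looking, so reading the source alone would not tell you what any particular user sees.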
I think what will ultimately happen is that _some_ details will be published. Elon will parade that around as a victory for free speech as Twitter is now more "open". In reality, nothing of value will be gained as "the algorithm" isn't a simple function.
by transitivebs on 4/25/22, 9:11 PM
Either this is a mistake, or this is a really, really misguided attempt at a joke from Twitter.
by mrkramer on 4/25/22, 9:30 PM
by CincinnatiMan on 4/25/22, 9:14 PM
by standyro on 4/25/22, 9:34 PM
error forking repo: HTTP 403: The repository exists, but it contains no Git content. Empty repositories cannot be forked. (https://api.github.com/repos/twitter/the-algorithm/forks)
My thoughts:
- Explicit rules for temporary and permanent bans
- Edit button
- More fun and thoughtful conversations like HN
- Less thought bubble Brooklyn based reporters, less VC and side grind hustle snake oil, maybe more comedians and memes?
by edouard-harris on 4/26/22, 1:58 AM
Twitter's EU user base is probably [3] above the 45 million threshold that triggers the strictest transparency requirements under the Act. So perhaps they figure if they're going to be forced to disclose anyway, they might as well do it proactively.
[1] If it's even coherent to talk about their feed ranking system as a single algorithm — see the other comments in this thread.
[2] https://www.theverge.com/2022/4/23/23036976/eu-digital-servi...
[3] https://www.statista.com/statistics/242606/number-of-active-...
by nighthawk454 on 4/25/22, 9:14 PM
by nickysielicki on 4/25/22, 9:22 PM
That doesn’t mean it’s a joke, I see it as a show of goodwill — that there are a handful of people inside Twitter that are excited for transparency and for a revenue model that isn’t entirely based on ads, that are excited to get to work on this right away.
by pddpro on 4/25/22, 9:18 PM
by rickreynoldssf on 4/25/22, 10:12 PM
by paxys on 4/25/22, 9:36 PM
by Barrin92 on 4/25/22, 9:37 PM
So I'm not sure what the ultimate point of this exercise is other than producing faux-transparency.
by xena on 4/25/22, 9:35 PM
by NaturalPhallacy on 4/25/22, 10:33 PM
by unethical_ban on 4/25/22, 9:22 PM
by newbamboo on 4/25/22, 10:31 PM
Elon polled Twitter users about this and the response was overwhelmingly in favor of open source and transparency. Everyone on Twitter got a vote.
If you oppose transparency, as many now do, you lose your credibility. So it's another one of Elon's people hacks, and look at all the morons falling for it.
by EMIRELADERO on 4/25/22, 9:20 PM
by qgin on 4/25/22, 9:45 PM
by bpodgursky on 4/25/22, 10:31 PM
Like, there's no public admission right now of whether "shadow banning" or "ghost banning" is even officially a thing!
Some transparency seems unquestionably more powerful than none, and we can work from there.
by yabones on 4/25/22, 9:15 PM
by rvz on 4/25/22, 9:20 PM
Maybe that is where it is going.
by holtkam2 on 4/25/22, 9:13 PM
by Traster on 4/25/22, 10:29 PM
That seems... bizarre to me?
by sakopov on 4/26/22, 1:21 AM
by g105b on 4/25/22, 9:21 PM
by threeseed on 4/25/22, 9:32 PM
* Chronological - reverse sort by date
* Home - for all of the followed topics, recommended topics, retweets and tweets in the past day, determine the estimated level of engagement, include the highest, and reverse sort by date. This is likely to be a fairly basic ML model.
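A minimal sketch of the two timelines as described in the list above, assuming the engagement estimate is just a number attached to each tweet. The `Tweet` class, the `predicted_engagement` field, and the `top_n` and `window` parameters are all made up for illustration; nothing here is taken from Twitter's actual code.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List


@dataclass
class Tweet:
    text: str
    posted_at: datetime
    # Hypothetical stand-in for the output of an engagement model.
    predicted_engagement: float


def chronological(tweets: List[Tweet]) -> List[Tweet]:
    # Chronological timeline: newest first.
    return sorted(tweets, key=lambda t: t.posted_at, reverse=True)


def home(tweets: List[Tweet], now: datetime, top_n: int = 2,
         window: timedelta = timedelta(days=1)) -> List[Tweet]:
    # Home timeline as guessed in the comment: take tweets from the past
    # day, keep the ones with the highest estimated engagement, then show
    # those newest first.
    recent = [t for t in tweets if now - t.posted_at <= window]
    top = sorted(recent, key=lambda t: t.predicted_engagement,
                 reverse=True)[:top_n]
    return chronological(top)


now = datetime(2022, 4, 25, 21, 0)
tweets = [
    Tweet("a", now - timedelta(hours=1), 0.1),
    Tweet("b", now - timedelta(hours=2), 0.9),
    Tweet("c", now - timedelta(hours=3), 0.5),
    Tweet("d", now - timedelta(days=2), 0.99),
]

chrono_order = [t.text for t in chronological(tweets)]
home_order = [t.text for t in home(tweets, now)]
```

In this toy data, tweet "d" has the highest estimate but falls outside the one-day window, and "a" is the newest but has the lowest estimate, so neither makes the Home list.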
It will be uncontroversial, technically unsophisticated and of no practical use to anyone - users, developers or researchers.
This is not going to be PageRank where some genuine new insight was discovered.
by Synaesthesia on 4/25/22, 9:22 PM
by hazb on 4/26/22, 12:47 AM
I imagine they'd probably start with documentation and white-papers that communicate "here's how we intend for it to work".
It's seriously unlikely anyone in Twitter actually knows how any non-trivial algorithm in the company works. To figure THAT out, they could decide to do a company-wide documentation and instrumentation push like they probably would've had to do for GDPR anyway, which is painful and boring and going to take a very long time.
Failing that, they could just say 'the algorithm as it stands is no longer fit for purpose, given part of its core requirement has become that it needs to be transparent and publishable, and presumably legible. We need to make a new one. Publish the core algorithm. We probably won't deploy it in that exact state, it's going to span multi-services and so on, you obviously don't get the data we used to train the models, but we will work backwards from it and here's an open mechanism to measure how true-to-form it actually is'
by u1tron on 4/26/22, 5:36 AM
by tmaly on 4/26/22, 1:58 AM
by minroot on 4/25/22, 9:58 PM
by zelon88 on 4/25/22, 9:18 PM
by qudat on 4/25/22, 10:10 PM
by drnonsense42 on 4/25/22, 9:50 PM
by LugarOS on 4/25/22, 9:11 PM
by a-dub on 4/25/22, 10:18 PM
if twitter is a game, sinking $43bn into it is kinda like winning or losing the grand final boss level. (unclear which)
wish elon would get back to facilitating the building of useful things. we still don't have a great clean energy generation story.
by TrapLord_Rhodo on 4/25/22, 9:23 PM
by oxplot on 4/25/22, 10:40 PM
by asd88 on 4/25/22, 9:41 PM
by 4e530344963049 on 4/25/22, 10:10 PM
by arthurcolle on 4/25/22, 9:58 PM
by ArtWomb on 4/25/22, 9:14 PM