from Hacker News

Amazon has a secret workaround to scrape GitHub for model training

by tardismechanic on 6/14/24, 5:33 PM with 4 comments

  • by tardismechanic on 6/14/24, 5:35 PM

  • by fikjusulta on 6/16/24, 3:46 AM

    I would appreciate a formal mechanism to opt out of data collection by Amazon (as well as OpenAI and Microsoft).

  • by smcin on 6/15/24, 1:11 AM

    [Non-paywalled version]: https://dataconomy.com/2024/06/14/amazon-has-a-secret-way-to...

    According to an internal memo obtained by Business Insider, Amazon’s AGI Group worked around GitHub's limit of 5,000 requests per hour per account by 'encouraging' its employees to create multiple GitHub accounts and share their access credentials. By using a network of accounts simultaneously, Amazon aims to condense what would have been a multi-year endeavor into a matter of weeks.

    Dataconomy: The ethical implications are significant. By soliciting employees to share personal GitHub accounts, Amazon is potentially accessing data without explicit consent from GitHub or the repository owners.