from Hacker News

Scraping and indexing 1.2B emails for under $200

by jordiee on 3/12/20, 3:01 PM with 6 comments

  • by tdeck on 3/12/20, 9:38 PM

    Is nobody going to mention how this is a bad thing intended for sending spam? Guess I'll have to be that person then.
  • by ad404b8a372f2b9 on 3/12/20, 6:43 PM

    Took me a while to understand they were scraping email addresses and not actual emails.
  • by hbcondo714 on 3/12/20, 6:20 PM

    Wow, the author really calls out his competition! There are also parts 2 and 3 to this article that discuss using Rust and Postgres for their solution.
  • by natmaka on 3/13/20, 7:56 AM

    Help rid the world of spam! Project Honey Pot is our friend. https://www.projecthoneypot.org/
  • by thomas536 on 3/13/20, 1:22 AM

    I must be missing something, because 6.5 days * $21/day = $136.50

    """

    The entire process now took 6.5 days and cost $21/day. Our total cost all said and done was $115!

    """

  • by slowhand09 on 3/12/20, 5:06 PM

    Nice writeup.