from Hacker News

Web Scraping with C++

by daolf on 9/22/22, 1:48 PM with 5 comments

  • by beached_whale on 9/22/22, 3:28 PM

    I was playing with gumbo several months ago and made a nicer C++ wrapper that allowed for some interesting ways of doing queries. It was fun playing with combining the predicates and optimizing the result type so that it did less work. https://github.com/beached/gumbo_pp
  • by forgotpwd16 on 9/22/22, 3:01 PM

    It uses libcurl and gumbo (https://github.com/google/gumbo-parser). Gumbo is apparently written in pure C99 (interestingly Curl is written in the even older C89 standard). Will've been more amusing if article was written considering that and used C99.
  • by ncmncm on 9/22/22, 5:59 PM

    The go-to reference for web scraping used to be weboob.