from Hacker News

Ask HN: Anyone interested in building a Hacker News with tags?

by tucson on 10/7/14, 12:54 PM with 9 comments

Could anyone help me build a Hacker News with tags? I am only asking people who would like to have it themselves, because my budget only covers the hosting.

The point is to be able to search through the whole archive using tags/keywords.

Example tags:

'security'

'crm'

'a/b testing'

'optimization'

'http', 'ssl', 'domain name'

'scala', 'c++', 'php', etc

'lua'

'sql'

'marketing'

'website'

'landing page'

=> Get all posts related to each tag (or combination of tags), sorted by the points of the individual posts/comments.
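To make the query concrete, here is a rough Python sketch of what I have in mind. Everything in it is an assumption for illustration: the sample posts and their points are made up, and the names `POSTS`, `build_index` and `search` are hypothetical. A real version would sit on top of a database rather than an in-memory list.

```python
from collections import defaultdict

# Hypothetical sample data: (post id, points, tags). Not real HN posts.
POSTS = [
    (1, 120, {"security", "ssl"}),
    (2, 45, {"security", "php"}),
    (3, 300, {"ssl", "http", "security"}),
    (4, 10, {"marketing", "landing page"}),
]


def build_index(posts):
    """Map each tag to the set of post ids that carry it."""
    index = defaultdict(set)
    for post_id, _points, tags in posts:
        for tag in tags:
            index[tag].add(post_id)
    return index


def search(posts, index, *tags):
    """Return ids of posts matching ALL given tags, highest points first."""
    if not tags:
        return []
    # Intersect the id sets of every requested tag; unknown tags match nothing.
    matching = set.intersection(*(index.get(tag, set()) for tag in tags))
    points = {post_id: pts for post_id, pts, _tags in posts}
    return sorted(matching, key=lambda pid: points[pid], reverse=True)


index = build_index(POSTS)
print(search(POSTS, index, "security", "ssl"))
```

With this sample data, searching for 'security' and 'ssl' together returns the posts that have both tags, with the highest-scoring one first.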

To-do list:

1. Import the whole Hacker News database.

2. Insert tags for all posts/comments into the database, using an algorithm similar to the Kaggle Keyword Extraction algo (https://www.kaggle.com/c/facebook-recruiting-iii-keyword-extraction), which will need to be refined.

3. Create a great user interface for the new database.
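For the tagging step, a very naive starting point could look like the sketch below. To be clear, this is not the Kaggle approach: it is a plain vocabulary match that I am using as a stand-in, and the names `TAG_VOCABULARY` and `extract_tags` are hypothetical. It only finds tags that appear literally in the text, which is exactly the part that would need refining.

```python
import re

# Hypothetical tag vocabulary, taken from the example tags in this post.
TAG_VOCABULARY = [
    "security", "crm", "a/b testing", "ssl",
    "c++", "php", "sql", "landing page",
]


def extract_tags(text, vocabulary=TAG_VOCABULARY):
    """Return the vocabulary tags that literally occur in the text."""
    lowered = text.lower()
    found = []
    for tag in vocabulary:
        # Require a non-alphanumeric character (or the text boundary) on both
        # sides, so that 'sql' does not match inside 'mysql'.
        pattern = r"(?<![a-z0-9])" + re.escape(tag) + r"(?![a-z0-9])"
        if re.search(pattern, lowered):
            found.append(tag)
    return found


print(extract_tags("Is SSL enough for security on a PHP landing page?"))
```

Tags come back in vocabulary order, not in order of appearance, and a post that talks about a topic without naming it gets no tag at all.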

------- Or, if no one has the time, could anyone advise me on how to download the whole Hacker News database?