by tucson on 10/7/14, 12:54 PM with 9 comments
The point is to be able to search through the whole archive using tags/keywords.
example of tags:
'security'
'crm'
'a/b testing'
'optimization'
'http', 'ssl', 'domain name'
'scala', 'c++', 'php', etc
'lua'
'sql'
'marketing'
'website'
'landing page'
=> get all posts that relate to each tag (and combinations of tags) sorted by points of individual posts/comments.
To do list: 1. import all hacker news database 2. insert in database all tags for all posts/comments, using an algorithm similar to the Kaggle Keyword Extraction algo (https://www.kaggle.com/c/facebook-recruiting-iii-keyword-extraction), which will need to be refined. 3. create great user interface to the new database
------- or if no-one has the time, could anyone advise me on how to download the whole hacker news database?
by captn3m0 on 10/7/14, 1:02 PM
2. This has already been done quite a few times by various apps, most prominently here: http://algorithmia.com/demo/hn (http://blog.algorithmia.com/post/86295023534/algorithmic-tag...)
by wanghq on 10/8/14, 4:57 AM
I think what's useful is various tools if they are still alive. That's why I want to build a toolbox which collects all the useful tools.