by gerenuk on 5/4/18, 8:16 PM with 10 comments
Currently, using ElasticSearch for storing the meta data and other raw data information but it is a very small scale around 500,000 domains.
I have been tasked to scale it to 20-40 million domains and storing their internal/external links while building a page rank/domain authority score for each domain which we are adding to our database.
What do you guys suggest/recommend for storing this data at a very large scale as web page internal links/external links will be stored which will lead it over 100M-1B links database?
Any kind of feedback/suggestion would be appreciated.
Thanks.
by nik736 on 5/5/18, 5:21 PM
by sharemywin on 5/4/18, 8:29 PM
https://dba.stackexchange.com/questions/38793/which-database...
There's a nice little triangle diagram here: https://stackoverflow.com/questions/2794736/best-data-store-...
by girishso on 5/5/18, 8:07 PM
by drizzle87 on 5/7/18, 2:34 PM
by jjirsa on 5/6/18, 6:52 AM
Cassandra can do many orders of magnitude more than 1B, but would limit you in your query patterns.
by mr__y on 5/5/18, 4:50 PM
by cimmanom on 5/5/18, 1:05 AM
by dchuk on 5/5/18, 12:18 AM