by chiscript on 1/18/20, 10:06 AM with 27 comments
At its core, Flookup is a fuzzy matching add-on that helps you manage text that is less than a 100% match. Beyond that it can be used to:
1. Search for and match data regardless of whether it contains typos.
2. Highlight and delete duplicates duplicates even if the data has mismatched text.
3. Calculate the percentage similarity between strings.
4. Extract unique values from any column based on percentage similarity.
5. Sum and find the average of numbers based on corresponding partial matches.
Because of its versatility, Flookup can be used to return the best match, the next best match, etc. until the minimum percentage similarity is reached. This feature avoids weaknesses other fuzzy matching algorithms have because it safely hands power to the user, and I believe the user is the best judge of which data is a match or not.
Another great feature Flookup has is that it can be used to combine lookup values. This is particularly helpful when your data has many similar strings and you want to add extra information to your lookup value in order to increase the specificity of your query.
Finally, Flookup is good for more than just fuzzy matching; it is the improved replacement for VLOOKUP and INDEX/MATCH that you have been looking for.
Find out more by heading to https://www.getflookup.com, Subscription information is available at https://www.getflookup.com/pricing
by throw_14JAS on 1/20/20, 6:57 AM
My use case is a bit different -- I was doing a lot of database cleanups, particularly CRMs. I rewrote/reused code to build a duplicate detector a number of times; always wish there were a service that I could send data to, and it would flag my dupes. Even was using human labelers to train domain specific models.
by tehabe on 1/20/20, 8:54 AM
by superbrane on 1/20/20, 8:59 AM
by dandare on 1/20/20, 4:27 PM
Good luck with your project!
by samdung on 1/20/20, 11:36 AM
by jacklewis on 1/20/20, 6:23 AM
I see what you did there
by marapuru on 1/20/20, 9:40 AM