by gggggggg on 5/17/17, 10:32 PM with 87 comments
by teddyh on 5/17/17, 11:51 PM
Something like this: Suppose we have a table of strings of digits, some including spaces, and we’d like to remove the spaces. From
123 456
234567
345 678
to
123456
234567
345678
Now, what happens if it encounters, say, 4567890?
Would the result be unchanged (as we would probably want), or would it “cheat” and remove the middle “7” character, giving “456890”?
by ktamura on 5/17/17, 11:19 PM
The beauty of this product is that its adoption strategy is baked into the product itself: I'd share this with all Excel user friends of mine because I want the algorithm to get smarter, and I might even learn a bit of C# myself so that I can contribute and scratch my own itch. This in turn makes the product better (because of the larger training data), lending itself to more word of mouth.
One concern I have is security: I'd love to hear from the folks who built this, or who are more familiar with it, about how to ensure the security of suggested transformations.
by Cieplak on 5/18/17, 3:16 AM
by gergoerdi on 5/18/17, 5:16 AM
For example, given the rule `f "abcde" 2 == "aabbccddee"`, it even figures out the role of the parameter `2`, so `f "zq" 3` gives `"zzzqqq"`.
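To make the rule concrete, here is a minimal hand-written sketch of the function that example implies, in Python rather than the notation above. The name `repeat_each` is hypothetical, and this is only the target function, not the procedure that infers it from the example:

```python
def repeat_each(text: str, count: int) -> str:
    """Repeat every character of `text` `count` times, keeping the order."""
    return "".join(ch * count for ch in text)


# The rule given as the example, and the generalisation to a new input.
print(repeat_each("abcde", 2))  # aabbccddee
print(repeat_each("zq", 3))     # zzzqqq
```

The interesting part is that a single example is enough to pin down both the per-character repetition and the meaning of the numeric parameter.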
by bcherny on 5/18/17, 2:36 AM
https://support.office.com/en-us/article/Use-AutoFill-and-Fl...
by netvarun on 5/17/17, 11:51 PM
[0]https://www.microsoft.com/en-us/research/blog/deep-learning-...
[1]HN Discussion: https://news.ycombinator.com/item?id=14168027
by martinthenext on 5/18/17, 11:54 AM
by wayneprice on 5/17/17, 11:36 PM
It's not production ready / launched yet, but it's getting there.
I'd be interested to hear whether anyone finds (or really doesn't find) this useful :)
by gerhardi on 5/18/17, 7:33 AM
It can't do miracles, but it saves time in many cases, such as when you want to concatenate values from different columns into a single column in a new format, and so on.
by fiatjaf on 5/18/17, 1:07 AM
Ok, just realized somehow the site has vanished. Not working archived version: http://web.archive.org/web/20161028231256/https://www.transf...
by unfamiliar on 5/18/17, 12:29 AM
For example, "sort all of the folders, so that Alan goes before Amy, etc". The rule ("sort") is pretty ambiguous, but one simple example in the context gives enough information to realise you probably mean alphabetically by first name.
Is there something like this example that could be combined with NLP to make things like these "intelligent assistants" we have now much more useful for data processing tasks?
It would be great to describe data manipulation to a machine the way that I would describe it to a colleague: give an overview of an algorithm, watch how they interpret it, and correct with a couple of examples in a feedback loop. Currently describing such things for a machine requires writing the algorithm manually in a programming language.
by logicallee on 5/18/17, 12:22 AM
What I mean is if every row had a date like "12 May 2002" and you wanted it turned into 2002.05.12 then it would be nice if it indicated when it added data. For example if one of the rows just read "15 May" then, since there is no year, it would not be completely absurd if it transformed into 2017.05.15 - or if all of the other data is 2002, then adding that. But I really think silently adding data that was not in the input is going too far. A transform shouldn't ever silently inject plausible data with no indication that this is interpolated. Bad things can result.
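A rough sketch of what I mean, in Python. Everything here is an assumption for illustration: the names `Converted` and `convert_date`, the `default_year` parameter, and the choice to return nothing rather than guess when no default is supplied. It also assumes English month names in the current locale. The point is only that any filled-in field is reported alongside the value instead of being added silently:

```python
from datetime import datetime
from typing import NamedTuple, Optional, Tuple


class Converted(NamedTuple):
    value: Optional[str]       # "YYYY.MM.DD", or None if no safe result
    inferred: Tuple[str, ...]  # names of fields that were NOT in the input


def convert_date(text: str, default_year: Optional[int] = None) -> Converted:
    """Turn '12 May 2002' into '2002.05.12', flagging any inferred field."""
    text = text.strip()
    try:
        full = datetime.strptime(text, "%d %B %Y")
        return Converted(full.strftime("%Y.%m.%d"), ())
    except ValueError:
        pass  # not a complete date; try day and month only

    try:
        # Parse with a placeholder leap year so that "29 February" is accepted.
        partial = datetime.strptime(text + " 2000", "%d %B %Y")
    except ValueError:
        return Converted(None, ())  # not recognisable as a date at all

    if default_year is None:
        return Converted(None, ())  # refuse to invent a year

    try:
        filled = partial.replace(year=default_year)
    except ValueError:
        return Converted(None, ())  # e.g. 29 February in a non-leap year
    return Converted(filled.strftime("%Y.%m.%d"), ("year",))


for row in ["12 May 2002", "15 May"]:
    print(row, "->", convert_date(row, default_year=2002))
```

With this shape, a row like "15 May" either stays unconverted or comes back marked with `inferred=("year",)`, so the interpolation is visible to whoever reviews the output.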
Otherwise great demo!
by mballantyne on 5/18/17, 6:26 AM
https://www.microsoft.com/en-us/research/publication/transfo...
Though it probably also uses more recent work from the same group:
by gshulegaard on 5/18/17, 3:13 AM
by tdbeteam on 5/19/17, 10:47 PM
by JoelJacobson on 5/18/17, 7:37 AM
by matt4711 on 5/18/17, 12:07 AM
"Zhongjun Jin, Michael R. Anderson, Michael J. Cafarella, H. V. Jagadish: Foofah: Transforming Data By Example. SIGMOD Conference 2017: 683-698"
by captnswing on 5/18/17, 9:09 AM
by copperx on 5/18/17, 1:33 AM
by Kiro on 5/18/17, 6:05 AM
by amelius on 5/18/17, 12:22 PM
This would be great for refactoring code.
by tejtm on 5/18/17, 12:08 AM
by cblte on 5/18/17, 1:09 PM
by sjg007 on 5/18/17, 2:15 AM