by ulrikhansen54 on 8/5/21, 1:15 PM with 30 comments
by ALittleLight on 8/6/21, 5:01 AM
by ruinar50 on 8/6/21, 4:09 PM
Does it matter if it's technically overfitting or not if everyone understands what their "one specific thing" is and how to "stitch" them together to get accurate results over a some real-world problem space? (conversely, people have to recognize the limitations.) Also, for "micro-model" as a word, appreciate having neutral vocabulary to talk about a model that doesn't solve the whole problem space, but does work for some of it. As opposed to "overfit model" or "incomplete model", which seem to cast negative connotations on a concept which is potentially useful when properly applied. (Though an eventual consensus on vocabulary likely necessary as the space matures...)
Later parts of the article introduced kick-off, iteration, and prototyping time as concrete benefits. Interested to see a follow-up addressing how micro-models fit into general problem-solving pipeline. What's next in terms of speeding up the assembly-line process? Where do they fit into data-oriented programming on the whole?
by brainwipe on 8/6/21, 8:16 AM
That being said, narrow training sets are a great idea and this application looks great.
by robojoker on 8/6/21, 6:22 AM
by l-lousy on 8/6/21, 10:33 AM
Anyway, I’ve been trying to think of how this could be used for text data, specifically NER, which generally requires a lot more semantic understanding of the input. Sadly it seems like there might not be much room for the ‘micro’ part of the micro models.
by jogundas on 8/6/21, 8:18 AM
However, it is hard to imagine an actual application of the process. If I understand it correctly, the author suggests using a set of micro-models for annotating a dataset which is then used to train another model. The latter model can actually detect Batman in a general environment, ie, can generalize. However, enriching a training dataset by adding adjacent frames depicting Batman from the same movie will likely have limited usefulness when training an actual Batman detection (non-micro!) model. Or do I get the final application wrong?
by tomrod on 8/6/21, 4:52 AM
by underaxon on 8/6/21, 8:14 AM
by klysm on 8/6/21, 7:49 AM
by abz10 on 8/6/21, 1:16 PM
To be fair, most of the industry are amateurs, but most people don’t write medium posts and continue to argue their ignorance on HN.