by cdl on 12/27/18, 3:25 PM with 1 comments
by cdl on 12/27/18, 3:43 PM
My goal with this post and accompanying GitHub project is to provide a more accurate example of an enterprise-grade data science project that includes what is often missed in these sorts of examples: the rigorous positioning of a business problem and scoping of a data science solution.
The majority of the time spent to complete this project was allocated to the understanding and definition of the business problem rather than optimizing a machine learning model. As with any other complex problem, spending some time upfront to work through the various aspects of a business problem in a systematic way will reduce the risk of coming up with the wrong solution, which clearly wastes time (read: opportunity loss) and money.