• Home
  • About Us
  • Contact Us
  • Privacy Policy
  • Special Offers
Business Intelligence Info
  • Business Intelligence
    • BI News and Info
    • Big Data
    • Mobile and Cloud
    • Self-Service BI
  • CRM
    • CRM News and Info
    • InfusionSoft
    • Microsoft Dynamics CRM
    • NetSuite
    • OnContact
    • Salesforce
    • Workbooks
  • Data Mining
    • Pentaho
    • Sisense
    • Tableau
    • TIBCO Spotfire
  • Data Warehousing
    • DWH News and Info
    • IBM DB2
    • Microsoft SQL Server
    • Oracle
    • Teradata
  • Predictive Analytics
    • FICO
    • KNIME
    • Mathematica
    • Matlab
    • Minitab
    • RapidMiner
    • Revolution
    • SAP
    • SAS/SPSS
  • Humor

Data Science “Paint By Numbers” With The Hypothesis Development Canvas

October 30, 2018   BI News and Info

When I was a kid, I loved Paint by Numbers sets. They make anyone who can paint or color between the lines feel like Rembrandt or Leonardo da Vinci (we can talk later about the long-term impact of forcing kids to “stay between the lines”).

Now the design world is applying the “paint by numbers” concept using design canvases. A design canvas outlines what’s important given the subject area and then allows the “painter” to color in the right information. A design canvas is a one-page operational template designed to capture all of the different perspectives necessary for successful execution depending upon the problem being solved.

A great example of a canvas is the Business Model Canvas, courtesy of Strategyzer (see Figure 1).

Paint Fig 1 Data Science “Paint By Numbers” With The Hypothesis Development Canvas

Figure 1:  Business Model Canvas, courtesy of Strategyzer

The Business Model Canvas forces you to “paint in” your business’s value proposition, cost structures, revenue streams, supplier network, and customer segments. Overnight, everyone can become Jack Welch!

Now you are ready to take the next step from a Big Data MBA perspective by building on the Business Model Canvas to flesh out the business use cases – or hypotheses. This is where you can become more effective at leveraging data and analytics to optimize the business. That next step involves the newly created Hypothesis Development Canvas.

Introducing data science paint by the numbers

The one area of under-investment in most data science projects is the thorough and comprehensive development of the hypothesis or use case that is being tested. That is, what are we are trying to prove out with our data science engagement, and how do we measure progress and success?

To address these requirements, we developed the Hypothesis Development Canvas, a “paint by numbers” template that you can populate before executing a data science engagement. The aim is to ensure that you thoroughly understand what you’re trying to accomplish, the business value, how you will measure progress and success, and the impediments and potential risks associated with the hypothesis. The Hypothesis Development Canvas is designed to facilitate the business stakeholder–data science collaboration (see Figure 2).

Paint fig 2 Data Science “Paint By Numbers” With The Hypothesis Development Canvas

Figure 2:  Hypothesis Development Canvas

The Hypothesis Development Canvas includes the following:

  • Hypothesis description and objectives – what the organization is trying to predict and its associated goals (e.g., reduce unplanned operational downtime by X%, improve customer retention by X%, reduce obsolete and excessive inventory by X%, improve on-time delivery by X%).
  • Hypothesis business value from the financial, customer, and operational perspectives – the rough-order ROI from successfully addressing the hypothesis.
  • KPIs – measurement of success and progress, and the exploration of the risks associated with potential second- and third-order ramifications of KPIs. See the blog “Unintended Consequences of the Wrong Measures” for more details on second- and third-order ramifications of KPIs.
  • Decisions – the what, when, where, who, etc. – that need to be made to support and drive actions and automation in support of the hypothesis’s business, customer, and operational objectives.
  • Potential data sources – a list to explore, including a brief description and why the business stakeholders consider each source might be useful.
  • Risks – potential issues associated with false positives and false negatives (Type I and Type II errors); risks associated with those scenarios where the analytic model is wrong.

A vision workshop accelerates the collaboration between the business stakeholder and the data science team to identify the hypothesis requirements that underpin the success of data science engagement.

The Machine Learning Canvas (Big Data MBA version)

To complete the loop, I introduce the machine learning canvas. I stumbled upon the Machine Learning Canvas v0.4 from Louis Dorard at the website “Machine Learning Canvas.” Louis has made his canvas freely available, and I will do likewise with the additions that we made to his canvas based upon our unique data science requirements (see Figure 3).

Paint fig 3 Data Science “Paint By Numbers” With The Hypothesis Development Canvas

Figure 3:  Machine Learning Canvas (Big Data MBA version)

For purposes of our data science work, we needed to add to the panels:

  • Prescription: Once we have a prediction, what do we do with that prediction?
  • Automation:  How do we automate standard procedures with the prescriptive insights?

Summary

A successful data science engagement requires close collaboration with the business stakeholders throughout the development process to:

  • Understand and quantify the sources of financial, operational, and customer value creation (it’s an economics thing).
  • Gain a thorough understanding of the KPIs and metrics against which to measure progress and success, and in particular, the potential second- and third-order ramifications of those KPIs and metrics.
  • Brainstorm the variables and metrics (data sources) that might yield better predictors of business and operational performance.
  • Codify the rewards/benefits and the costs/risks associated with the hypothesis (including the risks and costs associated with false positives and false negatives).
  • Closely collaborate with the business stakeholders to understand when “good enough” is actually “good enough” from a predictive analytics perspective.

You now have the three design canvases that allow you to link your business model to the data science and machine learning efforts, and as well, to give the data science team a direct “line of sight” from how their work impacts the business models.

Paint fig 4 Data Science “Paint By Numbers” With The Hypothesis Development Canvas

Figure 4: Linking business model to data science to machine learning

So to quote the famous American philosopher and part-time groundskeeper Carl Spackler, “Now I’ve got that goin’ for me, which is nice.”

Let’s block ads! (Why?)

Digitalist Magazine

Canvas, data, Development, Hypothesis, Numbers, Paint, Science
  • Recent Posts

    • Experimenting to Win with Data
    • twice-impeached POTUS* boasts: “I may even decide to beat [Democrats] for a third time”
    • Understanding Key Facets of Your Master Data; Facet #2: Relationships
    • Quality Match raises $6 million to build better AI datasets
    • Teradata Joins Open Manufacturing Platform
  • Categories

  • Archives

    • March 2021
    • February 2021
    • January 2021
    • December 2020
    • November 2020
    • October 2020
    • September 2020
    • August 2020
    • July 2020
    • June 2020
    • May 2020
    • April 2020
    • March 2020
    • February 2020
    • January 2020
    • December 2019
    • November 2019
    • October 2019
    • September 2019
    • August 2019
    • July 2019
    • June 2019
    • May 2019
    • April 2019
    • March 2019
    • February 2019
    • January 2019
    • December 2018
    • November 2018
    • October 2018
    • September 2018
    • August 2018
    • July 2018
    • June 2018
    • May 2018
    • April 2018
    • March 2018
    • February 2018
    • January 2018
    • December 2017
    • November 2017
    • October 2017
    • September 2017
    • August 2017
    • July 2017
    • June 2017
    • May 2017
    • April 2017
    • March 2017
    • February 2017
    • January 2017
    • December 2016
    • November 2016
    • October 2016
    • September 2016
    • August 2016
    • July 2016
    • June 2016
    • May 2016
    • April 2016
    • March 2016
    • February 2016
    • January 2016
    • December 2015
    • November 2015
    • October 2015
    • September 2015
    • August 2015
    • July 2015
    • June 2015
    • May 2015
    • April 2015
    • March 2015
    • February 2015
    • January 2015
    • December 2014
    • November 2014
© 2021 Business Intelligence Info
Power BI Training | G Com Solutions Limited