How to create business value from data without risking anything?

Reading time: 7 min

I’m probably not telling you anything new when I say data is already the oil for company development and will be even more so in the coming decades. Growing your business, increasing customer satisfaction or analyzing if there are any upcoming threats at a sustainable pace is impossible without implementing data science techniques. The question rather is: what is the right technique for data projects? How can you check the feasibility of a solution and assess its business benefits quickly and cheaply – before implementing anything? 

The answer: A Proof of Concept (POC) based approach is the way to go! 

This article will not only help you understand what a data science POC is, but it will also introduce you to the exact steps you need to follow in order to fuel growth without risking anything! 

Risk-free & cost-effective – The right way to create business value from data

Data science is an essential part of any industry today, given the massive amounts of data being produced. Nevertheless, as a company providing successful data solutions in various industries, we often encounter confusion among our customers about how to apply a machine learning (ML) based solution to a business problem (or even how to give data-oriented support to an entire department or a company). 

It is important to know that analytical initiatives are often pilot projects entailing that their success and added value are difficult to predict at the beginning. Therefore, if we do not approach such an initiative carefully and with the proper methodology, the project can fail easily for a variety of reasons (e.g., no data, poor data quality, or simply due to a business issue that can not be modelled, etc.). 

The result?

Business decision makers will think of the project as a negative experience and will feel a deep regret about throwingmoney out of the window. 

So then: is there a better way?  

Yes, indeed! 

The best way to approach these projects is by: defining phases with gradually rising investments, which in return will mean an increasing guaranteed ROI (Return on Investment). Based on our 20 years’ of experience, the best solution is a Proof of Concept based approach that can and should be used at any level of data maturity.

What is a data science POC?

I am sure you know the meaning of “Proof of Concept” (just in case though: mini-projects that explore the feasibility of a system, product or service and its business benefits quickly and cheaply). But what does running a POC specifically for data science mean? 

In this case, a Proof of Concept helps you evaluate the value, relevance and feasibility of different data science solutions before they are implemented. A data science POC should always prove that a given system will provide widespread value to the company: that is, it’s capable of bringing a data-driven perspective to a range of the business’s strategic objectives. 

In terms of advantages, this methodology helps you with: 

  • Saving time and money by sustainably allocating the resources of the company only towards those projects, which will reliably create value.
  • Ensuring that the given solution meets specific needs or sets of predefined requirements.
  • Becoming a data-driven organization from the core. 
  • Receiving timely and valuable feedback from various stakeholders.
  • Giving your team the opportunity to get comfortable with change.

Plus, just some of the indirect advantages of a POC include:

  • POC connects experts from different areas and helps them get a better understanding of processes, verifying or rejecting hypotheses.
  • Colleagues, who participate in the project, can get a better understanding of dashboard tools or predictive methodological processes.
  • POC helps the team ask the right questions and transform these into “data language”.
  • It helps to change even strong opinions once they turn out to be false.
How to run a successful data science POC?

Let’s see what are the 5 essential elements of an efficient data science POC which can help you keep your project on track!

  1. Set up a team consisting of the right people

    I know, it sounds obvious, so who exactly do I mean?

    To run an efficient POC, you need to include people from all parts of the organization in your team. 

    • Data scientists and/or analysts will definitely be connected to the project. 
    • The IT team will also need to test the solution’s feasibility. 
    • Furthermore, any business teams involved with or impacted by the results of the project 
    • as well as end-users of the solution should be involved.

    However, be careful you don’t swing too far in the other direction and involve too many people, as this can slow down progress and efficiency. A few representatives from each team or group is generally sufficient.

  2. Define data-related business problems

    Now that your team is set up, the next and probably most important step is the definition of the business problems. 

    Ok, but how?

    Try to gather ideas together with your team for a variety of business issues. Once the list is complete, find the answers to the following questions: 

    • What does the current process look like?
    • Would the use of data in general or data science/machine learning techniques help with this business issue? If so, how?
    • Do you have the necessary data for a POC?
    • Where is the data stored? 
    • Are you willing to work on this use case with an external partner?
    • Will this use case help you make money, save money, or realize any other benefits? 

    The outcome of this step should be the definition of testable ideas (hypotheses). For example, the risk of churn can be determined by using customer data and past behavior patterns.

  3. Select business problems for the POC

    Regardless of the sector, there will be several business problems for which a data solution is conceivable. However, there is no capacity to develop a solution for all. Ideas should therefore be prioritized. Thus, your next step is to determine for which idea(s) it is the most worthwhile to start a POC project. 

    In our experience, there is no obstacle to running multiple POCs in parallel (if you have the capacity), but you should expect these threads to run independently. The big advantage of this is that you have a better chance of success, at the same time the downside is that attention is divided. 

    If capacities are low, it is worth considering speed when prioritizing the ideas. For example, you could define POC projects for business cases where you can validate the possibilities and pitfalls of an analytical solution relatively quickly.

  4. Set up clear deliverables

    In order to restrict a POC to a reasonable timeframe, you have to set up clear deliverables. Without them, no one can be really sure what to consider done or what to consider a success.

    Ideally, the final deliverable is the implementation of a data project based on the selected use case. However, it’s also worth setting up deliverables/milestones along the way. For example, individual teams could set up checkpoints to evaluate their subset of the project.

  5. Know when to stop

    The idea’s been tried and it brought the expected benefits?

    Excellent! It’s time to move on to implementation and incorporate the results into your day-to-day operations. 

    However, if the result is the opposite, don’t waste more resources on it! Draw conclusions and use them in the next idea.

    The best solution is to make a GO / NO-GO decision at about the third of every POC project. 

    This way, the process of developing a data-driven solution can be simplified and risks can be reduced. Use the following decision tree from the use case idea to day-to-day implementation:

Conclusion

Data science projects are complex and it is always a question whether they are feasible or not. 

A Proof of Concept is an excellent way for businesses to evaluate the viability of an idea and sustainably allocate their resources towards those projects, which will reliably create value.

By following the above steps and choosing the right partners for your project you can ensure a smooth and successful POC and allow your organization to move to implementation quickly while saving invaluable resources.

Interested in getting a great amount of valuable data insight, but not sure how to start? Book a 20-minute discussion with one of our experts to help you get started!

Have you got experience with data-based POC’s? What are your best practices?

[We are curious about your opinion: what do you think are the top mistakes businesses make when running a data science POC? Let’s chat about it in the comment section below!👇]

Authors:
Szabolcs Biró – Head of Advanced Analytics
Ákos Matzon – Advisory Team Leader

A risk free way to create business value from data

Reading time: 5 min

As a company providing data solutions, we often encounter a kind of confusion among our customers about how to apply a Machine Learning (ML) based solution to a business problem (or even how to give data-oriented support for an entire department or a company). One of the most important basic features of analytical initiatives is that they are often pilot projects, and their success and added value are difficult to predict at the beginning. Therefore, if we do not approach it carefully and with proper methodology, such an initiative can easily be a failure and burn into the minds of business decision-makers as a negative experience, a total waste of money. In case of analytical projects, an approach is required where phases are defined in a way that enables gradually higher investment and parallelly increasing guaranteed ROI (return on investment). Based on our experience, the best solution is a Proof of Concept-based approach that can and should be used at any level of data maturity.

PoC, i.e. Proof Of Concept

By PoC, we mean mini-projects that explore the feasibility of a (Machine Learning) solution and its business benefits quickly and cheaply.

Source: techopedia

Many times, a data project idea sounds very good in theory: it promises low costs and high benefits, and on a high level there is no sign of any obstacles. However, in reality, when the development has already begun, the project might fail for a variety of reasons (e.g., no data, poor data quality, or simply due to a business issue that is not able to be modelled, etc.). As a manager, it’s tempting to accept such good ideas with the promise of quick implementation, but it’s much more uncomfortable to face the fact that the dreamed-of ML app still doesn’t meet our expectations, plus we’ve thrown a small (or even bigger) amount of money out the window. The solution to minimizing such risks is the PoC-based approach, which means spending extra energy (and money) only on what is proven to be worth addressing.

The life cycle of ML-based solutions

The figure below illustrates how an idea is integrated into business processes.

As a first step, we need to be aware of what use cases and data related business problems are available. If these are not available, it is worth stepping back and reconsidering the main pain points of the company’s operations (e.g., reducing costs, stopping customer dropouts, increasing margins, etc.). Once these pain points are given, testable ideas (hypotheses) can be defined, for example: risk of churn can be determined using customer data and past behavior patterns.

Regardless of the sector, there will be several business problems for which a data solution is conceivable, but there is no capacity to implement all the ideas. Ideas should therefore be prioritized, i.e. it should be determined for which idea(s) it is the most worthwhile to start a PoC project. In our experience, there is no obstacle to running multiple PoCs in parallel (if you have the capacity or, for example, to test multiple suppliers), but you should expect these threads to run independently. The big advantage of this is that you can have two strings to your bow, so you have a better chance of success, but obviously, the attention is divided, so it is worth deciding on parallelization depending on the available capacities. It is also worth considering speed through prioritization, i.e. to define PoC projects for those business cases where you can validate the possibilities and pitfalls of an analytical solution relatively quickly.

If the PoC is successful, i.e., the idea is tried and brings the expected benefits, it is worth moving on to implementation and incorporating the results into day-to-day operations. In case the result is the opposite, stop and draw conclusions, don’t waste more resources on the idea.

Principles of the methodology:
  1. Planning – Let’s spend time selecting an idea for PoC, specifically checking feasibility in terms of data availability and quality.
  2. Toolbox – No need for a cloud data warehouse or sandbox, but sufficient amount of good quality data is really needed
  3. Open-mindedness – It is crucial that the organization be open to such a solution, and not as an attack against expertise, as experts could feel “a machine will not tell me what to do”.
  4. Expertise – When acquiring business knowledge, it is worth involving experts, asking for help and not competing with them with an algorithm.
  5. Lessons Learned – If a PoC is not successful, draw conclusions and use them in the next idea.
  6. Multiple attempts – If possible, try multiple problems (e.g. prediction, optimization, anomaly detection, etc.)
  7. Tensions – Be prepared for the fact that a new solution can replace old processes, and you have to deal with these changes.
  8. Simplicity – To maintain motivation and address expectations, start with an easier idea that is expected to be successful, and then deal with riskier ones.
Arguments against PoC, difficulties

Although there are many arguments in favor of the PoC approach, we also have some against to highlight:

  1. You can’t just launch PoC projects forever. It is important that the data “project funnel” be strong and that integrated data applications are born, and there is not only experiment all the time.
  2. Scalability. It is not always clear how a successful PoC project can be scaled to an implemented solution. This aspect must be taken into account from the beginning.
  3. Dealing with excessive expectations. It is important that experts and business decision-makers involved in the process know what to expect. The result of a PoC is not a final solution, not a classic project, and the realization of financial benefits typically come in a follow-up project.
  4. Take the time. One of the advantages of PoCing is that it is fast, but that doesn’t mean you can hurry. Depending on the complexity of the PoC, the lead time can take from 2 weeks to up to half a year, and the typical lead time is 10 weeks based on our experience.
  5. Buy-in. Without the commitment and motivation of the related business area, it is not possible to design and operate an ML solution successfully.
  6. Quantify benefits. It is often unclear how the potential benefits will be accurately demonstrated (e.g. savings due to an unforeseen event). Significant energies need to be allocated for thorough and fair re-measurement and evaluation.

Szabolcs Biró – Head of Advanced Analytics
Ákos Matzon – Advisory Team Leader