The leading knowledge platform for the financial technology industry
The leading knowledge platform for the financial technology industry

A-Team Insight Blogs

Opinion: How Machine Learning Solves Client Onboarding, by Adam Devine, WorkFusion

Brute force can’t solve onboarding. Machine learning can. Here’s how.

Throwing headcount at the ubiquitous client onboarding problem hasn’t worked. It’s still a mess. Offshoring just sweeps the mess under the rug, and the expression, “misery loves company” is a good way of describing the impact of utility plays. Every onboarding solution on the market today just shifts the burden and expense of data analyst headcount to different places and P&Ls, and because every bank has a slightly (or very) different onboarding process, there will never be a single off-the-shelf solution for every bank. The key to reducing the risk and cost of onboarding is automating the manual, repetitive human work within every unique process. Are we suggesting that every financial institution kick off its own enormous custom IT project? Certainly not. That’s a well-worn page from 1998’s playbook. Cloud-based machine learning is the answer in 2015, and it’s radically improving the way data is collected, validated, and enriched.

What are the challenges of traditional automation?

Let’s first talk about machines that don’t learn, otherwise known as rules-based automation. Scrapers, optical character recognition (OCR), and parsers are common Rules-Based Automation (RBA) point solutions for data collection. They work well if the underlying business process and data sources never change, but change is the only constant in an onboarding process. RBA requires upfront programming and configuration by IT, and it’s impossible to program every potential variable in a data collection process or account for variations in client documents. When the process or sources change, IT must re-write the rules. The process halts while the point solution is repaired, and business continuity is compromised. This is why outsourcing begins where RBA ends – it’s just easier to have headcount do the work than it is to constantly repair and maintain automation.

What are the challenges of headcount?

A human-only data analyst workforce is an expensive, fixed and finite resource, unable to elastically scale up to meet bursts of demand or scale down in troughs. Skilled as they may be, people naturally make mistakes, and sometimes these mistakes are incredibly costly. Supplementing or replacing a full-time equivalent (FTE) workforce with outsourced workers provides moderate cost relief and incrementally more scalability, but as both data volumes and global labor rates rise, the benefits of business and knowledge process outsourcing (BPO / KPO) fall.

What is machine learning?

You’ve heard of it, and you may even be able to describe it, but let’s define it here just to be thorough. Machine learning is computer programming that can program itself by watching humans perform a task. With sufficient repetition of a method of work, machine-learning algorithms can identify a consistent pattern and program itself to replicate it. Machine learning and its parent field, artificial intelligence, will significantly change and improve knowledge work. The most relevant improvement for banks will be the incremental automation of data collection, validation, and enrichment, which will reduce reliance on offshore labor, improve risk profiles, increase controls and data accuracy, speed up processes, and radically reduce the cost of run-the-bank.

Why hasn’t machine learning solved onboarding yet?

Machine learning is a little like a Formula 1 racecar. It’s incredibly sexy and thrillingly fast, but it’s also very complex and difficult to maintain. Machine learning first and foremost requires a lot of high quality data to train algorithms. G-SIBS generate plenty of data for algorithm training, but then there’s the problem of selecting and testing the right general purpose learning algorithms and modeling them in an effective way for a given task – like plucking IBANs from standard settlement instructions or monitoring SEC documents on EDGAR for changes. The selection, modeling, and training process can take years, and no bank has that much time (or budget) for an IT project.

How can machine learning solve onboarding today?

Rather than every bank building its own racecar and test track, the speed, accuracy and efficiency benefits of machine learning are now available as a software service. WorkFusion does for machine learning what Amazon Web Services did for cloud computing, turning an expensive, challenging capability into an on-demand service. WorkFusion is an end-to-end platform for optimizing and automating data collection. The software invisibly pairs machine-learning algorithms with FTE or outsourced human data analysts doing business-as-usual work. This pairing, sometimes referred to as human-in-the-loop computing, incrementally automates the repetitive tasks within processes.

17 of the top 20 global data vendors use WorkFusion to perform mission critical data collection work for entities and instruments, corporate actions, and risk and compliance. Global banks have begun to use WorkFusion for a wide variety of data collection processes, from SSI extraction to BCBS 239 compliance to, most importantly, client onboarding and AML/KYC. WorkFusion spent four years and a considerable amount of its investors’ money building a world-class, enterprise-proven software platform that brings the benefits of machine learning to the financial industry.

Onboarding is only one of thousands of applications that banks will discover and deploy in the coming years, but it is certainly the one of the first big messes that machine learning will clean up for the financial industry. For more information about WorkFusion, swing by the company’s booth at A-Team’s Data Management Summit (DMS) in London this Wednesday and New York on April 16th, check out, or email

By Adam Devine, VP, Product Marketing & Strategic Partnerships, WorkFusion

Related content


Recorded Webinar: Developing operational resilience

Financial institutions’ operational resilience – essentially the ability to prevent, adapt and respond to, and recover and learn from operational disruptions – has come under extreme pressure during the coronavirus pandemic, with last year’s March lockdown creating unprecedented circumstances for financial firms. Employees working from home raised the stakes, as they still do, adding to...


Mirato Raises $9 Million to Fund AI-Based Supplier Risk Platform

In another indicator of the market’s acceptance of AI, Tel Aviv-based Mirato has raised $9 million to support the development and commercialization of its third-party risk management (TPRM) platform. With regulators intensifying their scrutiny of financial institutions’ relationships with external suppliers – with regulations like GDPR addressing privacy concerns, for instance – Mirato’s offering helps...


TradingTech Summit Virtual

TradingTech Summit (TTS) Virtual will look at how trading technology operations can capitalise on recent disruption and leverage technology to find efficiencies in the new normal environment. The crisis has highlighted that the future is digital and cloud based, and the ability to innovate faster and at scale has become critical. As we move into recovery and ‘business as usual’, what changes and technology innovations should the industry adopt to simplify operations and to support speed, agility and flexibility in trading operations.


RegTech Suppliers Guide 2020/2021

Welcome to the second edition of A-Team Group’s RegTech Suppliers Guide, an essential aid for financial institutions sourcing innovative solutions to improve their regulatory response, and a showcase for encumbent and new RegTech vendors with offerings designed to match market demand. Available free of charge and based on an industry-wide survey, the guide provides a...