

Strengthen Anti-Money Laundering with Machine Learning

Published by Gbaf News

Posted on June 29, 2018


Last updated: January 21, 2026


H.P. Bunaes, Director of Banking Practice at DataRobot

I don’t know that I’ve come across a problem better suited to machine learning than anti-money laundering (AML) in banking. For many other predictive applications, banks find that the availability of data for machine learning is an issue. The data may not exist, or if it does, it may be of dubious quality. Not so in AML.

Banks have been collecting client information as part of their Customer Identification Programs (CIP) and Know Your Customer (KYC) programs for years.

Transaction details and suspicious activity alerts are already in place and accessible. The findings and outcomes of investigations are available as well: specifically, whether a suspicious activity report (SAR) was filed or not. Together, these constitute ideal training data for machine learning, which can be used to solve many of the common challenges throughout a typical AML program.

The most immediate opportunity is to screen out false positives from suspicious activity alerts. Many banks use rules-based systems for detecting suspicious activity and generating alerts. Since the downside risk of not detecting truly suspicious activity is severe, these rules tend to be conservative: better to get too many alerts than too few. The result can be an overwhelming number of alerts, the bulk of which turn out to be innocuous activity. But narrowing the detection rules to reduce the number of alerts is too risky.


Fortunately, banks can use their own data and machine learning to filter out the vast majority of false positive alerts with little or no downside risk. Banks already know the outcome of each investigation: whether a SAR was filed or not. With this historical data, they can train a model to screen out false positives (or at the very least, assign them a lower priority) using the known outcomes. The model may learn, for example, to eliminate an alert for a particular combination of product, transaction size, KYC risk score, and location that has never resulted in a SAR.

To ease concerns about truly suspicious activity being missed, models can be tuned so that they produce zero false negatives on a sample held out from training. Even when tuned to “zero false negatives,” we have found that typically more than half of the false alarms can still be detected and eliminated. The false positive rate on the remaining population also falls significantly.

This model has been tuned to allow for zero false negatives in the out-of-training sample.
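One way to picture the "zero false negatives" tuning is as a choice of score cutoff on held-out alerts. The sketch below is illustrative only and assumes the model outputs a suspicion score per alert; the function names, scores, and outcomes are all hypothetical and are not taken from any specific product. It sets the cutoff at the lowest score of any alert that led to a SAR, then counts how many false positives fall below it.

```python
from typing import List, Tuple


def zero_fn_threshold(scores: List[float], sar_filed: List[bool]) -> float:
    """Return the highest cutoff that keeps every SAR alert at or above it."""
    sar_scores = [s for s, filed in zip(scores, sar_filed) if filed]
    if not sar_scores:
        raise ValueError("need at least one SAR outcome to tune the cutoff")
    return min(sar_scores)


def screening_summary(
    scores: List[float], sar_filed: List[bool], threshold: float
) -> Tuple[int, int, int]:
    """Count (false positives screened out, false positives kept, SARs missed)."""
    screened = kept = missed = 0
    for score, filed in zip(scores, sar_filed):
        if score < threshold:
            if filed:
                missed += 1      # a true SAR alert fell below the cutoff
            else:
                screened += 1    # an innocuous alert was filtered out
        elif not filed:
            kept += 1            # an innocuous alert still goes to an analyst
    return screened, kept, missed


# Hypothetical held-out alerts: model score and investigation outcome.
holdout_scores = [0.02, 0.05, 0.10, 0.15, 0.30, 0.35, 0.60, 0.80, 0.90]
holdout_sar = [False, False, False, False, True, False, False, True, True]

threshold = zero_fn_threshold(holdout_scores, holdout_sar)
screened, kept, missed = screening_summary(holdout_scores, holdout_sar, threshold)
print(f"cutoff={threshold}, screened={screened}, kept={kept}, missed={missed}")
```

A cutoff chosen this way guarantees zero missed SARs only on the sample used to choose it. In practice it would need to be checked on a separate held-out sample and revisited whenever the model is retrained.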


In addition to screening false positives, a trained model can point out data features (client or transactional data, for example) that are strong indicators of money laundering, based on the SARs generated when those patterns are evident. Using this information, banks can create “smart” rules to detect suspicious activity that are unique to their client set, their product set, and their investigation outcomes. These detection rules are also far more dynamic: they will self-adjust for changes in products and client behavior as the model is periodically retrained.

DataRobot offers features that let you interpret which factors impact the model most.
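As a rough illustration of ranking which factors matter most, the sketch below scores each alert attribute by information gain against the SAR outcome. This is a simple stand-in chosen for clarity, not the feature impact method of DataRobot or any other product, and the attribute names (`product`, `kyc_risk`, `channel`) and alert records are hypothetical.

```python
import math
from collections import Counter, defaultdict
from typing import Dict, List, Tuple

Alert = Dict[str, object]


def entropy(labels: List[bool]) -> float:
    """Shannon entropy (in bits) of a list of SAR outcomes."""
    n = len(labels)
    if n == 0:
        return 0.0
    result = 0.0
    for count in Counter(labels).values():
        p = count / n
        result -= p * math.log2(p)
    return result


def information_gain(rows: List[Alert], feature: str, label: str = "sar_filed") -> float:
    """Reduction in outcome entropy after splitting alerts by one feature."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[feature]].append(row[label])
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy([row[label] for row in rows]) - remainder


def rank_features(rows: List[Alert], features: List[str]) -> List[Tuple[str, float]]:
    """Sort features from most to least informative about SAR outcomes."""
    gains = [(f, information_gain(rows, f)) for f in features]
    return sorted(gains, key=lambda item: item[1], reverse=True)


def make_alert(product: str, kyc_risk: str, channel: str, sar_filed: bool) -> Alert:
    return {"product": product, "kyc_risk": kyc_risk,
            "channel": channel, "sar_filed": sar_filed}


# Hypothetical investigated alerts with known outcomes.
alerts = [
    make_alert("wire", "high", "online", True),
    make_alert("wire", "high", "branch", True),
    make_alert("wire", "low", "online", False),
    make_alert("wire", "low", "branch", False),
    make_alert("wire", "high", "online", True),
    make_alert("card", "low", "branch", False),
    make_alert("card", "low", "online", False),
    make_alert("card", "low", "branch", False),
]

ranked = rank_features(alerts, ["product", "kyc_risk", "channel"])
for name, gain in ranked:
    print(f"{name}: {gain:.3f}")
```

In this toy data the KYC risk score separates SAR from non-SAR alerts completely, so it ranks first. A real analysis would use far more data and would have to account for interactions between features, which a one-feature-at-a-time score like this ignores.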


Banks may also want to use these insights to validate their KYC process. Features of the data that are strong AML predictors can be built into the KYC procedure. Clients that fit these criteria, or whose product use is likely to “trip” an alert, would be subject to enhanced due diligence (a higher-than-normal degree of scrutiny upfront). KYC questions that are not good predictors of money laundering risk could be eliminated entirely.

Either way, banks have an ironclad defense of their detection rules and KYC process: their own data with rigorous analytics applied. In my experience, a data-based analysis is always easier to justify than an expert opinion. Expert opinions vary.

Finally, there are unsupervised machine learning algorithms that can be applied for anomaly detection. This can be a “last line of defense” to detect previously unseen activity for a client, or to identify a client acting completely unlike similar clients: unusual activity that might otherwise go undetected.
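To make the idea of flagging previously unseen activity concrete, here is a minimal sketch that needs no labeled outcomes. It uses a robust z-score (median and median absolute deviation) on one client's transaction amounts, which is a simple statistical stand-in rather than a full unsupervised learning algorithm. The amounts and the cutoff of 3.5 are assumptions for illustration.

```python
from statistics import median
from typing import List


def robust_z_scores(values: List[float]) -> List[float]:
    """Score each value by its distance from the median, scaled by the MAD."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        # Over half the values equal the median; this simple score cannot
        # rank outliers in that case, so report nothing as anomalous.
        return [0.0 for _ in values]
    # 0.6745 rescales the MAD so scores are comparable to standard z-scores
    # when the data is roughly normal.
    return [0.6745 * (v - med) / mad for v in values]


def flag_anomalies(values: List[float], cutoff: float = 3.5) -> List[int]:
    """Return the positions of values whose absolute robust z-score exceeds the cutoff."""
    return [i for i, z in enumerate(robust_z_scores(values)) if abs(z) > cutoff]


# Hypothetical transaction amounts for a single client, in order received.
client_amounts = [120.0, 95.0, 130.0, 110.0, 105.0, 98.0, 125.0, 9800.0]

flagged = flag_anomalies(client_amounts)
print("unusual transactions at positions:", flagged)
```

The same scoring could be applied across a peer group of similar clients instead of one client's history, to surface a client who behaves unlike the rest. A production system would more likely score many features at once with a dedicated anomaly detection model.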

Money launderers are smart, and AML programs need to be smarter to stay one step ahead. Machine learning is a great way to do just that.

About the Author:

H.P. Bunaes leads the banking practice at DataRobot, helping banks leverage AI and machine learning for predictive analytics and data mining. H.P. has 35 years’ experience in banking, with broad banking domain knowledge and deep expertise in data and analytics. Prior to joining DataRobot, H.P. held a variety of leadership positions at SunTrust and FleetBoston. H.P. is a graduate of the Massachusetts Institute of Technology, where he earned a master’s degree in Management Information Systems, and of Trinity College, where he earned a Bachelor of Science degree in Computer Science and Mechanical Engineering.
