AI data could be tainted even as it’s being cleaned

clean-window — Data cleansing efforts should be properly documented, says Capital One's Hanif

- By Dan DeFrancesco
- 12 Nov 2018

Tweet
Facebook
LinkedIn
Save this article
Send to
Print this page

Companies cleaning the data they’re using for their machine learning models could unintentionally adulterate it in the process, one expert has said.

“Anytime you touch the data before it enters your algorithm, there is absolutely always the risk that it removes something that has contextual information, and you don’t know it yet,” said Zachary Hanif, principal machine learning engineer at Capital One, who spoke on a panel on data science at the Risk USA conference in New York on November 9.

Only users who have a paid subscription or are part of a corporate subscription are able to print or copy content.

To access these options, along with all other subscription benefits, please contact info@risk.net or view our subscription options here: http://subscriptions.risk.net/subscribe

You are currently unable to print this content. Please contact info@risk.net to find out more.

You are currently unable to copy this content. Please contact info@risk.net to find out more.

You may share this content using our article tools. Printing this content is for the sole use of the Authorised User (named subscriber), as outlined in our terms and conditions - https://www.infopro-insight.com/terms-conditions/insight-subscriptions/

If you would like to purchase additional rights please email info@risk.net

You may share this content using our article tools. Copying this content is for the sole use of the Authorised User (named subscriber), as outlined in our terms and conditions - https://www.infopro-insight.com/terms-conditions/insight-subscriptions/

If you would like to purchase additional rights please email info@risk.net