Hello SSN.

Arvind Ram

SSN CSE 2013 Batch

Machine Learning

Why  ML ?

To make RIGHT 

decisions

People can make RIGHT decisions!

Correct. But...

Hard to make large number of them

Decision fatigue

Leads to WRONG decisions

Birth of ML Algorithms

Decisions need DATA

Data needs Privacy

Data reveals your identity

Identity loss can have serious consequences

Ex: Gender Discrimination

Ex: Caste Discrimination

Ex: Theft Targets

We need to protect identity

From who ?

External hackers

Internal hackers

How ?

Multi level access control for extenral hackers

Internal hackers/employees

Data Anonymization

Data Anonymization

Reference: https://trustarc.com/blog/wp-content/uploads/Hash_information_tableData-anonymization-blog.png

Case Study

Netflix Case Study

Reference: https://www.youtube.com/watch?v=gI0wk1CXlsQ

Netflix Case Study

Reference: https://www.youtube.com/watch?v=gI0wk1CXlsQ

Netflix Case Study

Reference: https://www.youtube.com/watch?v=gI0wk1CXlsQ

MA Medical Records

Reference: https://www.youtube.com/watch?v=gI0wk1CXlsQ

Unique ID

Reference: https://www.youtube.com/watch?v=gI0wk1CXlsQ

Differential Privacy

 Alter data

 Won't we lose truth ?

Differential Privacy

Reference: https://towardsdatascience.com/understanding-differential-privacy-85ce191e198a

Use the data after removing noise (25%)

Plausible Deniability

$$$ LEGAL $$$

Facebook Lawsuit

Reference: https://dailyillini.com/news/2020/02/20/facebook-pay-illinois-550-million/

Facebook Lawsuit

Reference: https://dailyillini.com/news/2020/02/20/facebook-pay-illinois-550-million/

Ethics

Helps Make RIGHT decisions

Be good. Do good.

What is RIGHT ?

Depends...

On ?

On ?

Result of a decision

WRONG decisions result in bias

Microsoft Tay.ai

Microsoft Tay.ai

Reference: https://twitter.com/geraldmellor/status/712880710328139776

How to avoid this ?

  • Foresee the complete potential of the solution
  • Blacklist sensitive topics
  • Need to eliminate such traces at training data level 
  • Strong ethical testing rules

 

Google Vision

Google Vision Detection

Reference: https://algorithmwatch.org/en/story/google-vision-racism/

How to avoid this ?

  • Know your users better
  • Diversity in dataset
  • Strong ethical testing rules

Amazon Recruitment

Amazon Recruitment Tool

Reference: https://fortune.com/2018/10/10/amazon-ai-recruitment-bias-women-sexist/

How to avoid this ?

  • Never transfer human bias into dataset
  • Clearly define and analyze decisions 
  • Strong ethical testing rules

Engineer Responsibly!

Thank You

Privacy & Ethics in ML

By arvind ram

Privacy & Ethics in ML

SSN Talk

  • 215