Data-Driven Solutions for Trend Mining in Open Big Data: A Multi-Domain
Approach

Dima Kagan

Beer-Sheva

Beer-Sheva

Beer-Sheva

  • 31 Faculty Members
  • More than 300 master students
  • About 100 Ph.D. students

This presentation

 

 

by GPT-Chat.

WAS

MADE

This presentation

 

 

by GPT-Chat.

WAS

MADE

not

Me and MY Research

  • Make a positive impact
  • Can make someone's life better
  • Has the potential to change policies
  • Can inspire more research
  • Can change regulations
  • Practical!

If what you are working on is not important, and it is not likely to lead to important things, why are you working on it?

Richard Hamming

 

 DATA

DRIVEN

SOLUTIONS

The data revolution

Zooming Into Video Conferencing Security & Privacy

Video conferencing Privacy & Security

Information Leakage

 

 

Malware Attacks

 

 

Data Breach

 

Fake Avatars

 

 

Phishing Attacks

 

 

Zoombombing

Multiparty Privacy

Credit: Such, Jose M., and Natalia Criado. "Multiparty privacy in social media." Communications of the ACM 61.8 (2018): 74-81.

89,305 Zoom related tweets

 

90,395 Zoom related Instagram posts

16,133 images

Dima Kagan, Galit Fuhrmann Alpert, and Michael Fire. “Zooming into video conferencing privacy and security threats”. In: IEEE Transactions on Computational Social Systems (2022)

Dima Kagan, Galit Fuhrmann Alpert, and Michael Fire. “Zooming into video conferencing privacy and security threats”. In: IEEE Transactions on Computational Social Systems (2022)

Dima Kagan, Galit Fuhrmann Alpert, and Michael Fire. “Zooming into video conferencing privacy and security threats”. In: IEEE Transactions on Computational Social Systems (2022)

Dima Kagan, Galit Fuhrmann Alpert, and Michael Fire. “Zooming into video conferencing privacy and security threats”. In: IEEE Transactions on Computational Social Systems (2022)

What could possibly happen in zoom meeting

What could possibly happen in zoom meeting

Face

Detection

Face

Tracking

Feature

Extraction

Change

Detection

Group

Change

Detection

Shmuel Horowitz, Dima Kagan, Galit Fuhrmann Alpert, Michael Fire Interruptions detection in video conferences  2023

Face

Detection

Face

Tracking

Feature

Extraction

Change

Detection

Group

Change

Detection

Shmuel Horowitz, Dima Kagan, Galit Fuhrmann Alpert, Michael Fire Interruptions detection in video conferences  2023

Face

Detection

Face

Tracking

Feature

Extraction

Change

Detection

Group

Change

Detection

Shmuel Horowitz, Dima Kagan, Galit Fuhrmann Alpert, Michael Fire Interruptions detection in video conferences  2023

Face

Detection

Face

Tracking

Feature

Extraction

Change

Detection

Group

Change

Detection

😃

🙂

🙂

🙂

Shmuel Horowitz, Dima Kagan, Galit Fuhrmann Alpert, Michael Fire Interruptions detection in video conferences  2023

Face

Detection

Face

Tracking

Feature

Extraction

Change

Detection

Group

Change

Detection

Single Parcticipent Change Detection

🙂

🙂

🙂

🙁

🙁

🙁

🙂

Shmuel Horowitz, Dima Kagan, Galit Fuhrmann Alpert, Michael Fire Interruptions detection in video conferences  2023

Face

Detection

Face

Tracking

Feature

Extraction

Change

Detection

Group

Change

Detection

Shmuel Horowitz, Dima Kagan, Galit Fuhrmann Alpert, Michael Fire Interruptions detection in video conferences  2023

Face

Detection

Face

Tracking

Feature

Extraction

Change

Detection

Group

Change

Detection

Feature Extaction Change Detection Recall Precision TNR FPR
Embedding 128-dim Arima
Statistical Profiling
50.1%
42.9%
83.2%
75.0%
98.1%
97.8%
0.9%
1.4%
Expressions 7-dim Arima
Statistical Profiling
71.6%
72.5%
90.5%
92.3%
99.2%
99.1%
0.5%
0.8%
Deep-face expressions Majority 67.1% 84.8% 97.7% 2.6%
Method Results

Automatic Large Scale Detection of Red Palm Weevil Infestation using Street View Images

Dima Kagan, Galit Fuhrmann Alpert, Michael Fire

ISPRS Journal of Photogrammetry and Remote Sensing

Red Palm Weevil

South American Palm Weevil

Palm Weevils

Palm Weevil is a type of beetle that infests palm trees, lays eggs and feeds on  the palm's tissue, creating tunnels inside the tree trunk that weaken its structure, causing extensive damage resulting in tree decline and eventual breakage. The beatles can travel up to 50 km a day, rapidly spreading geographically and raising tremendous risks to spatially widespread locations.

 

The Problem

  • Today, the Red Palm Weevil has spread to 85 different countries and regions worldwide.
     
  • In fact, the Food and Agriculture Organization of the United Nations estimates that in 2023, the combined cost of pest management and replacement of damaged palm trees, in Spain and Italy alone, will reach 200 million Euro.

Photo taken by Omer Tsur - AirWorks aerial photography

Palm Weevils Life Cycle

How Does It Look

How Does It Look

Existing Solutions

Publicly Available Data + Computer Vision

●Applicable at large scale.

●Relatively affordable.

●Can be used on any point in the world.

●Can be used anywhere especially in urban areas.

Street View

  • Provide 360 degree images.

  • Contain historical data.

  • World wide availability.

  • Many vendors:

    • Google

    • Microsoft

    • Yandex

    • Apple

    • Kartaview

    • etc.

Google Street View

  • 0.007 USD per each (7.00 USD per 1000).

  • Each image contains up to 120 degrees out of the panorama.

  • Max image size is 640x640.

  • We estimate that fully mapping of San Diego area using only street view images should cost around 51,000$.

Aerial Imagery

  • Cheaper, sometimes even free.
  • Show larger areas than street view.

Aerial Palm Detection

\theta=atan2(y_{s}-y_{a}, x_{s}-x_{a})
(x_{s}, y_{s})
(x_{a}, y_{a })
\theta

Palm Tree Detection & Classification

(x_{s}, y_{s})
(x_{a}, y_{a })

N

Street VS Aerial Mapping

  • 546 aerial images.

  • 756 aerial detected palm trees.

  • 4,544 street level images.

  • 72% match between aerial images and street level detected palm trees.

Normal Heights Village, San Diego

Case Study  

  • Has a Palm Weevil problem.

  • Has ground truth data of infested palm trees (reports).

  • Has historical data in Google Street View.

Analysis

  • Scanning the area of the ground truth data we detected 5,008 trees.
  • Our classifier classified five of the six ground truth trees as infested.
  • Additionally, out of 5,008 detected palms, in the classifier detected additional 13 infested palm trees, of which we identified eight at advanced infestation stages.

Searching on a Large Scale

22,438 aerial images

54,781 street view images

 36,001 palm trees

109 were classified as infested

24 were infested at an advanced stage

Jun 2014

Nov 2017

April 2018

April 2019

Jun 2014

Nov 2017

April 2018

April 2019

Jun 2014

Nov 2017

April 2018

April 2019

Jun 2014

Nov 2017

April 2018

April 2019

Limitations & Challenges

  • Different species of palms trees has different symptoms.
  • Fertilizer may delay the visual appearance of the symptoms.
  • Aerial images are not taken at the same time that of street view images.
  • Street view images are not taken at constant time intervals, and only at sporadic time points.
  • When classifying vegetation there may be external or seasonal variables that can affect how the plant looks and as result influence the classifier.
  • The current method and data do not support the detection of borderline cases or detecting the severity of the infestation.

Future Directions

Future Directions

Future Directions

Future Directions

UNSUPERVISED Graph Anomaly detection

Labeling Data is Hard

Malicious Users Tend to Connect to Other Profiles Randomly

Link Prediction

+

Crowd Wisdom

outline

image/svg+xml 1 2 3 4 5 6 7 9 8 10 11 0.94
image/svg+xml 1 2 3 4 5 6 7 9 8 10 11 0.94 0.84 0.56 0.12 0.22 0.44 0.16 0.32 0.91 0.72 0.59 0.14 0.23
image/svg+xml 1 2 3 4 5 6 7 9 8 10 11 0.91 0.84 0.76 0.56 0.36 0.16 0.34 0.12 0.32 0.44 0.31
image/svg+xml 1 2 3 4 5 6 7 9 8 10 11 0.91 0.84 0.76 0.56 0.36 0.16 0.34 0.12 0.32 0.44 0.31

Kagan, Dima, Yuval Elovichi, and Michael Fire. "Generic anomalous vertices detection utilizing a link prediction algorithm." Social Network Analysis and Mining 8 (2018): 1-13

Anomalous Communities detection 

Shay Lapid, Dima Kagan, and Michael Fire. “Co-Membership-based Generic Anomalous Communities Detection”. In: Neural Processing Letters (2022).

Kagan, Dima, Thomas Chesney, and Michael Fire. "Using data science to understand the film industry’s gender gap." Palgrave Communications 6.1 (2020): 1-16.

subtitles to graph

Data SCIENCE

 

 

Network Science

 

Constructing Movie Social Networks

Constructing Movie Social Networks

Constructing Movie Social Networks

W=W+1

Challenges

 

Am I to understand you're still hanging
around with Dr. Emmett Brown, McFly?

Marty McFly

Lorraine McFly

George McFly

15,540

Subtitles

Cast Data

Cast Data

Who heard about the coronavirus before 2019?

Scientometric Trends for Coronaviruses and Other Emerging Viral Infections

Dima Kagan, Jacob Moran-Gilad, Michael Fire

GigaScience

Motivation

  • In the past hundred years, the human population quadrupled and the word became more connected than ever before.

 

 

  • 9,728                         1,270,406

     
  • The potential for disease spread is immense.

Datasets

  • Microsoft Academic Graph - 210 million papers
  • PubMed - 29 million medical papers
  • SJR - 34,000 journals
  • Wikidata - 78 million items

Thank you questions?

Publications

  • Dima Kagan, Galit Fuhrmann Alpert, and Michael Fire. “Zooming into video conferencing privacy and security threats”. In: IEEE Transactions on Computational Social Systems (2022)
  • Dima Kagan, Galit Fuhrmann Alpert, and Michael Fire. “Automatic large scale detection of red palm weevil infestation using street view images”. In: ISPRS Journal of Photogrammetry and Remote Sensing 182 (2021), pp. 122–133
  • Dima Kagan, Jacob Moran-Gilad, and Michael Fire. “Scientometric trends for coronaviruses and other emerging viral infections”. In: GigaScience 9.8 (Aug. 2020).
  • Dima Kagan, Thomas Chesney, and Michael Fire. “Using data science to understand the film industry‘s gender gap”. In: Palgrave Communications 6.1 (2020), pp. 1–16.ima Kagan, Thomas Chesney, and Michael Fire. “Using data science to understand the
    film industry‘s gender gap”. In: Palgrave Communications 6.1 (2020), pp. 1–16.

Publications

  • Dima Kagan, Michael Fire, and Galit Fuhrmann Alpert. “Trends in Computer Science Research within European Countries: Independent and Collaborative Contributions”. In: Communications of the ACM (2022).
  • Shay Lapid, Dima Kagan, and Michael Fire. “Co-Membership-based Generic Anomalous Communities Detection”. In: Neural Processing Letters (2022).
  • Michael Fire, Rami Puzis, Dima Kagan, and Yuval Elovici. “Large-Scale Shill Bidder Detection in E-commerce”. The 27th International Database Engineered Applications Symposium, 2023.

UNDER REVIEW

  • Dima Kagan, Mor Levy, Michael Fire, and Galit Fuhrmann Alpert. “Ethnic Representation Analysis of Commercial Movie Posters”. 2022.
  • Horowitz, Shmuel, Dima Kagan, Galit Fuhrmann Alpert, and Michael Fire. "Interruptions detection in video conferences." 2023

Israeli - Czech Conference

By Dima Kagan

Israeli - Czech Conference

  • 64