Supervised Knowledge May Hurt Novel Class Discovery Performance

 

Ben Dai (CUHK)

(Joint work with Li, Otholt, Hu, Meinel, and Yang)

IMS China 2024

 

 

 

NCD Background

Novel class discovery (NCD) is a machine learning task that aims to discover new classes in unlabeled data, i.e., classes that were not available during training, by leveraging knowledge from a labeled set of known classes.

Liu, Ziwei, et al. "Large-scale long-tailed recognition in an open world." CVPR. 2019.
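As a concrete illustration of the data setup, the sketch below builds an NCD split: the known classes come with labels, while the novel classes appear only as unlabeled images. This is a hypothetical example (CIFAR-10 with an arbitrary 5/5 known/novel split chosen purely for illustration), not the split used in the talk.

```python
# Minimal sketch of an NCD data split (hypothetical: CIFAR-10, arbitrary split).
import numpy as np
from torchvision.datasets import CIFAR10

data = CIFAR10(root="./data", train=True, download=True)
labels = np.array(data.targets)

known_classes = [0, 1, 2, 3, 4]   # labeled set: classes seen during training
novel_classes = [5, 6, 7, 8, 9]   # unlabeled set: classes to be discovered

labeled_idx = np.where(np.isin(labels, known_classes))[0]    # (X, Y) available
unlabeled_idx = np.where(np.isin(labels, novel_classes))[0]  # only X available
# NCD trains on both subsets, but the novel-class labels stay hidden; the goal
# is to cluster the unlabeled images into the novel classes.
```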

How can we borrow supervised knowledge and break the category constraint?

NCD: Existing Methods

Vaze et al. (CVPR 2022) Generalized Category Discovery

Fini et al. (ICCV 2021) A Unified Objective for Novel Class Discovery

What makes the implementation of NCD possible?

Supervised info: \( \mathbf{X} \mid Y \)

Unsupervised info: \( \mathbf{X} \)
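A rough sketch of how these two sources of information are typically combined in a single training objective is given below. This is a generic illustration, not the exact loss of Vaze et al. or Fini et al.; the encoder/head names and the self-labeling rule are assumptions made for the sketch.

```python
# Generic NCD-style objective: supervised loss on labeled known classes plus a
# self-labeling (clustering) loss on the unlabeled novel classes.
import torch
import torch.nn.functional as F

def ncd_loss(encoder, head_lab, head_unlab, x_lab, y_lab, x_unlab):
    # Supervised info X | Y: cross-entropy on the labeled (known) classes.
    z_lab = encoder(x_lab)
    loss_sup = F.cross_entropy(head_lab(z_lab), y_lab)

    # Unsupervised info X: self-labeling on the unlabeled (novel) classes,
    # here with temperature-sharpened soft assignments as pseudo-targets
    # (a simplified stand-in for the clustering objectives used in practice).
    z_unlab = encoder(x_unlab)
    logits_u = head_unlab(z_unlab)
    with torch.no_grad():
        pseudo = F.softmax(logits_u / 0.5, dim=1)  # sharpened pseudo-targets
    loss_unsup = -(pseudo * F.log_softmax(logits_u, dim=1)).sum(dim=1).mean()

    return loss_sup + loss_unsup
```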

DL:

DL typically assumes that more data is better, and the focus lies in designing different network structures to effectively utilize the available data.

STAT:

Statisticians strive to clarify when and how data can be used effectively under various assumptions.

An interesting question:
Is more (supervised) data necessarily better?

From a practical perspective, we would like to propose a metric that serves as a reference for deciding which data to use, thereby avoiding unnecessary training of large, time-consuming models.

NCD: Outline

DL:

  • More data is better...
  • Design a DL architecture

STAT:

  • Under this kind of assumption, you should ...

Roadmap: Step 1 → Step 2 → Step 3

NCD: Metric

Suppose we learn a mapping \(\mathbf{p}\) from the training samples. How do we measure the effectiveness of \(\mathbf{p}\)?

Recall: MMD

Muandet et al. (2020) Kernel Mean Embedding of Distributions: A Review and Beyond
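For reference, here is a minimal NumPy sketch of the standard unbiased estimator of the squared maximum mean discrepancy (MMD^2) with an RBF kernel, as reviewed in Muandet et al. (2020). The bandwidth choice is arbitrary and for illustration only.

```python
# Unbiased MMD^2 estimator between two samples of embeddings (rows).
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    """k(a, b) = exp(-gamma * ||a - b||^2) for all pairs of rows in A and B."""
    sq_dists = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    return np.exp(-gamma * sq_dists)

def mmd2_unbiased(X, Z, gamma=1.0):
    """Unbiased MMD^2 between samples X ~ P and Z ~ Q."""
    m, n = len(X), len(Z)
    Kxx = rbf_kernel(X, X, gamma)
    Kzz = rbf_kernel(Z, Z, gamma)
    Kxz = rbf_kernel(X, Z, gamma)
    # Exclude diagonal terms for the unbiased within-sample averages.
    term_xx = (Kxx.sum() - np.trace(Kxx)) / (m * (m - 1))
    term_zz = (Kzz.sum() - np.trace(Kzz)) / (n * (n - 1))
    term_xz = Kxz.mean()
    return term_xx + term_zz - 2.0 * term_xz
```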

Yet, in practice, the novel-class labels \(Y_u\) are unknown...
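One way a "pseudo" version of the metric could be obtained is to replace the unknown \(Y_u\) with clustering pseudo-labels. The sketch below is purely illustrative: the use of k-means and the pairwise-MMD aggregation are assumptions for this sketch, not the talk's exact definition of (pseudo) transfer flow. It reuses `mmd2_unbiased` from the MMD sketch above; all inputs are NumPy arrays of embeddings.

```python
# Hypothetical sketch: approximate the unknown Y_u with k-means pseudo-labels,
# then average class-conditional MMD^2 between labeled and unlabeled classes.
import numpy as np
from sklearn.cluster import KMeans

def pseudo_transfer_flow(emb_lab, y_lab, emb_unlab, n_novel, gamma=1.0):
    # Step 1: pseudo-labels for the unlabeled embeddings.
    y_pseudo = KMeans(n_clusters=n_novel, n_init=10).fit_predict(emb_unlab)

    # Step 2: average pairwise MMD^2 between labeled and pseudo-labeled classes
    # (lower values = higher cross-set similarity, i.e. more transferable info).
    scores = []
    for c_lab in np.unique(y_lab):
        for c_unlab in range(n_novel):
            scores.append(
                mmd2_unbiased(emb_lab[y_lab == c_lab],
                              emb_unlab[y_pseudo == c_unlab], gamma)
            )
    return float(np.mean(scores))
```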

NCD: Benchmark

Roadmap: Step 1 → Step 2 → Step 3

Conclusion: semantic similarity and accuracy are consistent with each other, which suggests the proposed benchmark is sound.

Conclusion: semantic similarity, accuracy, and (pseudo) transfer flow are consistent with one another, which suggests the proposed metric is sound.

NCD: Supervised Info May Hurt

Roadmap: Step 1 → Step 2 → Step 3 → Step 4

(Figure: suboptimal NCD performance)

Conclusion: Supervision information with low semantic relevance may hurt NCD performance.

Conclusion: pseudo transfer flow can serve as a practical reference for deciding which labeled data to use in NCD.

Application: Data Selection
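A hypothetical sketch of how such a metric could drive data selection: score each candidate labeled subset against the unlabeled data and keep only the most semantically related ones. The ranking rule, the `keep` threshold, and the function names below are illustrative assumptions (reusing `pseudo_transfer_flow` from the earlier sketch), not the talk's exact recipe.

```python
# Hypothetical data selection: keep the labeled subsets most related to the
# unlabeled data, as measured by the (pseudo) metric sketched above.
def select_labeled_subsets(candidates, emb_unlab, n_novel, keep=2):
    """candidates: list of (name, emb_lab, y_lab) tuples of candidate subsets."""
    scored = [
        (name, pseudo_transfer_flow(emb_lab, y_lab, emb_unlab, n_novel))
        for name, emb_lab, y_lab in candidates
    ]
    scored.sort(key=lambda t: t[1])             # lower discrepancy = more similar
    return [name for name, _ in scored[:keep]]  # train NCD on these subsets only
```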

Application: Data Combining

Contribution

  • We find that using supervised knowledge from the labeled set may lead to suboptimal performance on NCD datasets with low semantic similarity between the labeled and unlabeled sets. Based on this finding, we propose two practical methods that achieve ∼3% and ∼5% improvements on CIFAR-100 and ImageNet, respectively, compared to SOTA.
  • We introduce a theoretically reliable metric to measure the semantic similarity between the labeled and unlabeled sets. Mutual validation between the proposed metric and the benchmark suggests that the metric strongly agrees with NCD performance.
  • We establish a comprehensive benchmark with varying degrees of difficulty based on ImageNet by leveraging its hierarchical semantic similarity.

Thank you!
