It was 2017

Computer Vision for IVF

What?

Select which cells to put in cold storage for future IVF

Why?

Storing cells is expensive, and most won't fertilize, so why keep them?

The task (tech.)

Binary classification
About 300 (initially) - 700 (later) images/class
Uniform background, grayscale images

How bad can it be?

Initial estimation: ~3w until the first good prototype

(70% accuracy)

Positive Class

Negative Class

Positive Class

Negative Class

Wait! There's more!

No clear human baseline
No support from SME

What was the initial result?

Transfer learning InceptionV3

52% Accuracy

What was the best result after 6 months?

A combination of custom architecture, a bit more data, data augmentation and 1D temporal convolutions

69.2% Accuracy

The best result after ~1 year?

72% Accuracy

With 3x data, a lot of custom image pre-processing, and a professor and a PhD student from a well known US university

We tried a loooot of stuff

Several Self-Supervised Learning schemes
Transfer Learning with a few models
A lot of data augmentation trickery
Several custom NN architectures
Tried classic CV based on similar research in bovine cells selection

Lessons learned

Data is king
Preprocessing is queen
Simple problems can be rock hard
Estimation is useless

Fun story time

One day, we tried Google AutoML Vision (it was still in closed beta). The CEO was helpful and did data augmentation beforehand.

We ran AutoML full training cycle.

And got 98% accuracy.

... hmm

We ran AutoML full training cycle.

And got 98% accuracy.

The CEO was ready to inform the stakeholders that we cracked it.

But, the best result before that was 69.2%.

Using occluded images, it turned out that the network was predicting garbage.

Indeed something was wrong.

Because CEO was helpful and did data augmentations beforehand but hadn't specified the train/test/valid splits, we had a data leak.

It was 2017

Computer Vision for IVF

What?

Select which cells to put in cold storage for future IVF

Why?

Storing cells is expensive, and most won't fertilize, so why keep them?

The task (tech.)

How bad can it be?

Initial estimation: ~3w until the first good prototype

(70% accuracy)

Positive Class

Negative Class

Positive Class

Negative Class

Wait! There's more!

What was the initial result?

52% Accuracy

What was the best result after 6 months?

69.2% Accuracy

The best result after ~1 year?

72% Accuracy

We tried a loooot of stuff

Lessons learned

Fun story time

... hmm

We wasted ~1.5k USD and 2 months

But could have wasted more

More lessons learned

AI Interview slides

AI Interview slides

Alexandru Burlacu

It was 2017

Computer Vision for IVF

What?

Select which cells to put in cold storage for future IVF

Why?

Storing cells is expensive, and most won't fertilize, so why keep them?

The task (tech.)

How bad can it be?

Initial estimation: ~3w until the first good prototype

(70% accuracy)

Positive Class

Negative Class

Positive Class

Negative Class

Wait! There's more!

What was the initial result?

52% Accuracy

What was the best result after 6 months?

69.2% Accuracy

The best result after ~1 year?

72% Accuracy

We tried a loooot of stuff

Lessons learned

Fun story time

... hmm

We wasted ~1.5k USD and 2 months

But could have wasted more

More lessons learned

AI Interview slides

More from Alexandru Burlacu