Vladimir Iglovikov
Sr Data Scientist at TrueAccord
PhD in Physics
Kaggle top 100
Train
168
Public test
81
Private test
81
TXT
Google Drive
public class PathImageSegmentation {
public String getURL() {
return "https://drive.google.com/uc?export=download&id=6T4qSMyyIThaubkNPdFREZThzRXc";
}
}
https://nizhib.com/posts/image-segmentation
binarization
| CV | LB |
|---|---|
| 836533 | 754414 |
| 868841 | 792269 |
| 885779 | 784527 |
Why?
Train set is NOT 168 patients with 500x500, but 42 with 1000x1000 => random split leads to data leak!
Solution:
I did not do it :(
Train 1000x1000
Test 500x500
Test
162 x 500 x 500
=>
30 x 1000 x 1000
+
42 x 500 x 500
patches from train
| Private Test |
|---|
| smudge |
| n01z3 |
| vkassym |
| dulyanov |
| ternaus |
| pfr |
| ZFTurbo |
| nizhib |
| EgorLakomkin |
| albu |
| ywi4ebyrawi |
| eagle4 |
| Public Test |
|---|
| smudge |
| pfr |
| EgorLakomkin |
| vkassym |
| ualabs |
| nizhib |
| albu |
| zaq1xsw2tktk |
| forcesh |
| ZFTurbo |
| n01z3 |
| ternaus |
$10,000
$7,000
$5,000
$3,000
$1,000
UNet => 0.26
For practice Carvana Image Masking Challenge (ends in 3 weeks)
Software
PyTorch + OpenCV
Hardware