Currently, given a (B, T, C, D) input, we compress the signal via a sequence of layers.
Are these the best approaches?
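A minimal sketch of what this compression could look like. The actual layers are unspecified in these notes, so each "layer" here is just a strided temporal average-pool that halves T; this is an assumed stand-in, not the real stack.

```python
import numpy as np

def compress(x: np.ndarray, n_layers: int = 3) -> np.ndarray:
    """Halve the time axis n_layers times via average pooling.

    Shapes: B=batch, T=time, C=EEG channels, D=feature dim.
    """
    for _ in range(n_layers):
        B, T, C, D = x.shape
        T2 = T // 2
        # Drop a trailing odd sample, then pool pairs of time steps.
        x = x[:, : T2 * 2].reshape(B, T2, 2, C, D).mean(axis=2)
    return x

x = np.random.randn(4, 1024, 22, 8)
y = compress(x)  # time axis halved three times: 1024 -> 128
```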
So far, with end-to-end training, Wav2vec2 + ShallowNet achieves ~65% on a single subject/single session but drops to ~45% under proper evaluation.
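"Proper evaluation" presumably means no subject overlap between train and test. A minimal sketch of a subject-wise split, with a hypothetical recording-to-subject mapping for illustration:

```python
import random

# Hypothetical metadata: recording id -> subject id (illustrative only).
recordings = {f"rec{i}": f"subj{i % 5}" for i in range(20)}

def subject_wise_split(recordings, test_frac=0.2, seed=0):
    """Hold out whole subjects so no subject leaks across splits."""
    subjects = sorted(set(recordings.values()))
    rng = random.Random(seed)
    rng.shuffle(subjects)
    n_test = max(1, int(len(subjects) * test_frac))
    test_subjects = set(subjects[:n_test])
    train = [r for r, s in recordings.items() if s not in test_subjects]
    test = [r for r, s in recordings.items() if s in test_subjects]
    return train, test

train, test = subject_wise_split(recordings)
# No subject appears in both splits.
assert not {recordings[r] for r in train} & {recordings[r] for r in test}
```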
- Uses Temple University Hospital (TUH) EEG corpus
- Selected only 22 channels
- Processed 20,000 EEG recordings (19,000 train / 1,000 val; ~5,656 hrs)
- Split each EEG signal into N chunks, each of dimension C × T.
- Each chunk is treated as an independent sample.
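The chunking step above can be sketched as follows; the non-overlapping split and the drop-the-remainder policy are assumptions, as the notes don't specify them:

```python
import numpy as np

def chunk_recording(signal: np.ndarray, T: int) -> np.ndarray:
    """Split a (C, total_T) recording into N non-overlapping (C, T) chunks.

    Any trailing samples that don't fill a full chunk are dropped.
    Returns an array of shape (N, C, T), one independent sample per chunk.
    """
    C, total_T = signal.shape
    N = total_T // T
    return signal[:, : N * T].reshape(C, N, T).transpose(1, 0, 2)

rec = np.random.randn(22, 2500)       # 22 channels, e.g. 10 s at 250 Hz
chunks = chunk_recording(rec, T=500)  # -> (5, 22, 500)
```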
- Introduces AdaCT: adapters that convert time-series data into spatio-temporal 2D pseudo-images or text.
- Then uses pretrained image/text models.
- No direct EEG pretraining is involved.
- Looks like questionable work (I didn't even understand how they convert it to text)
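The paper's exact conversion is unclear from these notes; one plausible reading of "spatio-temporal 2D pseudo-image" is stacking per-channel magnitude spectrograms. This sketch is an assumption, not AdaCT's actual method:

```python
import numpy as np

def pseudo_image(signal: np.ndarray, win: int = 64, hop: int = 32) -> np.ndarray:
    """Turn a (C, T) signal into a (C, win//2+1, n_frames) 'pseudo-image'.

    Each channel becomes a magnitude spectrogram via framed rFFT; channels
    stack like image planes. Purely illustrative guess at the conversion.
    """
    C, T = signal.shape
    n_frames = 1 + (T - win) // hop
    frames = np.stack(
        [signal[:, i * hop : i * hop + win] for i in range(n_frames)], axis=-1
    )                                           # (C, win, n_frames)
    return np.abs(np.fft.rfft(frames, axis=1))  # (C, win//2+1, n_frames)

img = pseudo_image(np.random.randn(22, 1024))   # -> (22, 33, 31)
```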