Francois Lanusse
SOS 2026 Summer School, Aussois, June 2026
astro-ph abstracts mentioning Deep Learning, CNN, or Neural Networks
The vast majority of these results has relied on supervised learning and networks trained from scratch.
=> Limits in practice the ease of using deep learning for analysis and discovery
Conventional scientific workflow with deep learning
Conventional researchers @ CMU
Circa 2016
CMU DeepLens (Lanusse et al 2017)
Foundation Model-based Scientific Workflow
Already taken care of
... but how does it work?
Vaswani, Shazeer, Parmar, Uszkoreit, Jones, Gomez, Kaiser, Polosukhin (2017)
import jax
import jax.numpy as jnp
import optax
import flax.linen as nn
class MLP(nn.Module):
features: int
@nn.compact
def __call__(self, x):
x = nn.Dense(self.features)(x)
x = nn.relu(x)
x = nn.Dense(self.features)(x)
return x
class TransformerBlock(nn.Module):
d_model: int
num_heads: int
@nn.compact
def __call__(self, x, mask):
z = nn.LayerNorm()(x)
z = nn.MultiHeadDotProductAttention(num_heads=self.num_heads)(z, mask=mask)
x = x + z
z = nn.LayerNorm()(x)
x += MLP(self.d_model)(z)
return x
class LightCurveTransformer(nn.Module):
d_model: int
num_heads: int
num_layers: int
@nn.compact
def __call__(self, x, mask):
# Building attention mask
attention_mask = nn.make_attention_mask(mask, mask)
# Positional embedding
pos = get_positional_encoding(self.d_model)(x)
# Embedding layer
x = nn.Dense(self.d_model)(x[:,:,1:5]) # Extracting the flux light curves
x += pos
# Transformer blocks
for i in range(self.num_layers):
x = TransformerBlock(self.d_model, self.num_heads)(x, attention_mask)
# Global average pooling
x = jnp.mean(x, axis=1)
# Output layer
x = nn.Dense(1)(x)
return x.squeeze()
More about positional embedding:
Text
i.e. BERT (Devlin et al. 2018)
Masked Auto Encoding (MAE)
Credit: (Liang et al. 2022)
Self-Supervised similarity search for large scientific datasets (Stein et al. 2021)
Project led by Alice Desmons, Francois Lanusse, Sarah Brough
PCA of patch features
Dense Semantic Segmentation
Dense Depth Estimation
Or what you can do when you do have independent views of an object...
Contrastive Language Image Pretraining (CLIP)
(Radford et al. 2021)
Shared information
Flamingo: a Visual Language Model for Few-Shot Learning (Alayrac et al. 2022)
Hierarchical Text-Conditional Image Generation with CLIP Latents (Ramesh et al. 2022)
Cosine similarity search
Image Similarity
Spectral Similarity
Image-Spectral Similarity
Supervised baseline
Or let's do one massive model of everything!
with extensive support from the rest of the team.
Project led by:
Francois
Lanusse
Liam
Parker
Jeff
Shen
Tom
Hehir
Ollie
Liu
Lucas
Meyer
Leopoldo
Sarra
Sebastian Wagner-Carena
Helen
Qu
Micah
Bowles
(Blanco Telescope and Dark Energy Camera.
Credit: Reidar Hahn/Fermi National Accelerator Laboratory)
(Subaru Telescope and Hyper Suprime Cam. Credit: NAOJ)
(Dark Energy Spectroscopic Instrument)
(Sloan Digital Sky Survey. Credit: SDSS)
(Gaia Satellite. Credit: ESA/ATG)
Field Embedding Strategy Developed for
Multiple Physics Pretraining (McCabe et al. 2023)
DES g
DES r
DES i
DES z
HSC g
HSC r
HSC i
HSC z
HSC y
Survey translation
Spectrum super-resolution
Adaptation at low cost
with simple strategies:
Trained on ->
Eval on ->
Inputs:
measured fluxes
Inputs:
measured fluxes + image
Segmenting central bar and spiral arms in galaxy images based on Galaxy Zoo 3D