2024年5月13日, 15:00 | 辽宁 · 沈阳 · 东北大学
王赫 (He Wang)
hewang@ucas.ac.cn
中国科学院大学 · 国际理论物理中心(亚太地区)
中国科学院大学 · 引力波宇宙太极实验室(北京/杭州)
On behalf of the LIGO-VIRGO-KAGRA collaborations
In 1916, A. Einstein proposed the GR and predicted the existence of GW.
Gravitational waves (GW) are a strong field effect in the GR.
2015: the first experimental detection of GW from the merger of two black holes was achieved.
2017: the first multi-messenger detection of a BNS signal was achieved, marking the beginning of multi-messenger astronomy.
2017: the Nobel Prize in Physics was awarded for the detection of GW.
As of now: more than 90 gravitational wave events have been discovered.
O4, which began on May 24th 2023, is currently in progress.
Gravitational waves generated by binary black holes system
GW detector
LIGO-VIRGO-KAGRA network
2017 Nobel Prize in Physics
The first GW event of GW150914
—— Bernard F. Schutz
DOI: 10.1063/1.1629411
GWTC-3
—— Bernard F. Schutz
DOI: 10.1063/1.1629411
GWTC-3
©Floor Broekgaarden (repo)
GW Data characteristics
Noise: non-Gaussian and non-stationary
Signal:
(Earth-based) A low signal-to-noise ratio (SNR) which is typically about 1/100 of the noise amplitude (-60 dB).
(Space-based) A superposition of all GW signals (e.g.: 104 of GBs, 10∼102 of SMBHs, and 10∼103 of EMRIs, etc.) received during the mission's observational run.
Matched filtering techniques (匹配滤波方法)
In Gaussian and stationary noise environments, the optimal linear algorithm for extracting weak signals
LIGO-VIRGO-KAGRA
LISA / Taiji project
2016年,AlphaGo 第一版发表在了 Nature 杂志上
2021年,AI预测蛋白质结构登上 Science、Nature 年度技术突破,潜力无穷
2022年,DeepMind团队通过游戏训练AI发现矩阵乘法算法问题
《达摩院2022十大科技趋势》将 AI for Science 列为重要趋势
“人工智能成为科学家的新生产工具,催生科研新范式”
2023年,DeepMind发布AI工具GNoME (Nature),成功预测220万种晶体结构
AI for Science:为科学带来了模型与数据双驱动的新的研究范式
AI + 数学、AI + 化学、AI + 医药、AI + 量子、AI + 物理、AI + 天文 ...
AlphaGo 围棋机器人
AlphaTensor 发现矩阵算法
AlphaFold 蛋白质结构预测
验证数学猜想
Pioneering works utilizing CNN
AI for Science → AI for GW Astronomy
Exported: Oct, 2023 (in preparation)
PRL, 2018, 120(14): 141103.
PRD, 2018, 97(4): 044039.
Matched-filtering Convolutional Neural Network (MFCNN)
MLGWSC-1
The majority of AI algorithms used for testing are highly sensitive to non-Gaussian real noise backgrounds, resulting in high false positive rates.
(MFCNN group) H.W., et al. PRD (2023)
CL.M., W.W., H.W., et al. PRD (2022)
Ensemble learning
Leverages statistical approaches to utilize more information for making informed decisions by combining multiple models.
Real-time GW searches for GW150914
H.W., et al. PRD (2020)
Expanding the dimension of the output
CL.M., W.W., H.W., et al. PRD (2023)
Introduction to Speed and Efficiency
The Need for Integration (an AI pipeline!)
Case study: Pipeline
Aframe
S.S. Chaudhary, et al. arXiv:2308.04545
Challenges and Future Directions
Case study: Pipeline
Aframe
OpenLVEM, June 08, 2023. Low Latency UPDATE.
Beyond Speed: Generalization and Discovery in GW Detection
Real-time GW searches for GW150914
He Wang, et al. PRD 101, 10 (2020): 104003
He Wang, et al. MLST. 5, 1 (2024): 015046.
Beyond Speed: Generalization and Discovery in GW Detection
He Wang, et al. PRD 101, 10 (2020): 104003
He Wang, et al. MLST. 5, 1 (2024): 015046.
Real-time GW searches for GW150914
["This", "is", "a", "sample"]
Strain
Whiten
Normalized
∼10−19
∼102
∼100
32 s
32 s
merger
tc (around GW150914)
(Cal network SNR)
Band-pass: [20, 2048] Hz
Patching (tokenized) with size 0.125 s and overlap 50%
[1, 128, 256]
(Standard normalization)
dynamic masking
[1, 16512]
[1, 128, 256]
(PSDi from noise)
Band-pass: [20, 2048] Hz
WaveFormer
MSE-Lossi
std
[1, 128, 256]
Noisei:
Signali:
Inputi:
Labeli:
Outputi:
8.0625 s
8.0625 s
Given �=ℎ+�d=h+n, we can normalize �d as follows:
(Bottom panels: results of glitches)
(Upper panels: results of pure noise)
Time-series and spectrogram example of blip.
(Upper panels: Signal amplitude recovery performance
(Bottom panels: Signal phase recovery performance)
Bacon P. et al. arXiv: 2205.13513
GW191204_171526
Firstly, we obtain the denoised output by utilizing Waveformer. Then, triggers are defined and identified by three steps including,
Find Peaks. Locate triggers on a single detector by finding its maximum all local-maximum (0.2s away from neighboring maximum/local-maximum).
An search algorithm for GW require that: [cite: 2010.07244]
the same signal is seen in the detectors; (the same signal is seen by time-shifting in single detector)
the same waveform must be present both detectors;
and the signal’s time of arrival must be consistent with the GW travel time between the observatories.
Firstly, we obtain the denoised output by utilizing Waveformer. Then, triggers are defined and identified by three steps including,
Find Peaks. Locate triggers on a single detector by finding its maximum all local-maximum (0.2s away from neighboring maximum/local-maximum).
By constraining triggers that exist on both two detectors, we get VALID triggers. (consist 3~4 segments)
Firstly, we obtain the denoised output by utilizing Waveformer. Then, triggers are defined and identified by three steps including,
Find Peaks. Locate triggers on a single detector by finding its maximum all local-maximum (0.2s away from neighboring maximum/local-maximum).
By constraining triggers that exist on both two detectors, we get VALID triggers. (consist 3~4 segments)
Calculate the correlation of the to-be-evaluated trigger across channels or within a single channel, between its noisy and corresponding denoised segments, as well as between denoised segments themselves.
noisy input segments
denoised output segments
Hˉ
Lˉ
H
L
Firstly, we obtain the denoised output by utilizing Waveformer. Then, triggers are defined and identified by three steps including,
Find Peaks. Locate triggers on a single detector by finding its maximum all local-maximum (0.2s away from neighboring maximum/local-maximum).
By constraining triggers that exist on both two detectors, we get VALID triggers. (consist 3~4 segments)
Calculate the correlation of the to-be-evaluated trigger across channels or within a single channel, between its noisy and corresponding denoised segments, as well as between denoised segments themselves.
(PyCBC) Davies, et al. PRD 2020
Ours
Assessed denoising workflow performance by comparing with GWTC-1, GWTC-2, GWTC2.1, and GWTC-3 catalogs and associated data releases.
Noted significant divergence in IFAR distribution between our results and those from GWTC and OGC catalogs.
Achieved significant IFAR improvement across all 75 reported BBH events, indicating effective suppression of loud terrestrial noise.
Example: For low SNR (10.8−0.4+0.3) event GW200208_130117, obtained an IFAR of 8916 years, surpassing maximum IFAR of <4000 years in other catalogs.
Variability in IFAR improvement linked to the original data's noise nature, including its non-Gaussian, non-stationary characteristics, and different signal recognition strategies by pipelines.
IFAR performance significantly depends on the reduction of non-Gaussian noise near each event.
Events with substantial IFAR improvement had misleading non-Gaussian noise effectively eliminated.
Events where IFAR underperforms retained non-Gaussian characteristics, possibly due to WaveFormer's inherent systematic errors.
Challenges in Model Interpretability
He Wang, et al. MLST. 5, 1 (2024): 015046.
Menéndez-Vázquez A, et al. PRD 2021
Alfaidi & Messerger. arXiv:2402.04589
The negative log-likelihood cost function always strongly penalizes the most active incorrect prediction. And the correctly classified examples will contribute little to the overall training cost."
—— I. Goodfellow, Y. Bengio, A. Courville. Deep Learning. 2016. (book)
noise
noise + signal
GW151226
GW151012
LVK. arXiv:1602.03839
Exploring Beyond General Relativity
Harsh Narola, et al. “Beyond General Relativity: Designing a Template-Based Search for Exotic Gravitational Wave Signals.” PRD 107, 2 (2023): 024017.
Yu-Xin Wang, et al. "Draft in Progress"
iFAR [years]
iFAR [years]
Sensitivity dfistance [Mpc]
B. P. Abbott et al. (LIGO-Virgo), PRD 100, 104036 (2019).
Credit: LIGO Magazine.
Traditional parameter estimation (PE) techniques rely on Bayesian analysis methods (posteriors + evidence)
Bayesian statistics
Data quality improvement
Credit: Marco Cavaglià
LIGO-Virgo data processing
GW searches
Astrophsical interpretation of GW sources
PRL 127, 24 (2021) 241103.
PRL 130, 17 (2023) 171403.
Nature Physics 18, 1 (2022) 112–17
Big Data Mining and Analytics 5, 1 (2021) 53–63.
A diagram of prior sampling between feature space and physical parameter space
(Based on 1912.02762)
【【机器学习】白板推导系列(三十三) ~ 流模型(Flow based Model)】
The main idea of flow-based modeling is to express y∈RD as a transformation T of a real vector z∈RD sampled from pz(z):
Note: The invertible and differentiable transformation T and the base distribution pz(z) can have parameters {ϕ,ψ} of their own, i.e. Tϕ and pz,ψ(z).
Change of Variables:
Equivalently,
The Jacobia JT(u) is the D×D matrix of all partial derivatives of T given by:
base density
target density
(Based on 1912.02762)
base density
target density
Rational Quadratic Neural Spline Flows
(RQ-NSF)
Train
nflow
归一化流模型示意图
Test
nflow
Train
nflow
Simulation-Based Inference (SBI)
PRL 127, 24 (2021) 241103.
PRL 130, 17 (2023) 171403.
Real-time gravitational wave science with neural posterior estimation
Sampling with prior knowledge for high-dimensional gravitational wave data analysis
He Wang, et al. Big Data Min. Anal. (2021)
PRD 108, 4 (2023): 044029.
Neural Posterior Estimation with Guaranteed Exact Coverage: The Ringdown of GW150914
arXiv:2310.13405, LIGO-P2300306
Cosmological Inference using Gravitational Waves and Normalising Flows
Fast Parameter Inference on Pulsar Timing Arrays with Normalizing Flows
arXiv:2310.12209
He Wang, et al. (2024)
Normalizing Flows as an Avenue to Studying Overlapping Gravitational Wave Signals
PRL 131, 17 (2023): 171403.
Angular Power Spectrum of Gravitational-Wave Transient Sources as a Probe of the Large-Scale Structure
PRD 108, 4 (2023): 044029.
Appreciating the Ringdown Overtone Test of GW150914
arXiv:2404.14286
進撃のnflow model in GW inference area.
2002.07656: 5D toy model [1] (PRD)
2008.03312: 15D binary black hole inference [1] (MLST)
2106.12594: Amortized inference and group-equivariant neural posterior estimation [2] (PRL)
2111.13139: Group-equivariant neural posterior estimation [2]
2210.05686: Importance sampling [2] (PRL)
2211.08801: Noise forecasting [2] (PRD)
2305.17161: FMPE
2404.14286: eccentricity of BBHs
https://github.com/dingo-gw/dingo (2023.03)
Exploring Stochastic Gravitational Wave Background with AI
Our result (preliminary)
Exploring Stochastic Gravitational Wave Background with AI
Abbott R, et al. PRD 104, 2 (2021): 022004.
PyGWB result
Our result (preliminary)
AI or Bayes
Text-to-image
"A running dog"
AI or Bayes
Text-to-image
"A corgi running on the street"
A picture is worth a thousand words.
A fraction of a thousand words.
Credit: 李宏毅
"A running dog"
On-going
Insights
Statistics
Statistics
On-going
Insights
Statistics
Statistics
for _ in range(num_of_audiences):
print('Thank you for your attention! 🙏')
This silde: https://slides.com/iphysresearch/2024may_neu
引力波天文学 & 引力波多信使天文学
Chinese Journal of Space Science, 2023, 43(4): 589-599.
LIGO-G2300554
Nat. Astron. 2021, 5(9): 881-889.
空基引力波探测科学数据的分析与地基相比差距很大:
空间引力波观测频段内含有大量的波源和多种波源类型:
天琴计划
Credit: ESA, K. Holley-Bockelmann
Credit: Minghui Du
空间太极计划
(Sec.8.3.1 Red Book)
空间引力波探测获得的是什么样的 (科学) 数据?
Analyses cannot treat sources independently and sequentially work through a list of candidate detections.
空间引力波探测获得的是什么样的 (科学) 数据?
其他重要的科学数据处理的技术挑战:
随机噪声(Stochastic noise)
仪器瞬变(glitches)
频谱线(Spectral lines)
数据间断(Data gaps)
非平稳性(Non-stationarities)
Baghi et al., Phys. Rev. D (2019)
Analyses cannot treat sources independently and sequentially work through a list of candidate detections.
Mock Data Challenges
波源模板
MNRAS 488, L94–L98 (2019)
EMRI 波形模板需求量 40 个数量级以上
Marsat et al. PRD 103, 8 (2021)
Our results indicate that the existing numerical relativity waveforms are as accurate as 99% with respect to space-based detectors including LISA, Taiji and Tianqin. Such accuracy level is comparable to the one with respect to LIGO.
(ZW, JJZ, ZJC, arXiv:2401.15331)
Bayes' theorem:
探测器响应
在时域中计算的挑战性在于,在每个采样点处都需要计算波形和响应,如果考虑到不同的 TDI 组合方式,则计算的时间复杂度将进一步增大。频域中的 TDI 响应形式,可简单概况为:
其中 α∈{+,×},如果要考虑高阶模的贡献,则 α=ℓm 。tα(f) 描述了时间与引力波瞬时频率的关系,可通过
计算, 其中 Ψα 表示 α 模式频域波形的相位。 T 对时间的依赖关系反映了探测器轨道运动对信号的调制效应,如右图所示。调制效应为响应的建模和计算增加了复杂性,但同时也有助于在参数估计中解除外禀参数之间的简并,提升对波源的定位精度。
Bayes' theorem:
TDI-A
TDI-A
TDI-E
TDI-T
Credit: Minghui Du
数据噪声
Addressing Instrumental Imperfections
数据间断(Data gaps)
瞬态噪声事件(glitches)
频谱线(Spectral lines)
非平稳性(Non-stationarities)
不完美校准(imperfect calibration)
(Sec.8.3.3 Red Book)
Sasli et al., Phys. Rev. D (2023)
Baghi et al., Phys. Rev. D (2019)
似然函数建模
Bayes' theorem:
搜索技术
UCB
GPU-based
...
Karnesis et al. 2303.02164.
Hoy & Nuttall. 2312.13039.
Weaving et al. CQG 41, (2023)
Strub et al., PRD. arXiv:2307.03763
Bayes' theorem:
Nat. Astron. 2022, 6(12): 1356-1363.
Nat. Astron. 2022, 6(12): 1334-1338.
背景与挑战
全局拟合方法 (Global-fit method)
实践应用步骤
潜在局限性
全局拟合
Global-fit
全局拟合
Pipeline | Targets | Programing Language (sampling method) | Comments |
---|---|---|---|
GLASS (Littenberg&Cornish 2023) |
Noise, UCB, VGB, MBHB |
C / Python (TPMCMC / RJMCMC) | noise_mcmc+gb_mcmc+vb_mcmc+global_fit |
Eryn | UCB | Python (TPMCMC / RJMCMC) | No code for UCB case |
PyCBC-INFERENCE | MBHB | Python (?) | Unavailable |
Bilby in Space / tBilby | MBHB / ? | ? / Python? (RJMCMC) | Unavailable |
Strub et al. | UCB | ? (GP) | Unavailable / GPU-based |
Zhang et al. (LZU) | UCB | ? (PSO) | MLP |
Bayes' theorem:
Nat. Astron. 2022, 6(12): 1334-1338.
Nat. Astron. 2022, 6(12): 1356-1363.
(Sec.8.6 Red Book)
超高维度的波源参数空间特性 (编码波形)
科学数据的动态性 (编码数据)
资源优化挑战 (CPU vs GPU)
F(t) over 1 year
h(t) over 10 min
y(t) over 1 year
多类型的大量波源混叠问题
Actually, there are more ...
Credit: Maude Le Jeune (2021)
MCMC采样的高效性和收敛性
multi-agentic reasoning
超高维度的波源参数空间特性 (编码波形)
科学数据的动态性 (编码数据)
资源优化挑战 (CPU vs GPU)
F(t) over 1 year
h(t) over 10 min
y(t) over 1 year
多类型的大量波源混叠问题
Actually, there are more ...
Credit: Maude Le Jeune (2021)
MCMC采样的高效性和收敛性
multi-agentic reasoning
for _ in range(num_of_audiences):
print('Thank you for your attention! 🙏')
This silde: https://slides.com/iphysresearch/2024may_neu
h(θ):=∑iαi(θ)ei≡α(θ),
where α∈C241 and reduced basis {ei} with ⟨ei∣ej⟩=δij.
深度学习算法的学习目标:
(Mc,η)∈Θ⊂R2
(αr,αi)∈R482
Neural
Network
AAK - FastEMRIWaveforms (FEW)
Katz et al., Phys. Rev. D (2021)
Chua et al., Phys. Rev. Lett., (2021)
~1s (快 ≳104 倍)
MNRAS 488, L94–L98 (2019)
EMRI 波形模板需求量 40 个数量级以上
张雪婷, C. Messenger, N. Korsakova,
ML Chan, 胡一鸣, 张建东, Phys. Rev. D (2022)
赵天宇, 周阅, 施锐俊, 曹周键, 任智祥, arXiv:2308.16422
恽倩芸, 韩文标, 郭意扬, 王赫, 杜明辉, arXiv:2309.06694
Zhang et al. PRD (2022) |
Zhao et al. (2308.16422) |
Yun et al. (2309.06694) |
|
---|---|---|---|
TDI | - | TDI-1.5 | TDI-2.0 |
Duration | 3 months | 1 year | 0.5 year |
Waveform Family (train) | AK | AAK | AAK |
Waveform Family (test) | AK / AAK | AK / AAK | AAK |
GW Project | TianQin | LISA | Taiji |
Acceleration Noise [fm/sqrt(Hz)] | 1 | 3 | 3 |
OMS Noise [pm/sqrt(Hz)] | 1 | 15 | 8 |
Base Model | CNN | CNN | CNN |
Input Feature domain | time | frequency | time-frequency |
sampling rate | 1/30 Hz | 1/15 Hz | 1/10 Hz |
Yun et al. (2311.18640) |
---|
TDI-2.0 |
0.5 year |
AAK |
AAK / EOB |
Taiji |
3 |
8 |
Unet / VGG |
time-frequency |
1/10 Hz |
Zhang et al. PRD (2022) |
Zhao et al. (2308.16422) |
Yun et al. (2309.06694) |
|
---|---|---|---|
TDI | - | TDI-1.5 | TDI-2.0 |
Duration | 3 months | 1 year | 0.5 year |
Waveform Family (train) | AK | AAK | AAK |
Waveform Family (test) | AK / AAK | AK / AAK | AAK |
GW Project | TianQin | LISA | Taiji |
Acceleration Noise [fm/sqrt(Hz)] | 1 | 3 | 3 |
OMS Noise [pm/sqrt(Hz)] | 1 | 15 | 8 |
Base Model | CNN | CNN | CNN |
Input Feature domain | time | frequency | time-frequency |
sampling rate | 1/30 Hz | 1/15 Hz | 1/10 Hz |
赵天宇*, Ruoxi Lyu*, 王赫, 曹周键, 任智祥, Commun. Phys., (2023)
"One Model to Rule Them All":EMRI / MBHB / GBs / SGWB 的信号提取
王赫, 吴仕超, 曹周键, 刘骁麟, 朱建阳,
Phys. Rev. D, (2020)
阮文洪*, 王赫*, 刘畅, 郭宗宽,
Phys. Lett. B, (2023)
LDC 一年数据上对 MBHB (+GBs) 信号的信号探测
杜明辉*, 梁博*, 王赫†, 徐鹏, 罗子人, 吴岳良†, accepted by SCPMA, arXiv:2308.05510
阮文洪, 王赫, 刘畅, 郭宗宽,
Universe (2023)
亮点:
杜明辉*, 梁博*, 王赫†, 徐鹏, 罗子人, 吴岳良†, accepted by SCPMA, arXiv:2308.05510
阮文洪, 王赫, 刘畅, 郭宗宽,
Universe (2023)
亮点:
for _ in range(num_of_audiences):
print('Thank you for your attention! 🙏')
This silde: https://slides.com/iphysresearch/2024may_neu