王赫 (He Wang)
合作导师:何吉波 教授
Email: hewang@ucas.ac.cn
June 18, 2024 (Location: Beijing, China)
主要是你要考虑吴老师他们的背景,然后简明扼要,突出亮点。讲的时间不要超过30分钟,能20+讲完更好。感觉吴老师抽出一个小时都比较困难……
10+5 = 15min
space-based (2)
what is flow and flow-based (4)
how flow can be used in MCMC.
mini Global-fit + flow
Taiji
Tianqin
https://twitter.com/chipro/status/1768388213008445837?s=46&t=JmDXWgIucgr_FlsBFTvuRQ
DINGO+SEOBNRv4EHM找了3个ebbh
Evidence for eccentricity in the population of binary black holes observed by LIGO-Virgo-KAGRA
https://dcc.ligo.org/LIGO-G2400750
BEFORE
AFTER
LIGO-G2300554
In 1916, A. Einstein proposed the GR and predicted the existence of GW.
Gravitational waves (GW) are a strong field effect in the GR.
2015: the first experimental detection of GW from the merger of two black holes was achieved.
2017: the first multi-messenger detection of a BNS signal was achieved, marking the beginning of multi-messenger astronomy.
2017: the Nobel Prize in Physics was awarded for the detection of GW.
As of now: more than 90 gravitational wave events have been discovered.
O4, which began on May 24th 2023, is currently in progress.
Gravitational waves generated by binary black holes system
GW detector
The first GW event of GW150914
1400Ripples Air Compressor Blip
Extremely Loud Helix Koi Fish
Various types of Glitch
The improvement of data quality is a very complex issue, with data from over 20,000 sensor channels determining the quality of the gravitational wave science data channel.
Reducing non-Gaussian short-duration pulse interference (Glitches) in gravitational wave data will help reduce the false alarm rate of gravitational wave signals.
Removing Glitches from gravitational wave detection data is a multi-classification problem.
Ormiston R, et al. PRR, 2020
DeepClean: One-dimensional Convolutional Neural Network which takes a specified set of witness channels and subsequently outputs the predicted noise in strain.
IGWN data processing
Non-stationary
Non-Gaussianity
Background
Related Works
Model Structure
Precessing & Train
Effect on Noise
Effect on BBH signals
Chatterjee C, Wen L, et al. PRD 2021
Wei W and Huerta E A. PLB 2020
Bacon P. et al. arXiv: 2205.13513
GW170823
Murali C & Lumley D. PRD 2023
["This", "is", "a", "sample"]
Strain
Whiten
Normalized
∼\(10^{−19}\)
∼\(10^{2}\)
∼\(10^{0}\)
32 s
32 s
merger
\(t_c\) (around GW150914)
(Cal network SNR)
Band-pass: [20, 2048] Hz
Patching (tokenized) with size 0.125 s and overlap 50%
[1, 128, 256]
(Standard normalization)
dynamic masking
[1, 16512]
[1, 128, 256]
(PSD\(_i\) from noise)
Band-pass: [20, 2048] Hz
WaveFormer
MSE-Loss\(_i\)
\(std\)
[1, 128, 256]
Noise\(_i\):
Signal\(_i\):
Input\(_i\):
Label\(_i\):
Output\(_i\):
8.0625 s
8.0625 s
Given �=ℎ+�d=h+n, we can normalize �d as follows:
(Bottom panels: results of glitches)
(Upper panels: results of pure noise)
Time-series and spectrogram example of blip.
(Upper panels: Signal amplitude recovery performance
(Bottom panels: Signal phase recovery performance)
Bacon P. et al. arXiv: 2205.13513
GW191204_171526
Firstly, we obtain the denoised output by utilizing Waveformer.
Then, triggers are defined and identified by three steps including:
Find Peaks. Locate triggers on a single detector by finding its maximum all local-maximum (0.2s away from neighboring maximum/local-maximum).
By constraining triggers that exist on both two detectors, we get VALID triggers. (consist 3~4 segments)
Calculate the cross-correlation of the to-be-evaluated trigger across channels or within a single channel, between its noisy and corresponding denoised segments, as well as between denoised segments themselves.
noisy input segments
denoised output segments
\(\bar{H}\)
\(\bar{L}\)
\({H}\)
\({L}\)
AI
Assessed denoising workflow performance by comparing with GWTC-1, GWTC-2, GWTC2.1, and GWTC-3 catalogs and associated data releases.
Noted significant divergence in IFAR distribution between our results and those from GWTC and OGC catalogs.
Achieved significant IFAR improvement across all 75 reported BBH events, indicating effective suppression of loud terrestrial noise.
Example: For low SNR (\(10.8_{-0.4}^{+0.3}\)) event GW200208_130117, obtained an IFAR of 8916 years, surpassing maximum IFAR of <4000 years in other catalogs.
Variability in IFAR improvement linked to the original data's noise nature, including its non-Gaussian, non-stationary characteristics, and different signal recognition strategies by pipelines.
IFAR performance significantly depends on the reduction of non-Gaussian noise near each event.
Events with substantial IFAR improvement had misleading non-Gaussian noise effectively eliminated.
Events where IFAR underperforms retained non-Gaussian characteristics, possibly due to WaveFormer's inherent systematic errors.
He Wang, et al. MLST. 5, 1 (2024): 015046.
LVK. arXiv:1602.03839
GW151226
GW151012
Gravitational waves and sources:
Wang H, Du M H, Xu P, Zhou Y F. Sci Sin-Phys Mech Astron, 2024, 54, doi: 10.1360/SSPMA-2024-0087
Credit: ESA, K. Holley-Bockelmann
(Sec.8.3.1 The Red Book)
The analysis of scientific data from space-based GW detection differs significantly from ground-based detection:
Analyses cannot treat sources independently and sequentially work through a list of candidate detections.
The analysis of scientific data from space-based GW detection differs significantly from ground-based detection:
(Sec.8.6 The Red Book)
M. Du, B. Liang, HW, P. Xu, Z. Luo, Y. Wu. SCPMA 67, 230412 (2024).
Global vs. Individual Analysis: While global-fit techniques effectively manage the dense overlapping of signals in space-based GW data, individual pipelines are crucial for detecting unique events.
Role of Individual Pipelines: These pipelines act as a pre-processing step, focusing on particular types of sources and diving deeper into the data. They refine the analysis by working on the latest best-fit residuals from the global fit.
Case Study - MBHB Mergers: Mergers of MBHBs often exhibit high SNR between \(10^2\) to \(10^3\), appearing as distinct peaks in data time series.
Data curation
Model: frequency domain; PhenomD; TDI-A/E response
Input: 1 day length; 15Hz; shape=(2, 3, 2877)
Noise: Gaussian stationary from the noise PSD (for training/test) + GB confusion noise (for test)
Project: Taiji program
M. Du, B. Liang, HW, P. Xu, Z. Luo, Y. Wu. SCPMA 67, 230412 (2024).
The top section of the illustration shows the solar system barycenter (SSB) and Taiji frames, with two black dashed arrows symbolizing not two separate GW signals, but rather indicating how the sky location and arrival time of the same GW signal take different values in these two frames.
The “positive” problem translates the SSB-frame parameters to their Taiji-frame counterparts via a time-dependent mapping \(f_1\), then to the TDI outputs through a time-independent mapping \(f_2\), and an exponential term.
TDI-A
These steps can be schematically summarized as:
where \(\mathcal{T}_\alpha^{A, E}(f)\) is often referred to as the transfer function.
M. Du, B. Liang, HW, P. Xu, Z. Luo, Y. Wu. SCPMA 67, 230412 (2024).
Consequently, even if the network has only learned the time-dependent relationship between \(\boldsymbol{\theta}_S\) and the TDI response at a specific tref (the 30th day in our case), with the aid of coordinate transformation, it has essentially learned the time-invariant mapping \(f_2\), and can be then generalized to make parameter estimation at any other reference time.
It is worth noting that our method relies on analytical orbits and
the time-independence of the coordinate transformation \(f_2\).
The top section of the illustration shows the solar system barycenter (SSB) and Taiji frames, with two black dashed arrows symbolizing not two separate GW signals, but rather indicating how the sky location and arrival time of the same GW signal take different values in these two frames.
The “positive” problem translates the SSB-frame parameters to their Taiji-frame counterparts via a time-dependent mapping \(f_1\), then to the TDI outputs through a time-independent mapping \(f_2\), and an exponential term.
M. Du, B. Liang, HW, P. Xu, Z. Luo, Y. Wu. SCPMA 67, 230412 (2024).
Methodology: Utilization of the Kolmogorov-Smirnov (KS) test to compare one-dimensional distributions generated by our algorithms, ensuring the accuracy of parameter estimation.
Empirical Validation: Conducted extensive testing on simulated signals, injecting 1000 waveforms from the prior with added confusion noise and varying reference times between 1 and 365 days.
Results: The tests assessed the frequency at which true parameters fell within certain confidence levels, confirming that our credible intervals are well-calibrated and reflect true confidence in the signal parameters.
Computational performance
10000 posterior samples in 2.7 sec
M. Du, B. Liang, HW, P. Xu, Z. Luo, Y. Wu. SCPMA 67, 230412 (2024).
Overview of Findings: Nested sampling results indicate minimal expected multimodality in ecliptic coordinates. However, distinct peaks identified in the time of coalescence (\(t_c\)), labeled as NF-1 (dominant) and NF-2 (subdominant), highlight unique multimodal behavior.
(NF = Normalizing Flow model)
M. Du, B. Liang, HW, P. Xu, Z. Luo, Y. Wu. SCPMA 67, 230412 (2024).
《引力波探测中关于智能降噪和信号搜寻的研究》(已结题)
2023-01 ~ 2023-12
国自然基金委 | 理论物理专款研究项目 | 18万元 | 负责人
在该专款研究项目中负责了从引力波数据的生成,到数据集的制备,到算法模型的搭建,最后到引力波科学数据分析,涉及完整的数据处理流水线的设计和开发。
《基于引力波探测开源数据的共享数据门户》
2023-06 ~ 2025-05
国家天文科学数据中心 | 青年数据科学家项目 | 10万元 | 负责人
在该专款研究项目中负责构建一个引力波探测开源数据平台,通过采集、整合和管理引力波观测数据和科学分析结果,为科研人员和相关单位提供便捷的数据访问和应用服务,未来有望成为我国空间引力波探测计划的科学基础设施的一部分。
Earth-based GW detection
Space-based GW detection
中科院计算机信息中心“东方”超级计算系统 (全国产CPU/GPU)
for _ in range(num_of_audiences):
print('Thank you for your attention! 🙏')Earth-based GW detection
Space-based GW detection
| Pipeline | Targets | Programing Language (sampling method) | Comments |
|---|---|---|---|
|
GLASS (Littenberg&Cornish 2023) |
Noise, UCB, VGB, MBHB |
C / Python (TPMCMC / RJMCMC) | noise_mcmc+gb_mcmc+vb_mcmc+global_fit |
| Eryn | UCB | Python (TPMCMC / RJMCMC) | Mini code for UCB case |
| PyCBC-INFERENCE | MBHB | Python (?) | Unavailable |
| Bilby in Space / tBilby | MBHB / ? | ? / Python? (RJMCMC) | Unavailable |
| Strub et al. | UCB | ? (GP) | Unavailable / GPU-based |
| Zhang et al. (LZU) | UCB | ? (PSO) | MLP |
| Balrog | MBHB | ? |
(Sec.8.6 Red Book)
Global Fit
Technical challenges:
M. Du, B. Liang, HW, P. Xu, Z. Luo, Y. Wu. SCPMA 67, 230412 (2024).
Neural density estimation
Ref:
Neural density estimation
nflow
(Based on 1912.02762)
The main idea of flow-based modeling is to express \(\mathbf{y}\in\mathbb{R}^D\) as a transformation \(T\) of a real vector \(\mathbf{z}\in\mathbb{R}^D\) sampled from \(p_{\mathrm{z}}(\mathbf{z})\):
Note: The invertible and differentiable transformation \(T\) and the base distribution \(p_{\mathrm{z}}(\mathbf{z})\) can have parameters \(\{\boldsymbol{\phi}, \boldsymbol{\psi}\}\) of their own, i.e. \( T_{\phi} \) and \(p_{\mathrm{z},\boldsymbol{\psi}}(\mathbf{z})\).
Change of Variables:
Equivalently,
The Jacobia \(J_{T}(\mathbf{u})\) is the \(D \times D\) matrix of all partial derivatives of \(T\) given by:
base density
target density
Rational Quadratic Neural Spline Flows (RQ-NSF)
(Based on 1912.02762)
nflow
nflow
Train
Test
Credit: LIGO Magazine.
Traditional parameter estimation (PE) techniques rely on Bayesian analysis methods (posteriors + evidence)
Bayesian statistics
Data quality improvement
Credit: Marco Cavaglià
LIGO-Virgo data processing
GW searches
Astrophsical interpretation of GW sources
PRL 127, 24 (2021) 241103.
PRL 130, 17 (2023) 171403.
Nature Physics 18, 1 (2022) 112–17
HW, et al. Big Data Mining and Analytics 5, 1 (2021) 53–63.
A diagram of prior sampling between feature space and physical parameter space