Tutorial Questions - Day 3

Cassidy · May 23, 2023, 7:44am

I understand that the certificate is automatically generated once the course is completed. Does this mean we must complete the data challenge in order to complete the course and recieve the certificate?

rbadan · May 23, 2023, 11:17am

Will we know how many points we got in the challenge?

AmbicaG · May 24, 2023, 6:46am

Hi, why is this equation true:

AmbicaG · May 24, 2023, 7:38am

In weights = weights / max(weights) why are we normalising weights by the maximum value of weights? How can that be equal to the evidence?

AmbicaG · May 24, 2023, 7:46am

Can you please explain how efficiency = len(freq_samples) / len(freq_gsamples) fgives a measure of the efficiency of the rejection sampling?

mattia_emma · May 31, 2023, 12:38pm

Hi @AmbicaG, I hope I understand your question correctly. This is a gaussian likelihood function, by using this equation, we make sure that if our data point (or curve) is the same as the simulated one, we have L=1 because y_i=s(t_i,f,a) and the numerator in the exponential is null. The bigger the difference between the model and the data, s and y respectively, the more negative the numerator in the exponential and the smaller the likelihood. We could choose other functions to calculate the likelihood, but this is one of the easiest which fulfils all requirements.

mattia_emma · May 31, 2023, 12:44pm

Hi @AmbicaG , we want to normalize because it is easier to work with numbers which are between 0 and 1 instead of an unknown interval. There is no difference in using the non-normalised weights, since in the next step we are performing the rejection. If you want to use unnormalised weights you just have to change
keep = weights > np.random.uniform(0, 1, weights.shape) to
keep = weights > np.random.uniform(0, max(weights) , weights.shape).
We are not assuming max(weights) to be the evidence here, we do not need to include the evidence as it is irrelevant for the sampling, being an equal factor for all weights.

mattia_emma · May 31, 2023, 12:48pm

Hi @AmbicaG, as efficiency we mean how many of the samples we generate are still there after we perform the sampling. For example:
If we have three random numbers (0.1,0.2 and 0.8) and we generate three random numbers to compare them with as in the code, e.g. (0.05, 0.3 and 0.5). Now we have to compare them:
0.1>0.005 so we keep this point, 0.2<0.3 so we reject this point and 0.8>0.5 so we keep the last one.
In total we have kept 2 out of three initial numbers, our efficiency in this case is 2/3=0.666…
In the code we are doing the same, only that the rejection is done for 100000 samples in these three lines:
keep = weights > np.random.uniform(0, 1, weights.shape)
alpha_samples = alpha_gsamples[keep]
freq_samples = freq_gsamples[keep]

doraemon · September 2, 2023, 11:42am

Hi, what are these priors:
prior[‘a_1’] = 0.0
prior[‘a_2’] = 0.0
prior[‘tilt_1’] = 0.0
prior[‘tilt_2’] = 0.0
prior[‘phi_12’] = 0.0

jonah · September 8, 2023, 11:12pm

@doraemon Thank you for your question!

You can find parameter definitions, here:
https://lscsoft.docs.ligo.org/pesummary/stable_docs/gw/parameters.html

Topic		Replies	Views
Tutorials Questions - Day 1 Open Data Workshop	57	1258	February 12, 2024
Doubts in Data Challenge 4 Data Analysis	12	240	June 6, 2022
Lecture Questions - Day 3 (2022) Open Data Workshop	10	460	June 2, 2022
Tutorial A.3 of chapter A.3 Investigation of Continuous Wave	5	106	April 19, 2024
Certificate for completing the GW Workshop 2022 Open Data Workshop odw-2022	4	360	June 15, 2022

Tutorial Questions - Day 3

Related topics