
Flux Suppression Simulations (kal-cal issue, open, 58 comments)

kwazzi-jack commented on September 24, 2024
Flux Suppression Simulations

from kal-cal.

Comments (58)

landmanbester commented on September 24, 2024

Clearly something is going very wrong. Try divorcing from sky models and just placing pixels on a grid as suggested. Then make the dirty and model images and check that they look reasonable. How are you selecting your cell sizes? You need to make sure that your pixels are small enough to satisfy the Nyquist criterion.

o-smirnov commented on September 24, 2024

Yeah something is very wrong with the simulation then. Also, I suggest making images oversampled in the usual way (~5 pixels across the PSF) so that the sources are more obvious.

kwazzi-jack commented on September 24, 2024

So I've been somewhat successful at switching to generating a 2D model array (being careful not to put sources near the edge) and then using the same cell-size and pixel-size selection process we were using for the fluxtractor script (i.e. it satisfies Nyquist and uses the good_size function). Everything seems to be working well using dirty2ms instead. The new data columns themselves seem fine: I can image them with wsclean and get a decent image back. But I seem to be having an issue with ms2dirty, even though it takes the same parameters I used for dirty2ms. Here are the resulting image plots and the flux per source after using ms2dirty.

The "Before" model is the one I construct before feeding into dirty2ms, and the rest are the corresponding data columns in my ms fed into ms2dirty. Also, all of this is with 100% of the sources modelled. For the 1 and 2 source cases, the automatic selection of cell- and pixel-size seems to be fine for all arrays, but for some reason it does not work for the 100 source case: the model image before fits all sources, but the images after do not fit all of them. They use the same cell-size and pixel-size, so I'm not sure what could be wrong here.

KAT7 - 1 source
kat7-src-1-fp-100-img

KAT7 - 2 sources
kat7-src-2-fp-100-img

VLA-B - 1 source
vlab-src-1-fp-100-img

VLA-B - 2 sources
vlab-src-2-fp-100-img

VLA-B - 100 sources
vlab-src-100-fp-100-img

MeerKAT - 1 source
meerkat-src-1-fp-100-img

MeerKAT - 2 sources
meerkat-src-2-fp-100-img

MeerKAT - 100 sources
meerkat-src-100-fp-100-img

landmanbester commented on September 24, 2024

Just be careful with your terminology. Model images are in units of Jy/pixel, dirty images are in units of Jy/beam only after you divide them by the sum of the weights (usually called wsum). So none of the images you show are in the correct units.
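
The wsum normalisation described above can be sketched in numpy with stand-in arrays (ms2dirty itself is not called here; the array names are my own):

```python
import numpy as np

# Hypothetical example: 'weights' stands in for the per-visibility imaging
# weights passed to ms2dirty, and 'dirty_raw' for the raw image it returns.
# Dividing the raw dirty image by the sum of the weights (wsum) puts it in
# Jy/beam; the model image, by contrast, is already in Jy/pixel.
rng = np.random.default_rng(0)
weights = rng.uniform(0.5, 1.5, size=10000)            # per-visibility weights
dirty_raw = rng.normal(size=(64, 64)) * weights.sum()  # stand-in raw dirty image

wsum = weights.sum()
dirty_jy_per_beam = dirty_raw / wsum  # dirty image in Jy/beam
```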

As for the 100 source dirty images not making sense, are you just looking up pixel values at the corresponding sources or are you actually running the flux extraction script on them?

kwazzi-jack commented on September 24, 2024

I think that might be the issue here then. I have not run it through the fluxtractor script yet, but I will do so and see what happens. Also, do I need to pass a mask when I create my data columns from ms2dirty?

landmanbester commented on September 24, 2024

Yeah, that will make a difference, but I am not sure it is all that is wrong, because you still expect to get roughly the correct flux at the locations of the brightest sources (once you divide by wsum). The mask in ms2dirty is actually the negation of the flags, not the image-space mask that you use for source extraction. You don't have to worry about it unless you have flagged data.
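
A minimal sketch of that convention (hypothetical flag array): the mask selects the visibilities to grid, so it is 1 where the data is valid and 0 where it is flagged.

```python
import numpy as np

# FLAG column convention: True = flagged (bad) data.
flags = np.array([[False, True],
                  [False, False]])

# The gridder mask is the negation of the flags: 1 = use, 0 = discard.
mask = (~flags).astype(np.uint8)
```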

kwazzi-jack commented on September 24, 2024

So I ran the pipeline on the new data simulation. I decided to write a rough outline of my pipeline so you can see if there are any issues and to also help fill in what I've been doing.

Parameters

Here are the parameters governing the pipeline:

  • Arrays: KAT7, VLA-B and MeerKAT
  • Sources: 1, 2, and 100 (KAT7 only run on 1 and 2)
  • % modelled sources:
    • Source = 1: [100%]
    • Source = 2: [100%, 70%]
    • Source = 100: [100%, 90%, 80%, 70%]
  • Noise ~ CN(0, 4)
  • Dimensions:
    • n_time = 720
    • n_ant = 7, 27, 64 (respectively)
    • n_chan = n_dir = n_corr = 1
  • Scan parameters:
    • length = 2 hr
    • dt = 10 s
    • freq_0 = 1 GHz
  • Gains:
    • Amplitude-only
    • Variability is moderate-to-high (not sure how to phrase this)
  • Sky-model:
    • Generated from 2D image
    • FOV:
      • Source = 1 and 2: FOV = 0.2'
      • Source = 100: FOV = 0.5'
    • Pareto alpha = 2
    • Peak flux (brightest source) = 1.0 Jy
    • Buffer zone around edge:
      • Source = 1 and 2: 1 pixel
      • Source = 100: 5 pixels
  • kalcal:
    • Process Noise (log-scale) = 1e-6 to 1e-1
    • Measurement Noise = Same as noise level
  • quartical
    • Time intervals (linear-scale) = 1 to 50
    • Weights (sigma column) = Same as noise level
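
The noise model above can be written out in numpy (a sketch with variable names of my choosing): CN(0, 4) is a circular complex Gaussian with total variance 4, i.e. real and imaginary parts each N(0, 2), so sigma_n = sqrt(2) per component, consistent with the chi-squared weights used later in the thread.

```python
import numpy as np

# Noise ~ CN(0, 4): total complex variance 4 means each of the real and
# imaginary parts is drawn from N(0, 2), i.e. sigma_n = sqrt(2).
rng = np.random.default_rng(42)
n_vis = 100000
sigma_n = np.sqrt(2.0)
noise = rng.normal(0, sigma_n, n_vis) + 1j * rng.normal(0, sigma_n, n_vis)

var_est = np.var(noise)  # should be close to 4
```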

Simulation

Simulation is done as follows:

  • Generate ms for kat7, vlab, and meerkat using simms
  • Generate true-gains using above ms to be used for corruption
  • Create model images based on the auto-selected FOV, cell- and pixel-size for the given number of sources. The dynamic range of the flux is drawn from a Pareto distribution, and the % modelled source flux is reduced by removing faint sources until the total flux equals the required fraction. Sources are single, unique pixels in the image, and their positions are saved. A buffer zone ensures no sources are placed near the edge.
  • Generate model-vis using the model images above via dirty2ms
  • Corrupt model-vis with true-gains from above with africanus corrupt-vis to get clean-vis
  • Add noise to clean-vis to get our vis
  • Save model-vis, clean-vis, and vis to our ms
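
The corruption and noise steps above can be sketched as follows (stand-in arrays and gains; africanus corrupt-vis is not called here, and the antenna indexing is illustrative):

```python
import numpy as np

# For baseline (p, q) at each row: vis = g_p * model_vis * conj(g_q) + noise
# (single correlation, single channel). 'ant1'/'ant2' give the antenna pair
# of each row, as in the africanus corrupt_vis convention.
rng = np.random.default_rng(0)
n_ant, n_row = 7, 21
gains = np.exp(0.1 * rng.normal(size=n_ant)) + 0j   # amplitude-only gains
ant1 = np.repeat(np.arange(n_ant), 3)[:n_row]
ant2 = (ant1 + 1) % n_ant
model_vis = np.ones(n_row, dtype=complex)

clean_vis = gains[ant1] * model_vis * np.conj(gains[ant2])

sigma_n = np.sqrt(2.0)  # per-component sigma for noise ~ CN(0, 4)
vis = clean_vis + rng.normal(0, sigma_n, n_row) \
                + 1j * rng.normal(0, sigma_n, n_row)
```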

Calibration

Calibration algorithms used are:

  • kalcal:
    • ekf run once over data, then eks over ekf results
    • fixed noise parameter (sigma_n) to that of simulated data
    • calibrated over process noise using log-space
    • gains saved in codex-format (with extra dimension for augmented complex part) = (n_time, n_ant, n_chan, n_dir, n_corr, n_aug)
  • quartical:
    • Solve for G term over 25 iterations
    • Diag-complex mode
    • No model-vis recipe, use simulated model-vis from before
    • Set weights according to sigma-column
    • Gains saved and converted to codex-format = (n_time, n_ant, n_chan, n_dir, n_corr)

Imaging

Perform single pixel per source imaging using:

  • Reduced, direct version of pcg algorithm used in pfb-clean
  • A Hessian operator that goes from image to visibilities and back to image, applying a mask built from our simulated model source positions and the weighting associated with the calibrated gains used above
  • Feed into pcg to get recovered image
  • Using model positions, extract flux at given source pixel positions
  • Save fluxes in original order of brightest source to faintest
  • Same grid for kalcal and quartical is used when imaging the calibrated gains from calibration
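
As a rough illustration of the pcg step (not the pfb-clean implementation; the Hessian here is a stand-in SPD matrix), a minimal unpreconditioned conjugate gradient loop looks like this:

```python
import numpy as np

def cg(hess, b, tol=1e-10, maxit=500):
    """Solve hess(x) = b for symmetric positive-definite hess."""
    x = np.zeros_like(b)
    r = b - hess(x)
    p = r.copy()
    rs = r @ r
    for _ in range(maxit):
        hp = hess(p)
        alpha = rs / (p @ hp)
        x += alpha * p
        r -= alpha * hp
        rs_new = r @ r
        if np.sqrt(rs_new) < tol:
            break
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x

# Stand-in SPD "Hessian"; in the pipeline this would be the masked
# image -> vis -> image operator with the gain weighting applied.
A = np.array([[4.0, 1.0], [1.0, 3.0]])
b = np.array([1.0, 2.0])
x = cg(lambda v: A @ v, b)
```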

Plotting results

We plot the following results:

  • Source = 1 and 2:
    • Gains mse for kalcal over process noise (against optimal quartical result)
    • Gains mse for quartical over time interval (against optimal kalcal result)
    • Flux ratio per source (only implemented to show brightest, modelled source for now)
  • Source = 100
    • Gains mse for kalcal over process noise (against optimal quartical result)
    • Gains mse for quartical over time interval (against optimal kalcal result)
    • Average % error over both kalcal and quartical
    • Weighted average over both kalcal and quartical (as a check)

Results

The MSE on the gains seems appropriate as before:

1.1 Gains MSE: kalcal over process noise (against quartical), single source

gains-mse-prim-kalcal-sec-quartical-all-src-1

1.2 Gains MSE: quartical over time intervals (against kalcal), single source

gains-mse-prim-quartical-sec-kalcal-all-src-1

2.1 Gains MSE: kalcal over process noise (against quartical), two sources

gains-mse-prim-kalcal-sec-quartical-all-src-2

2.2 Gains MSE: quartical over time intervals (against kalcal), two sources

gains-mse-prim-quartical-sec-kalcal-all-src-2

3.1 Gains MSE: kalcal over process noise (against quartical), 100 sources with MeerKAT

gains-mse-prim-kalcal-sec-quartical-meerkat-src-100

3.2 Gains MSE: quartical over time intervals (against kalcal), 100 sources with MeerKAT

gains-mse-prim-quartical-sec-kalcal-meerkat-src-100

4.1 Gains MSE: kalcal over process noise (against quartical), 100 sources with VLA-B

gains-mse-prim-kalcal-sec-quartical-vlab-src-100

4.2 Gains MSE: quartical over time intervals (against kalcal), 100 sources with VLA-B

gains-mse-prim-quartical-sec-kalcal-vlab-src-100

5.1 Flux Ratio: kalcal vs quartical, one source

flux-ratio-prim-kalcal-sec-quartical-all-src-1

5.2 Flux Ratio: kalcal vs quartical, two sources

flux-ratio-prim-kalcal-sec-quartical-all-src-2

6.1 Average % Error: kalcal vs quartical, 100 sources with MeerKAT

flux-avg-percent-err-prim-kalcal-sec-quartical-meerkat-src-100

6.2 Average % Error: kalcal vs quartical, 100 sources with VLA-B

flux-avg-percent-err-prim-kalcal-sec-quartical-vlab-src-100

7.1 Weighted average: kalcal vs quartical, 100 sources with MeerKAT

flux-weight-avg-prim-kalcal-sec-quartical-meerkat-src-100

7.2 Weighted average: kalcal vs quartical, 100 sources with VLA-B

flux-weight-avg-prim-kalcal-sec-quartical-vlab-src-100

Notes

  • kalcal:
    • Has to use a slightly smaller sigma_f to achieve optimal results (which I thought would be much larger? Like a magnitude less at least).
    • Avg. % error makes sense as well, but I do not understand the slight dip on the left; possibly it is the grid I am using?
    • MSE makes sense, could use a smaller grid size for sigma_f
    • Across all plots, the values of the metrics only change slightly as % modelled sources decreases
  • quartical:
    • Gains MSE makes more sense now and behaves like it should.
    • The rest of the plots do not make sense to me; again, it performs too well at any time interval, even a really small one.

kwazzi-jack commented on September 24, 2024

This is the same issue as before, sadly, and I am not quite sure what to do. I decided to check the results after switching quartical out for cubical. It is known to be worse than quartical, and I set it up to mimic quartical as closely as possible. The one thing I forgot was setting the weights for cubical to the noise value of the data; I'm not sure how much that would impact the results. Anyway, here are the resulting plots, similar to those above:

1.1 Gains MSE: kalcal over process noise (against cubical), single source

gains-mse-prim-kalcal-sec-cubical-all-src-1

1.2 Gains MSE: cubical over time intervals (against kalcal), single source

gains-mse-prim-cubical-sec-kalcal-all-src-1

2.1 Gains MSE: kalcal over process noise (against cubical), two sources

gains-mse-prim-kalcal-sec-cubical-all-src-2

2.2 Gains MSE: cubical over time intervals (against kalcal), two sources

gains-mse-prim-cubical-sec-kalcal-all-src-2

3.1 Gains MSE: kalcal over process noise (against cubical), 100 sources with MeerKAT

gains-mse-prim-kalcal-sec-cubical-meerkat-src-100

3.2 Gains MSE: cubical over time intervals (against kalcal), 100 sources with MeerKAT

gains-mse-prim-cubical-sec-kalcal-meerkat-src-100

4.1 Gains MSE: kalcal over process noise (against cubical), 100 sources with VLA-B

gains-mse-prim-kalcal-sec-cubical-vlab-src-100

4.2 Gains MSE: cubical over time intervals (against kalcal), 100 sources with VLA-B

gains-mse-prim-cubical-sec-kalcal-vlab-src-100

5.1 Flux Ratio: kalcal vs cubical, one source

flux-ratio-prim-cubical-sec-kalcal-all-src-1

5.2 Flux Ratio: kalcal vs cubical, two sources

flux-ratio-prim-cubical-sec-kalcal-all-src-2

6.1 Average % Error: kalcal vs cubical, 100 sources with MeerKAT

flux-avg-percent-err-prim-cubical-sec-kalcal-meerkat-src-100

6.2 Average % Error: kalcal vs cubical, 100 sources with VLA-B

flux-avg-percent-err-prim-cubical-sec-kalcal-vlab-src-100

7.1 Weighted average: kalcal vs cubical, 100 sources with MeerKAT

flux-weight-avg-prim-cubical-sec-kalcal-meerkat-src-100

7.2 Weighted average: kalcal vs cubical, 100 sources with VLA-B

flux-weight-avg-prim-cubical-sec-kalcal-vlab-src-100

Not too sure what to make of this either, but it does seem to show the avg. % error worsening as the time interval decreases, unlike with quartical. On a side note, has using a single correlation not made the problem less complex, resulting in solutions that are far easier to reach and much closer to the true ones?

Let me know if you want to see the fits images as well of the results.

kwazzi-jack commented on September 24, 2024

Found a single bug through imaging the data columns in the 2 source case: I had overwritten some visibility values, reducing it to something similar to the single source case. The 1 source and 100 source cases should be fine.

landmanbester commented on September 24, 2024

Wow, that's a lot of information to take in. At least things are making more sense now. The simulation you describe seems sensible. I am surprised by how big the difference between CubiCal and QuartiCal is; they should be pretty much equivalent. I guess Jon put some magic sauce in the QuartiCal recipe. Some things to check:
i) Are you interpolating the qcal gain solutions to full resolution before doing the flux extraction or are you applying a sum of boxcars? If the former it may explain why qcal is doing so well on the flux ratio plots (5.1 and 5.2). The CubiCal plots look more like I expect.
ii) Why are the kalcal results so different for the average % error between the two runs (6.1. and 6.2)? Are you not using the same parameters for kalcal between the two runs? The results for the second run look more like I expect.
iii) I don't think the weighted sum is a good metric to show in your thesis because it does not provide information on individual sources but rather some kind of global average. I think this can be misleading. You can put this in your thesis as a motivation for looking at the average % error instead

kwazzi-jack commented on September 24, 2024

Well, with regards to the differences between quartical and cubical, I am not sure they are "exactly" the same based on the configuration files.

i) Are you interpolating the qcal gain solutions to full resolution before doing the flux extraction or are you applying a sum of boxcars? If the former it may explain why qcal is doing so well on the flux ratio plots (5.1 and 5.2). The CubiCal plots look more like I expect.

I am applying a sum of boxcars, by extending the solutions as constant over the specified time interval. I do this for both quartical and cubical.
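
The boxcar extension described here can be sketched with np.repeat (illustrative shapes and stand-in solutions):

```python
import numpy as np

# Expand solution-interval gains to full time resolution: each interval's
# solution is held constant over its tint time steps (a sum of boxcars).
n_time, tint = 720, 16
n_int = int(np.ceil(n_time / tint))              # 45 intervals here
gains_per_int = np.arange(n_int, dtype=complex)  # stand-in solutions
gains_full = np.repeat(gains_per_int, tint)[:n_time]
```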

ii) Why are the kalcal results so different for the average % error between the two runs (6.1. and 6.2)? Are you not using the same parameters for kalcal between the two runs? The results for the second run look more like I expect.

They use the same parameters. The only difference would be the arrays used (so VLA-B or MeerKAT). I'm not sure what is happening for sigma_f < 1e-5 in 6.1 with MeerKAT. The only other notable differences would be cell-size, image-size and simulated data.

iii) I don't think the weighted sum is a good metric to show in your thesis because it does not provide information on individual sources but rather some kind of global average. I think this can be misleading. You can put this in your thesis as a motivation for looking at the average % error instead

I am aware of this as we discussed before. I'm using it as a test metric only. I understand how the weighted average should look and so if it deviates from this then I know there is an issue and most probably it is affecting the other results as well.

landmanbester commented on September 24, 2024

I think the consensus is that something must still be going wrong. I am not sure what it is but I can suggest a few things to try:

  • Set P_0 to Q and initialise your solution at t0 using the maximum likelihood solution (you can use that time only maximum likelihood calibration script I gave you a while ago to get the solution only at time 0).
  • Write more detailed tests for your JHr and JHJ operators. How are you testing them atm? I know we tested J with the seemingly linear model but what about JHr? If you are using the sparse matrix representation you can simply use .conj() and .transpose() to get it explicitly. Is that what you are doing? If not you need to check that JH is consistent with J. Do you know how to do that?
  • For your sanity checks, simulate some gains with non-unity mean functions. That should result in much worse flux suppression if you do the flux extraction using only unity gains.
  • Make your phase variations far more extreme than your amplitude variations and confirm that things get much worse in this case.

kwazzi-jack commented on September 24, 2024
  • Set P_0 to Q and initialise your solution at t0 using the maximum likelihood solution (you can use that time only maximum likelihood calibration script I gave you a while ago to get the solution only at time 0).

With the maximum likelihood script, are we solving for that specific time-step or just over the whole time-axis and then selecting the solution for the first time-step as m_0 for the filter?

  • Write more detailed tests for your JHr and JHJ operators. How are you testing them atm? I know we tested J with the seemingly linear model but what about JHr? If you are using the sparse matrix representation you can simply use .conj() and .transpose() to get it explicitly. Is that what you are doing? If not you need to check that JH is consistent with J. Do you know how to do that?

Currently, I am using the explicit implementation for the calibration, as it is faster than the sparse implementation, so that shouldn't be an issue. I haven't done detailed tests on the operators. I don't particularly know what I could test JHr and JHJ against in this sense, let alone whether JH is consistent with J (I am using JH = J.conjugate().transpose()).
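
One standard way to check adjoint consistency (a sketch, with a random dense stand-in for J) is the dot test: for random x and y, the inner products <J x, y> and <x, JH y> must agree to machine precision, whatever JH implementation is used.

```python
import numpy as np

# Dot (adjoint) test: if JH really is the adjoint of J, then
# <J x, y> == <x, JH y> for any x, y.
rng = np.random.default_rng(0)
J = rng.normal(size=(8, 5)) + 1j * rng.normal(size=(8, 5))
JH = J.conjugate().transpose()

x = rng.normal(size=5) + 1j * rng.normal(size=5)
y = rng.normal(size=8) + 1j * rng.normal(size=8)

lhs = np.vdot(J @ x, y)   # <J x, y>  (vdot conjugates its first argument)
rhs = np.vdot(x, JH @ y)  # <x, JH y>
```

For an implicit (matrix-free) operator, the same test applies: run the forward and adjoint routines on random vectors and compare the two inner products.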

  • For your sanity checks, simulate some gains with non-unity mean functions. That should result in much worse flux suppression if you do the flux extraction using only unity gains.
  • Make your phase variations far more extreme than your amplitude variations and confirm that things get much worse in this case.

I'm not quite sure what you meant by mean functions in this case. What I have tried to do is to separate gains creation by:

  1. Simulate gains for amplitude on large time length-scales,
  2. Simulate gains for phase on small time length-scales,
  3. Form the full-complex gains by AMPLITUDE_GAINS.real * np.exp(1.0j * PHASE_GAINS)

I haven't made any of the other changes you suggested yet (like setting P_0 to Q), so as to test this first. I tested unity gains vs true gains imaging with the fluxtractor on the new simulated data columns, and the suppression now seems quite severe if we use unity gains, with flux ratios of sources ranging from 1.2 to 1.8.

I found the optimal solutions for kalcal and quartical and plotted them in a gains-product plot for VLA-B with 100 sources (100% modelled). Here you can see what I am trying to achieve with my new gains, and how quartical and kalcal behave as we have been hoping: kalcal is able to track a lot better despite the erratic data, whereas quartical has to average out to reduce the overfitting of noise. A final point to note is that sigma_f = 0.1 and tint = 16 here.

gp-quartical-vs-kalcal

The only issue is that the resulting pipeline plots are now far harder to understand, and quartical still seems to be doing well despite the terrible solutions it produces. Let me know which plot you would like to see.

I would like a meeting to go through all of this, possibly tomorrow?

kwazzi-jack commented on September 24, 2024

I'm busy running the kalcal side where m_0 is known and P_0 is Q. I made preliminary plots to compare the behaviour of the original start (m_0 = 1.0 + 0.0j, P_0 = 1.0) with our new one. All of these are VLA-B with 100 sources, 100% modelled visibilities, and various sigma_f.

1.1 Original priors, Sigma_f = 0.1 (ideal sigma_f)

gp-vlab-src-100-sf-0 1-normal-start

1.2 New priors, Sigma_f = 0.1 (ideal sigma_f)

gp-vlab-src-100-sf-0 1-ml-start

2.1 Original priors, Sigma_f = 0.00001 (sigma_f really small, lots of averaging)

gp-vlab-src-100-sf-0 00001-normal-start

2.2 New priors, Sigma_f = 0.00001 (sigma_f really small, lots of averaging)

gp-vlab-src-100-sf-0 00001-ml-start

3.1 Original priors, Sigma_f = 10.0 (sigma_f really large, fits noise more, can't track as well since steps are too large)

gp-vlab-src-100-sf-10 0-normal-start

3.2 New priors, Sigma_f = 10.0 (sigma_f really large, fits noise more, can't track as well since steps are too large)

gp-vlab-src-100-sf-10 0-ml-start

landmanbester commented on September 24, 2024

Ok, this is what I expected to see. With the original prior, your starting estimate for the prior covariance was totally over-estimated and doesn't have enough steps to contract when using a large value for sigma_f. I think the new priors are more sensible. Can you check what the posterior covariances look like now? I remember they were totally over-estimated; I wonder if the prior could have been the cause of that. Also, maybe the hyper-parameter formulae work better with the new prior. We won't have time to check all of that now, but it's definitely on the list for your PhD.

kwazzi-jack commented on September 24, 2024

Results from the chi-sq on residual visibilities:

w * (V_pq - g_p M_pq g_q^H)^H (V_pq - g_p M_pq g_q^H)

where w = 1 / (2 * sigma_n^2) and sigma_n = sqrt(2).
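
A numpy sketch of this computation on stand-in single-correlation arrays (unity gains and model, so the residual is pure noise):

```python
import numpy as np

# chisq = w * res^H res, with res = V_pq - g_p M_pq conj(g_q),
# w = 1 / (2 sigma_n^2) and sigma_n = sqrt(2).
rng = np.random.default_rng(1)
n_row = 1000
sigma_n = np.sqrt(2.0)
g_p = g_q = np.ones(n_row, dtype=complex)
model = np.ones(n_row, dtype=complex)
vis = g_p * model * np.conj(g_q) \
      + rng.normal(0, sigma_n, n_row) + 1j * rng.normal(0, sigma_n, n_row)

res = vis - g_p * model * np.conj(g_q)
w = 1.0 / (2.0 * sigma_n ** 2)
chisq = (w * np.vdot(res, res)).real  # with this weighting, ~ n_row
```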

1. When gains are set to true-gains

| Array | # of Sources | % Modelled Flux | Chi-Sq |
| --- | --- | --- | --- |
| kat7 | 1 | 100 | 15226.2 |
| kat7 | 2 | 100 | 15134.2 |
| kat7 | 2 | 70 | 17733.1 |
| meerkat | 1 | 100 | 1.45186e+06 |
| meerkat | 2 | 100 | 1.45198e+06 |
| meerkat | 2 | 70 | 1.75963e+06 |
| meerkat | 100 | 100 | 1.45072e+06 |
| meerkat | 100 | 90 | 1.48772e+06 |
| meerkat | 100 | 80 | 1.55824e+06 |
| meerkat | 100 | 70 | 1.67628e+06 |
| vlab | 1 | 100 | 251985 |
| vlab | 2 | 100 | 252323 |
| vlab | 2 | 70 | 300224 |
| vlab | 100 | 100 | 253325 |
| vlab | 100 | 90 | 258168 |
| vlab | 100 | 80 | 266088 |
| vlab | 100 | 70 | 278133 |

2. When gains are set to kalcal with sigma_f = 0.1

| Array | # of Sources | % Modelled Flux | Chi-Sq |
| --- | --- | --- | --- |
| kat7 | 1 | 100 | 14699 |
| kat7 | 2 | 100 | 14639.7 |
| kat7 | 2 | 70 | 16092 |
| meerkat | 1 | 100 | 1.44566e+06 |
| meerkat | 2 | 100 | 1.44639e+06 |
| meerkat | 2 | 70 | 2.64533e+06 |
| meerkat | 100 | 100 | 1.4484e+06 |
| meerkat | 100 | 90 | 1.48315e+06 |
| meerkat | 100 | 80 | 1.54872e+06 |
| meerkat | 100 | 70 | 1.68022e+06 |
| vlab | 1 | 100 | 249203 |
| vlab | 2 | 100 | 249412 |
| vlab | 2 | 70 | 425598 |
| vlab | 100 | 100 | 250845 |
| vlab | 100 | 90 | 254977 |
| vlab | 100 | 80 | 262016 |
| vlab | 100 | 70 | 272525 |

3. When gains are set to quartical with tint = 16

| Array | # of Sources | % Modelled Flux | Chi-Sq |
| --- | --- | --- | --- |
| kat7 | 1 | 100 | 15545.1 |
| kat7 | 2 | 100 | 15630.5 |
| kat7 | 2 | 70 | 16447.2 |
| meerkat | 1 | 100 | 1.51988e+06 |
| meerkat | 2 | 100 | 1.53598e+06 |
| meerkat | 2 | 70 | 1.8162e+06 |
| meerkat | 100 | 100 | 1.63426e+06 |
| meerkat | 100 | 90 | 1.66772e+06 |
| meerkat | 100 | 80 | 1.73098e+06 |
| meerkat | 100 | 70 | 1.83863e+06 |
| vlab | 1 | 100 | 266645 |
| vlab | 2 | 100 | 264034 |
| vlab | 2 | 70 | 302914 |
| vlab | 100 | 100 | 278359 |
| vlab | 100 | 90 | 282050 |
| vlab | 100 | 80 | 288977 |
| vlab | 100 | 70 | 299301 |

Not sure what to make of the numbers.

kwazzi-jack commented on September 24, 2024

Let me know if you want me to post the reduced chi-squared rather.

landmanbester commented on September 24, 2024

Can you just compute np.vdot(res, res) to make sure nothing is going wrong in your chisq formula. Also, compare to what you get from doing this with QuartiCal. That should tell us if something is going wrong

landmanbester commented on September 24, 2024

The chisq values look reasonable, they are increasing with decreasing modelled flux as expected. Once you have confirmed that you get the same as qcal can you plot the chisq as a function of solution interval for one of the 100% modelled flux simulations and check if the minimum coincides with the minimum of the MSE?

kwazzi-jack commented on September 24, 2024

Here is with np.vdot(res, res).

1. Noise-free Visibilities, with true-gains

| Array | # of Sources | % Modelled Flux | Chi-Sq |
| --- | --- | --- | --- |
| kat7 | 1 | 100 | 9.73429e-11 |
| kat7 | 2 | 100 | 1.2532e-10 |
| kat7 | 2 | 70 | 9983.55 |
| meerkat | 1 | 100 | 1.16988e-08 |
| meerkat | 2 | 100 | 1.48559e-08 |
| meerkat | 2 | 70 | 1.23031e+06 |
| meerkat | 100 | 100 | 3.24956e-08 |
| meerkat | 100 | 90 | 148786 |
| meerkat | 100 | 80 | 431005 |
| meerkat | 100 | 70 | 902145 |
| vlab | 1 | 100 | 3.37272e-10 |
| vlab | 2 | 100 | 2.29302e-09 |
| vlab | 2 | 70 | 192015 |
| vlab | 100 | 100 | 4.35415e-09 |
| vlab | 100 | 90 | 19478.3 |
| vlab | 100 | 80 | 51001.8 |
| vlab | 100 | 70 | 99211.8 |

2. Noisy Visibilities, with true-gains

| Array | # of Sources | % Modelled Flux | Chi-Sq |
| --- | --- | --- | --- |
| kat7 | 1 | 100 | 60904.7 |
| kat7 | 2 | 100 | 60536.9 |
| kat7 | 2 | 70 | 70932.4 |
| meerkat | 1 | 100 | 5.80743e+06 |
| meerkat | 2 | 100 | 5.80794e+06 |
| meerkat | 2 | 70 | 7.03852e+06 |
| meerkat | 100 | 100 | 5.8029e+06 |
| meerkat | 100 | 90 | 5.95088e+06 |
| meerkat | 100 | 80 | 6.23298e+06 |
| meerkat | 100 | 70 | 6.70512e+06 |
| vlab | 1 | 100 | 1.00794e+06 |
| vlab | 2 | 100 | 1.00929e+06 |
| vlab | 2 | 70 | 1.2009e+06 |
| vlab | 100 | 100 | 1.0133e+06 |
| vlab | 100 | 90 | 1.03267e+06 |
| vlab | 100 | 80 | 1.06435e+06 |
| vlab | 100 | 70 | 1.11253e+06 |

3. Noisy Visibilities, with kalcal-gains

| Array | # of Sources | % Modelled Flux | Chi-Sq |
| --- | --- | --- | --- |
| kat7 | 1 | 100 | 58796.2 |
| kat7 | 2 | 100 | 58559 |
| kat7 | 2 | 70 | 64367.9 |
| meerkat | 1 | 100 | 5.78264e+06 |
| meerkat | 2 | 100 | 5.78555e+06 |
| meerkat | 2 | 70 | 1.05813e+07 |
| meerkat | 100 | 100 | 5.7936e+06 |
| meerkat | 100 | 90 | 5.9326e+06 |
| meerkat | 100 | 80 | 6.1949e+06 |
| meerkat | 100 | 70 | 6.72089e+06 |
| vlab | 1 | 100 | 996813 |
| vlab | 2 | 100 | 997646 |
| vlab | 2 | 70 | 1.70239e+06 |
| vlab | 100 | 100 | 1.00338e+06 |
| vlab | 100 | 90 | 1.01991e+06 |
| vlab | 100 | 80 | 1.04806e+06 |
| vlab | 100 | 70 | 1.0901e+06 |

4. Noisy Visibilities, with quartical-gains

| Array | # of Sources | % Modelled Flux | Chi-Sq |
| --- | --- | --- | --- |
| kat7 | 1 | 100 | 62180.5 |
| kat7 | 2 | 100 | 62521.9 |
| kat7 | 2 | 70 | 65788.9 |
| meerkat | 1 | 100 | 6.07953e+06 |
| meerkat | 2 | 100 | 6.14392e+06 |
| meerkat | 2 | 70 | 7.26482e+06 |
| meerkat | 100 | 100 | 6.53704e+06 |
| meerkat | 100 | 90 | 6.67087e+06 |
| meerkat | 100 | 80 | 6.92393e+06 |
| meerkat | 100 | 70 | 7.35454e+06 |
| vlab | 1 | 100 | 1.06658e+06 |
| vlab | 2 | 100 | 1.05614e+06 |
| vlab | 2 | 70 | 1.21166e+06 |
| vlab | 100 | 100 | 1.11344e+06 |
| vlab | 100 | 90 | 1.1282e+06 |
| vlab | 100 | 80 | 1.15591e+06 |
| vlab | 100 | 70 | 1.1972e+06 |
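
As a sanity check on the scale of these numbers (a sketch under the CN(0, 4) noise assumption): with perfect gains and a fully modelled sky, the residual is pure noise, so np.vdot(res, res) should come out near 4 x n_vis.

```python
import numpy as np

# Residual = pure noise ~ CN(0, 4): sigma = sqrt(2) per component,
# so E[|res|^2] = 4 and vdot(res, res) ~ 4 * n_vis.
rng = np.random.default_rng(7)
n_vis = 50000
sigma_n = np.sqrt(2.0)
res = rng.normal(0, sigma_n, n_vis) + 1j * rng.normal(0, sigma_n, n_vis)

chisq = np.vdot(res, res).real
expected = 4.0 * n_vis
```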

landmanbester commented on September 24, 2024

Looks like kalcal is doing better at the level of the visibilities as well, so it still doesn't make sense that qcal should do better with flux suppression. I really want to see how the chisq compares when computed with the residuals written out by qcal. Have you looked at that?

kwazzi-jack commented on September 24, 2024

Okay, that does bring a bit of relief. Here is with Quartical's residual product only:

| Array | # of Sources | % Modelled Flux | tint-1 | tint-4 | tint-8 | tint-16 | tint-32 | tint-64 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| kat7 | 1 | 100 | 40174.5 | 56554.4 | 59759.6 | 62180.5 | 63669.2 | 65147.5 |
| kat7 | 2 | 100 | 40312.6 | 56352 | 59803.4 | 62521.9 | 64103.6 | 65899.5 |
| kat7 | 2 | 70 | 43325.9 | 59575.6 | 63108.7 | 65788.9 | 67345.1 | 69049.6 |
| meerkat | 1 | 100 | 5.62304e+06 | 5.80754e+06 | 5.92706e+06 | 6.07953e+06 | 6.2185e+06 | 6.3873e+06 |
| meerkat | 2 | 100 | 5.62459e+06 | 5.81812e+06 | 5.9571e+06 | 6.14392e+06 | 6.30207e+06 | 6.5162e+06 |
| meerkat | 2 | 70 | 6.79127e+06 | 6.97803e+06 | 7.10142e+06 | 7.26482e+06 | 7.40097e+06 | 7.58521e+06 |
| meerkat | 100 | 100 | 5.61982e+06 | 5.88091e+06 | 6.16587e+06 | 6.53704e+06 | 6.90957e+06 | 7.39515e+06 |
| meerkat | 100 | 90 | 5.75912e+06 | 6.01911e+06 | 6.30226e+06 | 6.67087e+06 | 7.04123e+06 | 7.52233e+06 |
| meerkat | 100 | 80 | 6.02102e+06 | 6.27984e+06 | 6.55968e+06 | 6.92393e+06 | 7.29123e+06 | 7.76842e+06 |
| meerkat | 100 | 70 | 6.46727e+06 | 6.7236e+06 | 6.99803e+06 | 7.35454e+06 | 7.71673e+06 | 8.18398e+06 |
| vlab | 1 | 100 | 930409 | 998386 | 1.02976e+06 | 1.06658e+06 | 1.09611e+06 | 1.13019e+06 |
| vlab | 2 | 100 | 931919 | 998640 | 1.0255e+06 | 1.05614e+06 | 1.08163e+06 | 1.11554e+06 |
| vlab | 2 | 70 | 1.0902e+06 | 1.1568e+06 | 1.18265e+06 | 1.21166e+06 | 1.23714e+06 | 1.27369e+06 |
| vlab | 100 | 100 | 935598 | 1.01095e+06 | 1.05619e+06 | 1.11344e+06 | 1.16822e+06 | 1.23948e+06 |
| vlab | 100 | 90 | 952387 | 1.02781e+06 | 1.07243e+06 | 1.1282e+06 | 1.18413e+06 | 1.25478e+06 |
| vlab | 100 | 80 | 980112 | 1.0564e+06 | 1.10074e+06 | 1.15591e+06 | 1.21247e+06 | 1.28243e+06 |
| vlab | 100 | 70 | 1.02253e+06 | 1.09757e+06 | 1.14208e+06 | 1.1972e+06 | 1.25403e+06 | 1.32463e+06 |

kwazzi-jack commented on September 24, 2024

It does not seem right that tint-1 is far lower than the larger tints. So, if I am correct, does this mean there is an issue with the simulated data?

kwazzi-jack commented on September 24, 2024

Also, the column for tint-16 in the previous table matches this table. So they are producing the same outcome.

JSKenyon commented on September 24, 2024

Also, the column for tint-16 in the previous table matches this table. So they are producing the same outcome.

That is comforting.

Does not seem right that tint-1 seems far less compared to tint larger than it. So if I am correct, this means there is an issue with the simulated data?

It makes sense that it is low - you have a large number of degrees of freedom. You can and will over-fit to some extent. I would suggest running the tint-2 and tint-3 cases just to be sure. Again, plotting the tint-1 gains on the same plot as the tint-16 and KalCal gains may be illuminating.

kwazzi-jack commented on September 24, 2024

Again, plotting the tint-1 gains on the same plot as the tint-16 and KalCal gains may be illuminating.

gp-quartical-compare

EDIT: Complex line = true gains line

kwazzi-jack commented on September 24, 2024

I imaged the residual columns using wsclean:

1. Residuals - VLA-B @ 100 sources, with quartical (tint = 1)

wsclean-vlab-tint-1-residual

2. Residuals - VLA-B @ 100 sources, with quartical (tint = 16)

wsclean-vlab-tint-16-residual

kwazzi-jack commented on September 24, 2024

3. Noise-free corrupted visibilities - VLA-B @ 100 sources

wsclean-vlab-true

JSKenyon commented on September 24, 2024

Ok, that is good.

Again, plotting the tint-1 gains on the same plot as the tint-16 and KalCal gains may be illuminating.

gp-quartical-compare

EDIT: Complex line = true gains line

This is good, but it seriously suggests that something is wrong with the gain MSE plots for QuartiCal. By eye, tint-1 is way closer to the truth than any of the others. I would actually expect a slightly higher tint, between 2 and 6, to fit even better, as it will overfit slightly less.

The residual images are not informative without a locked colour scale/axes.
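
One way to lock the colour scale across residual images (a matplotlib sketch; the image data here is random stand-in noise) is to compute a single (vmin, vmax) pair from all images and pass it to every imshow call:

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend for scripted use
import matplotlib.pyplot as plt
import numpy as np

# Stand-in residual images with different noise levels.
rng = np.random.default_rng(0)
res_a = rng.normal(0, 1.0, (64, 64))
res_b = rng.normal(0, 0.3, (64, 64))

# One shared colour scale so the panels are directly comparable.
vmin = min(res_a.min(), res_b.min())
vmax = max(res_a.max(), res_b.max())

fig, (ax_a, ax_b) = plt.subplots(1, 2)
im_a = ax_a.imshow(res_a, vmin=vmin, vmax=vmax)
im_b = ax_b.imshow(res_b, vmin=vmin, vmax=vmax)
```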

Edit: This is progress though!

kwazzi-jack commented on September 24, 2024

So to increase overfitting, I should increase noise, which in turn would raise the optimal tint to something a few steps higher?

JSKenyon commented on September 24, 2024

The tint-1 solution is already overfitting. What I am saying is that it is surprising that you find an optimal tint of 16 - it is clearly underfitting. Try producing the gains product plots with tint-1 through tint-8 using QuartiCal. You should clearly see it move from overfitting to underfitting and if you don't then we have to consider a bug in QuartiCal.

Edit: Ultimately, I think that the QuartiCal solutions look as you would expect. The issue was that the flux suppression plots didn't seem consistent with the gains. If the gains look right, that suggests the problem is elsewhere.

kwazzi-jack commented on September 24, 2024

I think that is what would happen if I plotted tint-1 to tint-8. The MSE plots are probably, as you said, just whichever happens to be generally closer. That is why I think it has something to do with the simulated data, which I am going to image using the corrected-data from quartical to see the effect of tint-1 and up. I'm also going to have a look at simulating a dataset with meqtrees to see if things change.

kwazzi-jack commented on September 24, 2024

I've set up a single branch of my pipeline as a Jupyter notebook, as suggested. It goes from creating the ms through to a few plots. You can find it on tina at /net/tina/home/welman/masters/projects/flux_suppression/debug/. In there is the notebook called flux_suppression.ipynb. There is a virtual environment in /net/tina/home/welman/masters/projects/flux_suppression/ called kal-env that runs the notebook server and has all the necessary libraries.

Please could you have a look as it is still producing similar results that make no sense. Thank you!

kwazzi-jack commented on September 24, 2024

So here are results from the smaller pipeline. Running on MeerKAT with 100 sources, peak-flux=0.1 Jy and epsilon ~ CN(0, 8). Calibration was done on 100%, 90%, 80%, and 70% modelled sources. Here are the results:

1. Mean Square Error of Gains

gains-mse

2. Weighted Average Absolute Percentage Error

flux-wape

3. Flux Ratio on first 3 brightest sources

flux-ratio
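
For reference, a flux-weighted absolute percentage error of the kind plotted above can be computed along these lines (a sketch; the exact weighting used in the pipeline may differ, and the function name is hypothetical):

```python
import numpy as np

def flux_wape(true_flux, recovered_flux):
    # absolute percentage error per source, weighted by true flux
    ape = np.abs(recovered_flux - true_flux) / true_flux
    return 100 * np.sum(true_flux * ape) / np.sum(true_flux)

true = np.array([1.0, 0.5, 0.1])   # toy true source fluxes (Jy)
rec = np.array([0.9, 0.45, 0.12])  # toy recovered fluxes (Jy)
err = flux_wape(true, rec)
print(f"WAPE = {err:.3f} %")
```

Weighting by true flux means the bright sources dominate the score, so a few per cent of suppression on them shows up clearly even if faint sources are noisy.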

landmanbester commented on September 24, 2024

The kalcal plots are the same regardless of modelled flux percentage. Are you sure you didn't mess something up?

kwazzi-jack commented on September 24, 2024

Well, that's what I don't get. The models are definitely different, and hence so are the model visibilities.

In the dirty2ms step, the input model changes depending on the % of modelled sources.

landmanbester commented on September 24, 2024

There is definitely something weird going on. How can the minimum MSE for qcal be at a solution interval close to 50? Have you plotted it to see what that looks like?

kwazzi-jack commented on September 24, 2024

Quoting @landmanbester from Google chat:

If you sum the amplitudes in vis space it'll [% unmodeled source flux] be 30% off, not sure about phases.

So I did this to double-check if it was actually translating through when I generated my model visibilities with dirty2ms. All I did was calculate the amplitude for the 100% model visibilities, and then compared it to the 90% to 70% cases.

amp_100 = np.sum(np.abs(model_vis_100))
amp_90 = np.sum(np.abs(model_vis_90))
amp_80 = np.sum(np.abs(model_vis_80))
amp_70 = np.sum(np.abs(model_vis_70))

print("100% ->", amp_100 / amp_100 * 100, "%")
print("90% ->", amp_90 / amp_100 * 100, "%")
print("80% ->", amp_80 / amp_100 * 100, "%")
print("70% ->", amp_70 / amp_100 * 100, "%")

Output:

100% -> 100.0 %
90% -> 99.6713399887085 %
80% -> 99.0151047706604 %
70% -> 98.065584897995 %

So, based on this, that would mean there is no decrease in % modelled flux, and this could explain why the plots do not change in the end?

landmanbester commented on September 24, 2024

Yep, that's not right. Can you just try taking fraction * model, where fraction is the fraction of modelled flux? Also, I tried running your notebook again and noticed that kal-env is Python 2.7. Please update the venv to be > Python 3.7.
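
For what it's worth, the suggested fraction * model scaling does propagate exactly to the summed amplitudes, unlike dropping faint sources from the model (a sketch with random visibilities standing in for the real model columns):

```python
import numpy as np

rng = np.random.default_rng(42)
# stand-in for the 100%-model visibilities, shape (n_row, n_chan)
model_vis_100 = rng.standard_normal((1000, 1)) + 1j * rng.standard_normal((1000, 1))

ratios = {}
for fraction in (1.0, 0.9, 0.8, 0.7):
    model_vis = fraction * model_vis_100
    ratios[fraction] = np.abs(model_vis).sum() / np.abs(model_vis_100).sum()
    print(f"{fraction:.0%} -> {ratios[fraction] * 100:.4f} %")
```

Because |fraction * v| = fraction * |v| for every visibility, the amplitude ratio equals the flux fraction exactly, which matches the re-calculated output quoted below.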

kwazzi-jack commented on September 24, 2024

So I had already figured that out and had run it with fraction * model as you mentioned. The outcome did not seem to change much; I will post plots soon. I am confused as to what you were looking at with kal-env because:

welman@tina:~/masters/projects/flux_suppression$ source kal-env/bin/activate
(kal-env) welman@tina:~/masters/projects/flux_suppression$ python --version
Python 3.9.4

I'm busy compiling all the plots, and plots you asked to see before. Will post soon.

kwazzi-jack commented on September 24, 2024

Quoting @landmanbester from Google chat:

If you sum the amplitudes in vis space it'll [% unmodeled source flux] be 30% off, not sure about phases.

So I did this to double-check if it was actually translating through when I generated my model visibilities with dirty2ms. All I did was calculate the amplitude for the 100% model visibilities, and then compared it to the 90% to 70% cases.

amp_100 = np.sum(np.abs(model_vis_100))
amp_90 = np.sum(np.abs(model_vis_90))
amp_80 = np.sum(np.abs(model_vis_80))
amp_70 = np.sum(np.abs(model_vis_70))

print("100% ->", amp_100 / amp_100 * 100, "%")
print("90% ->", amp_90 / amp_100 * 100, "%")
print("80% ->", amp_80 / amp_100 * 100, "%")
print("70% ->", amp_70 / amp_100 * 100, "%")

Output:

100% -> 100.0 %
90% -> 99.6713399887085 %
80% -> 99.0151047706604 %
70% -> 98.065584897995 %

So, based on this, that would mean there is no decrease in % modelled flux, and this could explain why the plots do not change in the end?

So with this now, I re-calculated it with the fraction * (100% model) change and I get the expected:

Successful readonly open of default-locked table meerkat.ms: 41 columns, 1451520 rows
100% -> 100.0 %
90% -> 90.0000274181366 %
80% -> 80.00003099441528 %
70% -> 70.00002264976501 %

landmanbester commented on September 24, 2024

So I figured that already and had run with what you mentioned with fraction * model. The outcome did not seem to change much, I will post plots soon. I am confused as to what you were looking at with kal-env because:

welman@tina:~/masters/projects/flux_suppression$ source kal-env/bin/activate
(kal-env) welman@tina:~/masters/projects/flux_suppression$ python --version
Python 3.9.4

I'm busy compiling all the plots, and plots you asked to see before. Will post soon.

Same here:

(kal-env) ╭─bester@oates /net/tina/home/welman/masters/projects/flux_suppression
╰─➤  python --version
Python 2.7.17

But I am not sure if this is even supposed to work. I sourced your venv while logged in from a different machine.

kwazzi-jack commented on September 24, 2024

And if you run python3 --version instead? It could be that the main python link is hooked to python2 instead.

kwazzi-jack commented on September 24, 2024

Here is what I get with python2:

(kal-env) welman@tina:~/masters/projects/flux_suppression$ python2 --version
Python 2.7.17

landmanbester commented on September 24, 2024
(kal-env) ╭─bester@oates /net/tina/home/welman/masters/projects/flux_suppression
╰─➤  python3 --version
Python 3.6.9

I suspect the venv is not really activated somehow.

kwazzi-jack commented on September 24, 2024

Possibly because of personal profile links. No worries, you can make your own if you like. You just need the following:

cd /net/tina/home/welman/masters/projects/flux_suppression/
virtualenv -p python3.9 lb-env
source lb-env/bin/activate
pip install -e ../../packages/kal-cal ../../packages/pfb-clean
pip install quartical tqdm PyYaml

I think that should be it. Let me know if there are any errors with dependencies.

kwazzi-jack commented on September 24, 2024

So I've run it with the modified fraction * model and the corrections mentioned during our meeting. Here I set peak-flux = 1.0 Jy, noise ~ CN(0.0, 2), and I've extended the upper bound on the process-noise search from 0.1 to 1.0. Here are the plots from the notebook:

1. MSE on calibrated gains, quartical vs kalcal

gains-mse

2. Flux Ratio for first 3 brightest sources, quartical vs kalcal

flux-ratio

3. Flux Weighted Absolute Percentage Error, quartical vs kalcal

flux-wape

4. Gains-product plot, comparing time-interval solutions for quartical

quartical-gp-compare-tint

kwazzi-jack commented on September 24, 2024

I reran the simulation, to test the outcome of unmodelled sources, reducing noise. Here I set peak-flux = 1.0 Jy, noise ~ CN(0.0, 0.02). Here are the plots from the notebook:

  1. MSE on calibrated gains, quartical vs kalcal
    gains-mse

  2. Flux Ratio for first 3 brightest sources, quartical vs kalcal
    flux-ratio

  3. Flux Weighted Absolute Percentage Error, quartical vs kalcal
    flux-wape

  4. Gains-product plot, comparing time-interval solutions for quartical
    quartical-gp-compare-tint

  5. Gains-product plot, comparing process-noise solutions for kalcal
    kalcal-gp-compare-sigma_f

We can see the standard drift of the gains solutions in the gains-product plots. Plus, it seems to be appearing in the flux-ratio plots for kalcal.

Still not sure what is happening with quartical. At least it is showing that, when there is little noise, the smaller time-intervals perform better, which correlates with the MSE and Weighted Absolute Percentage Error plots too.

To add: each time I rerun the simulation, all generated items (ms, gains, fluxes, etc.) are deleted and the jupyter-notebook outputs are cleared.

kwazzi-jack commented on September 24, 2024

I reran the simulation, to test the outcome of unmodelled sources, reducing noise. Here I set peak-flux = 1.0 Jy, noise ~ CN(0.0, 0.02). Here are the plots from the notebook:

1. MSE on calibrated gains, quartical vs kalcal
   ![gains-mse](https://user-images.githubusercontent.com/20638068/168057922-73463aa5-6c97-44eb-a679-4d9ac6ac458d.png)

2. Flux Ratio for first 3 brightest sources, quartical vs kalcal
   ![flux-ratio](https://user-images.githubusercontent.com/20638068/168057959-1f990152-5e85-455d-84d1-96f2a68b04f2.png)

3. Flux Weighted Absolute Percentage Error, quartical vs kalcal
   ![flux-wape](https://user-images.githubusercontent.com/20638068/168058002-07099a32-56c6-4f94-99e3-537bdbac23a8.png)

4. Gains-product plot, comparing time-interval solutions for quartical
   ![quartical-gp-compare-tint](https://user-images.githubusercontent.com/20638068/168060897-43459bb4-4977-49f8-9918-3651a6a2a5d7.png)

5. Gains-product plot, comparing process-noise solutions for kalcal
   ![kalcal-gp-compare-sigma_f](https://user-images.githubusercontent.com/20638068/168061456-81a009d1-0e7a-4708-8f90-bd8edfc49800.png)

We can see the standard drift of the gains solutions in the gains-product plots. Plus, it seems to be appearing in the flux-ratio plots for kalcal.

Still not sure what is happening with quartical. At least it is showing that when there is little noise, the smaller time-intervals will perform better, correlating with MSE and Weight Absolute Percentage Error plots too.

To add, each time I rerun the simulation, all generate items (ms, gains, fluxes, etc.) are deleted and jupyter-notebook outputs are cleared.

So I've just noticed with the kalcal solutions that I did not set sigma_n correctly for its calibration run, so I'm redoing these plots and will edit in the corrected ones when it's done. quartical should be roughly the same.

kwazzi-jack commented on September 24, 2024

I have a question @landmanbester about this section in the imaging script:

...
# V = Jp int I kpq dl dm/n Jq.H
# V = G R mask x   G = Mueller term,  G = Jp Jq.H,  G.H G = Jq Jq.H Jp.H Jp
G = corrupt_vis(tbin_indices, tbin_counts, ANT1, ANT2,
                gains, np.ones_like(vis[..., None, None]))[:, :, 0]

# Sigma Inverse and Weights
S = 1/sigma_n**2
W = (S * G.conj() * G).real

# Sum of Weights
wsum = W.sum()  # builtin sum() on a 2D array returns an array, not a scalar

# x = (R.H G.H G R)inv R.H G.H V
dirty = ms2dirty(uvw=UVW, freq=FREQ, ms=S * G.conj() * vis,
                 npix_x=nx, npix_y=ny,
                 pixsize_x=cell_rad, pixsize_y=cell_rad,
                 epsilon=tol, nthreads=nthreads, do_wstacking=True)

def hess(x):
    tmp = dirty2ms(uvw=UVW, freq=FREQ, dirty=mask * x,
                   pixsize_x=cell_rad, pixsize_y=cell_rad,
                   epsilon=tol, nthreads=nthreads, do_wstacking=True)
    
    res = ms2dirty(uvw=UVW, freq=FREQ, ms=tmp, wgt=W,
                   npix_x=nx, npix_y=ny,
                   pixsize_x=cell_rad, pixsize_y=cell_rad,
                   epsilon=tol, nthreads=nthreads, do_wstacking=True)
    
    return mask * res

# Run pcg function to get recovered image
model_rec = pcg(hess, mask * dirty, x0=np.ones_like(dirty), tol=tol)
...

So I realized that I am not applying wsum to the output image. In particular, would it not have to be coupled with the ms2dirty calls? I.e.

dirty /= wsum

and:

return mask * res / wsum

Another question is about how the gradient descent is being done. The initial x0 is a dirty image that has the conjugate of G, divided by sigma_n squared, applied as weights to the visibilities, i.e. S * G.conj() <=> |g_p^* x g_q| / sigma_n^2. Is this to represent the G.H V part of the expression?

So why does the hessian function then not do the same in its ms2dirty step, but only applies W, where W = S * G.conj() * G <=> |g_p^* x g_q|^2 / sigma_n^2?

landmanbester commented on September 24, 2024

So I realized, that I am not applying wsum to the output image. In particular, would it not have to be coupled with the ms2dirty functions? I.e.

dirty /= wsum

and:

return mask * res / wsum

The normalisation by wsum just puts the dirty image in units of Jy/beam. You only need to worry about it if you want your fits files to be in those units, not for the optimisation. This normalisation is automatically applied during the inversion of the hessian operator.
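
A small sanity check of this point, using a dense SPD matrix as a stand-in for the hess operator: scaling both the dirty image and the hessian by 1/wsum leaves the recovered model unchanged, which is why the normalisation drops out of the optimisation (a sketch; all names are hypothetical):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 5))
A = A @ A.T + 5 * np.eye(5)    # SPD stand-in for the hess operator
x_true = rng.standard_normal(5)
dirty = A @ x_true             # analogue of the dirty image R.H G.H S V
wsum = 1234.5

x_unnormalised = np.linalg.solve(A, dirty)
x_normalised = np.linalg.solve(A / wsum, dirty / wsum)
```

Both solves return the same model, so dividing the dirty image by wsum only matters for the units of the images you write out.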

Another question is how the gradient descent is being done. So the initial x0 is a dirty image, that has the conjugate G divided by sigma_n column squared applied as weights to visibilities, i.e. S * G.conj() <=> |g_p^* x g_q| / sigma_n^2. This is to represent the G^H V part of the expression?

So why does the hessian function then not do this in the ms2dirty step, but only applies W, which is W = S * G.conj() * G <=> |g_p^* x g_q|^2 / sigma_n^2?

This is not gradient descent, it's the conjugate gradient algorithm. They work fundamentally differently. Just look at the math expression in the comment. The dirty image is R.H G.H V (and now also with S to account for the noise level). The operator you need to invert is the R.H G.H G R part (again with S), or what I've called hess. The whole point of the conjugate gradient algorithm is that you can invert this operator implicitly, i.e. you don't need the whole matrix in memory, just the action of that matrix on a vector.
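
A minimal sketch of that idea, with a tiny SPD matrix standing in for the R.H G.H G R operator (the actual pcg in pfb-clean adds preconditioning and stopping logic; this only illustrates the matrix-free inversion):

```python
import numpy as np

def cg(hess, b, x0, tol=1e-10, maxit=500):
    # conjugate gradient: inverts the operator implicitly, using only
    # its action hess(v) on a vector -- no matrix is ever stored
    x = x0.copy()
    r = b - hess(x)
    p = r.copy()
    rs = r @ r
    for _ in range(maxit):
        hp = hess(p)
        alpha = rs / (p @ hp)
        x = x + alpha * p
        r = r - alpha * hp
        rs_new = r @ r
        if np.sqrt(rs_new) < tol:
            break
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x

A = np.array([[4.0, 1.0], [1.0, 3.0]])  # toy SPD operator
b = np.array([1.0, 2.0])                # toy "dirty image"
x = cg(lambda v: A @ v, b, x0=np.zeros(2))
```

In the imaging script the lambda is replaced by the hess function built from dirty2ms/ms2dirty, but the solver only ever needs that operator-on-a-vector action.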

Something is still very wrong with these results; we need to keep digging.

kwazzi-jack commented on September 24, 2024

So, I have fine-tuned the 100-source case to match the solution interval paper and what was suggested on Tuesday. It seems to be working, but I am cautious and would like to know what you all think.

I have the following simulation setup (the details are not strictly necessary; I just do not want to lose this setup, and the plots should be enough):

  • (n_time, n_ant, n_chan, n_dir, n_corr) = (720, 64, 1, 1, 1)

  • True gains, drawn from a squared exponential function with sigma=0.2, length=100, and the mean adjusted to sit around 0.75:
    true-gains

  • fov = 0.5'

  • n_sources = 100

  • Pareto distribution set to alpha=2

  • (Auto-generated) n_pixels = 480x480

  • (Auto-generated) cell_size = 3.85asec

  • Skymodel % modelled sources, by % flux = [100%, 70%, 50%, 30%]

  • Simulated skymodel, with a buffer between sources guaranteeing > 5 pixels of separation, to make sure the flux-finder script does not select the wrong source and that sources do not overlap in wsclean
    generated-models

  • sigma_n = 1.0

  • noise in visibility space = N(0, sigma_n**2) + 1.0j*N(0, sigma_n**2)

  • Random seed fixed

  • Weights = 1.0/(2.0 * sigma_n**2) = 0.5

  • (For quartical) Sigma_n = sqrt(2) * sigma_n = sqrt(2)

  • kalcal calibration, with log-search from sigma_f = 1e-4 to 1e-1, over 32 points

  • quartical calibration, with custom-search from tint = 1 to 24, 48 to 240 (step 48), 360, 480, 720 (32 points)

  • Over all solutions, image with wsclean:

    • weight = briggs 0.0
    • scale = 3.85asec
    • size = 480x480
    • niter = 1e6
    • threshold = 2.0 * RMS(Noise-Image) = 2.83
    • Weight-spectrum = |gains|^2 / weight
  • Used a flux-finder script similar to Ulrich's from the solution interval paper: using the known coordinates of a source in the model image, select a region 10 pixels in size with the coordinates at the centre, then take the brightest pixel in that region as the source flux.
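
The flux-finder step in the last bullet can be sketched roughly as follows (function and argument names are hypothetical; it assumes a 2D image array and known pixel coordinates):

```python
import numpy as np

def find_source_flux(image, x, y, box=10):
    # brightest pixel in a box of `box` pixels centred on (x, y)
    half = box // 2
    region = image[max(0, y - half):y + half + 1,
                   max(0, x - half):x + half + 1]
    return region.max()

img = np.zeros((480, 480))   # toy 480x480 image
img[170, 240] = 0.8          # toy 0.8 Jy point source
flux = find_source_flux(img, x=240, y=170)
```

The > 5 pixel buffer between simulated sources guarantees that each box contains only one source, so the brightest pixel is unambiguous.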

Results

1. MSE on calibrated gains, quartical vs kalcal

gains-mse

2. Flux Ratio for first 4 brightest sources, quartical vs kalcal

flux-ratio

Mean-line: Average flux ratio of the first 25 brightest sources.

3. Flux Weighted Absolute Percentage Error, over all sources, quartical vs kalcal

flux-wape

4. Gains-product plot, comparing time-interval solutions for quartical

quart-gp

5. Gains-product plot, comparing process-noise solutions for kalcal

kalcal-gp

6. Flux-difference plot, showing difference, per source, recovered_flux - true_flux, for quartical

quart-diff

7. Flux-difference plot, showing difference, per source, recovered_flux - true_flux, for kalcal

kalcal-diff

I have yet to try the 2-source case, but I would use the same setup with a smaller fov. From my perspective, quartical is behaving as discussed on Tuesday. Even though it is hard to see, if I limit the log-search per % modelled source configuration, kalcal produces comparable results, possibly even better than quartical's, which matches the main pipeline.

@landmanbester in light of it working, I tried the fluxtractor.py script again; the results are marginally better, but still too good for quartical. I will produce a comparison for next week's meeting.

landmanbester commented on September 24, 2024

Well, this makes more sense. I suggest modifying the search space for kalcal to cover sigma_f in [1e-7, 1e-2]; it's clearly not at the optimum in terms of minimum MSE. As for the fluxtractor script, I'm pretty sure something weird is happening there, but let's not worry about it for now; just get the results you need and write them up asap.
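
The widened grid could be built with numpy, assuming the same 32-point log-search as before:

```python
import numpy as np

# 32 log-spaced sigma_f trial values covering [1e-7, 1e-2]
sigma_fs = np.logspace(-7, -2, 32)
print(sigma_fs[0], sigma_fs[-1])
```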

kwazzi-jack commented on September 24, 2024

After the meeting, before trying to image the model columns with wsclean, I tried the fluxtractor script again (as it would be the more accurate result to show). I also plotted in a similar fashion to fig. 1 of the solution interval paper, showing the brightest modelled and brightest unmodelled source.

1. Mean Square Error plot, kalcal vs quartical

gains-mse

2. Weighted Average Absolute Percentage Error, kalcal vs quartical, using the flux-finder script (with wsclean)

flux-wape-fluxfinder

3. Weighted Average Absolute Percentage Error, kalcal vs quartical, using the flux-tractor script (pcg from pfb-clean)

flux-wape-fluxtractor

4. Flux Ratio plot of brightest modelled and unmodelled source, kalcal vs quartical, using the flux-finder script (with wsclean)

flux-ratio-fluxfinder

5. Flux Ratio plot of brightest modelled and unmodelled source, kalcal vs quartical, using the flux-tractor script (pcg from pfb-clean)

flux-ratio-fluxtractor

landmanbester commented on September 24, 2024

Interesting. What has changed? Just the way you simulate the gains?

kwazzi-jack commented on September 24, 2024

Yeah, just the gains. Should I stick with fluxtractor then?

landmanbester commented on September 24, 2024

Well, it would be better if you can get sensible results using fluxtractor, as it eliminates issues stemming from imperfect deconvolution. But I am really perplexed that just changing the gains can make such a huge difference. In principle, QuartiCal can fit these gains better than the more physical ones you were using, so it's weird that the fluxtractor results are suddenly making more sense. I say more sense because there is still something I don't understand, i.e. why the minimum MSE does not coincide with the least flux suppression. Maybe there is something fundamental I am not getting. But in any case, these results make some sort of sense, so I suggest moving forward and starting to put them in your thesis.

kwazzi-jack commented on September 24, 2024

Okay, will do, thank you!
