AIPS HELP file for FINDR in 31DEC24

As of Fri Jul 26 17:22:47 2024

FINDR: Find normal and extreme amplitudes for uv data set

INPUTS

INNAME                             Input UV data
INCLASS                            Input UV data (class)
INSEQ                              Input UV data (seq. #)
INDISK                             Input UV data disk drive #
SOURCES                            Source (pointings) list
QUAL            -10.0              Source qualifier -1=>all
CALCODE                            Calibrator code '    '=>all
TIMERANG                           Time range to process.
SELBAND                            Bandwidth to select (kHz)
SELFREQ                            Frequency to select (MHz)
FREQID                             Freq. ID to select.
BIF               0.0      100.0   Lowest IF number 0=>all
EIF               0.0      100.0   Highest IF number 0=>all
BCHAN             0.0     2048.0   Lowest channel number 0=>all
ECHAN             0.0     2048.0   Highest channel number 0=>all
SUBARRAY          0.0     1000.0   Subarray, 0=>all
DOCALIB          -1.0      101.0   > 0 calibrate data & weights
                                   > 99 do NOT calibrate weights
GAINUSE                            CL/SN table to apply
DOPOL            -1.0       10.0   If >0 correct polarization.
PDVER                              PD table to apply (DOPOL>0)
BLVER                              BL table to apply.
FLAGVER                            Flag table version
DOBAND           -1.0       10.0   If >0 apply bandpass cal.
                                   Method used depends on value
                                   of DOBAND (see HELP file).
BPVER                              Bandpass table version
SMOOTH                             Smoothing function. See
                                   HELP SMOOTH for details.
UVRANGE         0.                 Min & max baseline (klambda)
SOLINT                             Time interval (sec)
SCANLENG                           Scan length (sec)
REFANT                             Reference antenna for VDIF
OPTYPE                             'VDIF', 'VRFI', 'TIME'
DOROBUST         -1.0        1.0   > 0 -> robust spectral avg
DOCAT            -1.0        1.0   > 0 => keep the XX file
DOCRT            -3.0     132.0    > 0 -> use the terminal,
                                   > 72 => terminal width
                                   0 -> no printing
                                   else use the line printer
OUTPRINT
                                   Printer disk file to save
PRTLEV                             > 0 => debug messages
BADDISK           -1.0      1000.0 Disks to avoid for scratch.
ARRAY2   @                         Answers from first source:
         @                         ARRAY2(1-5,1) robust averages
         @                         ARRAY2(6-10,1) rms of robust
         @                            averages
         @                         ARRAY2(11-20,1) fraction of
         @                            closure error by tenths.
         @                         ARRAY2(1-5,2) # > 10 sigma
         @                         ARRAY2(6-10,2) # > 20 sigma
         @                         ARRAY2(11,2) # orphans
         @                         ARRAY2(12,2) total # samples

HELP SECTION

FINDR
Type:  Task
Use:   This task computes the average weight, visibility amplitude and
       rms fluctuations in each baseline and correlator over a
       specified range of spectral channels and time.  It then
       converts the baseline-based quantities into apparent
       antenna-based quantities.  Finally, using robust averaging, it
       determines the "normal" amplitude, weight, and rms and
       deviations in these quantities.  It returns the normal values
       and some estimates of the extremes found in the selected subset
       of the data.
       The VDIF OPTYPE computes the average weight and complex
       visibility in each baseline and correlator over a specified
       range of spectral channels and time.  It then converts the
       baseline-based quantities into apparent antenna-based
       quantities.  Then it determines the amplitude of the vector
       difference between each visibility and the median of the
       visibilities in a window (in time) surrounding the visibility.
Adverbs:
  INNAME.....Input UV data file
  INCLASS....Input UV data file (class).      Standard defaults.
  INSEQ......Input UV data file (seq. #).     0 => highest.
  INDISK.....Input UV data file disk drive #. 0 => any.
  SOURCES....List of sources (pointings) to be processed.
             '*' or blank = all; a "-" before a source name means all
             except ANY source named.
  QUAL.......Only sources with a source qualifier number in the SU
             table matching QUAL will be used if QUAL is not -1.
  CALCODE...."sources" may be selected on the basis of the calibrator
             code given in the SU table.
                  '    ' => any calibrator code selected
                  '*   ' => any non blank code (cal. only)
                  '-CAL' => blank codes only (no calibrators)
             NB: The CALCODE test is applied in addition to the other
             tests, i.e. SOURCES and TIMERANG, in the selection of
             sources to process.  CALCODE affects only the selection
             of calibrators.
  TIMERANG...Time range of the data to be processed. In order: Start
             day, hour, min. sec, end day, hour, min. sec. Days
             relative to reference date.
  SELBAND....Bandwidth of data to be selected. If more than one IF is
             present SELBAND is the width of the first IF required.
             Units = kHz. For data which contain multiple
             bandwidths/frequencies the task will insist that some
             form of selection be made by frequency or bandwidth.
  SELFREQ....Frequency of data to be selected. If more than one IF is
             present SELFREQ is the frequency of the first IF
             required. Units = MHz.
  FREQID.....Frequency identifier to select (you may determine which
             is applicable from the OPTYPE='SCAN' listing produced by
             LISTR). If either SELBAND or SELFREQ are set, their
             values overide that of FREQID.  However, setting SELBAND
             and SELFREQ may result in an ambiguity.  In that case,
             the task will request that you use FREQID.
  DOCALIB....If true (>0), calibrate the data using information in the
             specified Cal (CL) table for multi-source or SN table for
             single-source data.  Also calibrate the weights unless
             DOCALIB > 99 (use this for old non-physical weights).
  BIF........First IF to select. 0=>all.
  EIF........Highest IF to select. 0=>all higher than BIF
  BCHAN......First channel to select. 0=>all.
  ECHAN......Highest channel to select.  The spectral channels are
             used as separate samples of the visibility to provide
             better statistics in determining the rms.
  SUBARRAY...Subarray number to consider. 0=>1.
  GAINUSE....CL table version number to apply.  0=> highest.
  DOPOL......If > 0.5 then correct data for instrumental polarization
             as represented in the AN or PD table.  This correction is
             only useful if PCAL has been run or feed polarization
             parameters have been otherwise obtained.  See HELP DOPOL
             for available correction modes: 1 is normal, 2 and 3 are
             for VLBI.  1-3 use a PD table if available; 6, 7, 8 are
             the same but use the AN (continuum solution) even if a PD
             table is present.
  PDVER......PD table to apply if PCAL was run with SPECTRAL true and
             0 < DOPOL < 6.  <= 0 => highest.
  BLVER......Version number of the baseline based calibration (BL)
             table to appply. <0 => apply no BL table,  0 => highest.
  FLAGVER....Specifies the version of the flagging table to be
             applied.  0 => highest numbered table.  <0 => no flagging
             to be applied.  A new FG table is created to hold the new
             flags; if FLAGVER >= 0, the FG table used on the data is
             copied into the output file before the new flags are
             computed.
  DOBAND.....If true (>0) then correct the data for the shape of the
             antenna bandpasses using the BP table specified by BPVER.
             The correction has five modes:
             (a) if DOBAND=1 all entries for an antenna in the table
             are averaged together before correcting the data.
             (b) if DOBAND=2 the entry nearest in time (including
             solution weights) is used to correct the data.
             (c) if DOBAND=3 the table entries are interpolated in
             time (using solution weights) and the data are then
             corrected.
             (d) if DOBAND=4 the entry nearest in time (ignoring
             solution weights) is used to correct the data.
             (e) if DOBAND=5 the table entries are interpolated in
             time (ignoring solution weights) and the data are then
             corrected.
  BPVER......Specifies the version of the BP table to be applied
  SMOOTH.....Specifies the type of spectral smoothing to be applied to
             a uv database . The default is not to apply any smoothing.
             The elements of SMOOTH are as follows:
             SMOOTH(1) = type of smoothing to apply: 0 => no smoothing
               To smooth before applying bandpass calibration
                 1 => Hanning, 2 => Gaussian, 3 => Boxcar, 4 => Sinc
               To smooth after applying bandpass calibration
                 5 => Hanning, 6 => Gaussian, 7 => Boxcar, 8 => Sinc
             SMOOTH(2) = the "diameter" of the function, i.e. width
               between first nulls of Hanning triangle and sinc
               function, FWHM of Gaussian, width of Boxcar. Defaults
               (if < 0.1) are 4, 2, 2 and 3 channels for SMOOTH(1) =
               1 - 4 and 5 - 8, resp.
             SMOOTH(3) = the diameter over which the convolving
               function has value - in channels.  Defaults: 1,3,1,4
               times SMOOTH(2) used when input SMOOTH(3) < net
               SMOOTH(2).
  UVRANGE....(Minimum,Maximum) baseline (kilo-lambda) to process.
  SOLINT.....The interval over which the rms is determined.
             0 => 60 seconds.
  SCANLENG...Length of "scan" for VDIF in seconds.  The vector average
             of the visibilities over the scan is differenced from the
             individual visibility to make the "vector difference".
  REFANT.....Reference antenna suggestion for VDIF.
  OPTYPE.....Choice of methods for examining the data:
             'TIME' -> (default)
                       It computes the baseline amplitudes and rmses
                       in each SOLINT and converts them to antenna-
                       based values.  A robust method is used and
                       closure failure statistics maintained.  It then
                       saves them for later.  When all data have been
                       read, the antenna-based results are analyzed,
                       source by source, antenna by antenna.
                       Regression is used to compute the mean antenna
                       amplitude and mean rms and the rmses in these
                       parameters.  Outliers are then counted and
                       reported.  Statistics on the failure statistics
                       are also reported.
             'VDIF' -> It computes the baseline reals, imaginaries,
                       and weights in each SOLINT and converts then to
                       antenna-based values.  A robust method is used
                       and closure failure statistics maintained.  It
                       saves them for later.  When all data have been
                       read, the antenna-based results are analyzed
                       source by source, correlator by correlator.
                       First the visbility at each time is differenced
                       with the average of the visibilities
                       surrounding it.  The weights, amplitudes, and
                       amplitudes of the differences are then averaged
                       and reported.  Outliers are counted and
                       reported.  Statistics on the failure statistics
                       are also reported.
             'VRFI' -> It computes the baseline reals, imaginaries,
                       and weights in each SOLINT and saves as many as
                       it can for later.  When the source changes, the
                       buffer is full, or all data have been read, the
                       baseline-based results are analyzed correlator
                       by correlator.  First the visbility at each
                       time is differenced with the average of the
                       visibilities surrounding it.  The weights,
                       amplitudes, and amplitudes of the differences
                       are then averaged and reported.  Outliers are
                       counted and reported.
  DOROBUST...> 0 => Do robust averaging of the spectral channels in
                    each time and baseline.  Normal averaging is done
                    over time.
             <= 0 => Normal averaging (all samples included) is done
                     over all channels as well as time.
  DOCAT......> 0 => keep the XX extension file generated by the TIME
             OPTYPE.  Otherwise delete it on exit.
  DOCRT......Zero means write the flag table without comment.
             False (< 0) use the line printer if OUTPRINT = ' ' else
                   write named OUTPRINT file only.  When OUTPRINT is
                   not blank, DOCRT=-2 suppresses the page-feed
                   character on page headers and DOCRT=-3 suppresses
                   page headers and most other header information.
             True  (> 0) use the terminal interactively.  The task will
                   use the actual terminal width as a display limit
                   unless 72 < DOCRT < width.  In that case, the display
                   limit will be DOCRT characters.
  OUTPRINT...Disk file name in which to save the line printer output.
             ' ' => use scratch and print immediately for interactive
             jobs - batch jobs use OUTPRINT = 'PRTFIL:BATCHjjj.nnn'
             (jjj= job #, nnn = user #).  When OUTPRINT is not blank,
             multiple outputs are concatenated, and the file is not
             actually printed.
  PRTLEV.... Set > 0 to get various messages:
             1  Print the robust averages over time for each antenna,
                IF, and polarization and print the number of samples
                that would be self flagged and the number that would
                be flagged using the global averages.
             2. Also print the antenna-based amplitudes, rms's, and
                weights for each antenna, IF, polarization, and time.
                (You could keep the XX table and use PRTAB but this
                may be easier and neater.)
             3. Also print failure information from the routine that
                converts baseline-based data to antenna-based.
             4. Also print the antenna-based solutions from the
                solver.  This is for desperate programmers mainly.
  BADDISK....This array contains the numbers of disks on which it is
             desired that scratch files not be located.
Output adverbs:
  ARRAY2.....(We will probably rename this someday but for now we use
             this already known scratch adverb).  Using the first
             source (or the average of all sources with GAIN)
             From the robust average of robust antenna averages
             ( 1,1)  Vector-averaged amplitude or gain
             ( 2,1)  Vector-averaged data rms or amplitude of the
                     vector difference on VDIF/VRFI
             ( 3,1)  Scalar-averaged amplitude or gain
             ( 4,1)  Scalar-averaged data rms
             ( 5,1)  Weight
             ( 6,1)  RMS in Vector-averaged amplitude or gain
             ( 7,1)  RMS in Vector-averaged data rms or amplitude of
                     the vector difference on VDIF/VRFI
             ( 8,1)  RMS in Scalar-averaged amplitude or gain
             ( 9,1)  RMS in Scalar-averaged data rms
             (10,1)  RMS in Weight
             (11,1)  Fraction of data with closure failure fraction
                     > 0, < 0.1
             (12,1)  Fraction of data with closure failure fraction
                     >= 0.1, < 0.2
             ...
             (20,1)  Fraction of data with closure failure fraction
                     >= 0.9, <= 1.0
             Number of antenna time averages that differ from the
             overall average (ARRAY2(I,1)) in parameter I (I=1-5)
             by more than N times the RMS (ARRAY2(5+I,1)) in that
             parameter:
             ( 1,2)  N = 10 * RMS in Vector-averaged amplitude/gain
             ( 2,2)  N = 10 * RMS in Vector-averaged data rms/Vdif
             ( 3,2)  N = 10 * RMS in Scalar-averaged amplitude/gain
             ( 4,2)  N = 10 * RMS in Scalar-averaged data rms
             ( 5,2)  N = 10 * RMS in Weight
             ( 6,2)  N = 20 * RMS in Vector-averaged amplitude/gain
             ( 7,2)  N = 20 * RMS in Vector-averaged data rms/Vdif
             ( 8,2)  N = 20 * RMS in Scalar-averaged amplitude/gain
             ( 9,2)  N = 20 * RMS in Scalar-averaged data rms
             (10,2)  N = 20 * RMS in Weight
             (11,2)  Number of one-sample "averages" for which there
                     is no rms = number of orphans
             (12,2)  Total number of time averages * Number IFs *
                     Number polarizations
             ARRAY2(3,1), (4,1), (8,1), (9,1) (3,2), (4,2), (8,2),
             (9,2), and (11,2) are zero on VDIF and VRFI.

EXPLAIN SECTION

Explanation of the methods used


FINDR starts by averaging the calibrated and flagged data over a
period of SOLINT seconds.  Data with a time difference <= SOLINT are
included.  Thus, a SOLINT of 60 with records every 10 seconds will
include 7 records; to get only 6 set SOLINT to something > 50 and < 60
such as 55.  The data from each included antenna pair, IF, and
polarization are kept separate, but the spectral channels from BCHAN
through ECHAN are averaged together.

For OPTYPEs other than VDIF and VRFI:  During the averaging, the
apparent rms and average weight are found including all data (no
outlier rejection).  The rms and amplitude are averaged by both vector
and scalar means.

For OPTYPEs VDIF and VRFI: The averaging is done to determine the
"individual" records to be compared with the "scan" averages of such
records in a region of time centered on the individual records.
Typically, with decent signal-to-noise, one will set SOLINT for VDIF
to less than the time between records.  The "scan" averaging is done
over the scan interval using parameter SCANLENG and is done by finding
the median real and imaginary parts which should be less sensitive to
outliers.

Failures in antenna-based solutions are marked as such and regarded as
data that would be flagged by FLAGR.

===============
OPTYPE 'TIME'     The default.

The task converts the average amplitudes, rms's, and weights of the
antenna pairs into antenna-based values using the assumption that

Amp(A1,A2,I,P) = SQRT (Amp(A1,I,P) * Amp(A2,I,P))
Sig(A1,A2,I,P) = SQRT (Sig(A1,I,P) * Sig(A2,I,P))
Wt(A1,A2,I,P)  = SQRT (Wt(A1,I,P)  * Wt(A2,I,P))

It uses robust methods (outliers ignored) to compute the antenna-based
values and then counts the fraction [Fr(A,I,P)] of baselines for each
antenna that are more than 3 times the rms away from the final
solution.

The values of Amp(A,I,P), Sig(A,I,P), Wt(A,I,P) and Fr(A,I,P) are then
written to an XX table (which may be saved by setting DOCAT > 0).
When all of the data have been read and averaged and the antenna-based
results written to the XX table, these results are read back into
memory.

One source at a time, the task determines the robust average of each
of the five parameters (vector amplitude, vector rms, scalar
amplitude, scalar rms, weight) and the robust rms's in each of these
robust averages for each antenna, IF, and polarization.  By "robust"
average, we mean an average determined by repeated passes through the
data with the averaging process discarding outlier points.  The robust
average of these robust averages is then determined and printed in the
message window and returned in ARRAY2(I,1).

For each source and one antenna, IF, and polarization at a time, the
task loops over time counting the "questionable" data.  A sample is
questionable if

ABS(Par(A,I,P,t)-AvPar(A,I,P)) > N * AvParrms(A,I,P))

where Par(A,I,P,t) is the parameter value of antenna A at IF I,
polarization P, averaged at time t; AvPar(A,I,P) is the robust average
over time of that antenna, IF, and polarization, AvParrms(A,I,P) is
the uncertainty in that average over time.  Two values of N are used -
5 and 10 and the numbers of questionable samples are reported (if
PRTLEV >= 1).

The process is then repeated to compare the SOLINT-average samples in
each parameter with the global (over all antennas, IFs, polarizations)
averages.  For each source and one antenna, IF, and polarization at a
time, the task loops over time counting the "questionable" data.  A
sample is questionable if

ABS(Par(A,I,P,t)-GlobalAvPar) > N * GlobalAvParrms(A,I,P))

where Par(A,I,P,t) is the parameter value of antenna A at IF I,
polarization P, averaged at time t; GlobalAvPar is the robust average
over antenna, IF, and polarization of the robust averages over time;
and GlobalAvParrms is the uncertainty in that average over A, I, and
P.   The values of N used are 10 and 20 and the numbers of
questionable samples are reported (if PRTLEV >= 1) and are returned in
ARRAY2(1-10,2).

The task also computes the fraction of failure fractions having value
exactly 0 and 1 and in bins of width 0.1.  These are reported and the
binned values returned in ARRAY2(11,1) through ARRAY2(20,1).

===============
OPTYPE 'VDIF'

The task converts the average weights and complex visibilities of the
antenna pairs into antenna-based values using the assumption that

Amp(A1,A2,I,P) = SQRT (Amp(A1,I,P) * Amp(A2,I,P))
Phs(A1,A2,I,P) = Phs(A1,I,P) - Phs(A2,I,P)
Wt(A1,A2,I,P)  = SQRT (Wt(A1,I,P)  * Wt(A2,I,P))

It uses robust methods (outliers ignored) to compute the antenna-based
values and then counts the fraction [Fr(A,I,P)] of baselines for each
antenna that are more than 2.5 times the rms away from the final
solution.

The values of Amp(A,I,P), Phs(A,I,P) (as real and imaginary),
Wt(A,I,P), Fr(A,I,P), and reference antennas (I,P) are then written to
an XX table (which may be saved by setting DOCAT > 0).  When all of
the data have been read and averaged and the antenna-based results
written to the XX table, these results are read back into memory.  If
needed, the data are put on a consistent reference antenna basis.

One source at a time, the task then loops over each antenna, IF, and
polarization converting the time sequence of reals and imaginaries
into time sequences of amplitudes and amplitudes of the "vector
difference" between the current antenna visibility and the vector
average of antenna visibilities within SCANLENG/2 of the current
time.  Note that, in this case, the averages are generated by finding
the median value of the reals and the median value of the imaginaries.
This should be somewhat less succeptible to outliers than straight
averaging and better than robust averaging as well for the small
number of samples which will be normal.

Once source at a time, the task determines the robust averages and
rmses of the amplitudes, amplitudes of the vector differences, and the
weights over all antennas, IFs, and polarizations and reports them.
It then reports the number of samples more than 10 and 20 times this
rms from the mean for each antenna, IF, and polarization.  The
averages, rmses, and total number of deviant points are returned to
AIPS in ARRAY2 for the first source.

The task also computes the fraction of failure fractions having value
exactly 0 and 1 and in bins of width 0.1.  These are reported and the
binned values returned in ARRAY2(11,1) through ARRAY2(20,1).


===============
OPTYPE 'VRFI'

The task accumulates SOLINT averages in memory until the source
changes, the data ends, or the number of times that it can hold is
exhausted.  For each time, the task then loops over each antenna pair,
IF, and polarization finding the amplitude of the "vector difference"
between the current visibility and the vector average of visibilities
for that antenna pair within SCANLENG/2 of the current time.  Note
that, in this case, the averages are generated by finding the median
value of the reals and the median value of the imaginaries.   This
should be somewhat less succeptible to outliers than straight
averaging and better than robust averaging as well for the small
number of samples which will be normal.

The task determines the robust averages and rmses of the visibility
amplitudes, the amplitudes of the vector differences, and the weights
and reports them.  It also counts the samples that are more than 10
and 20 times the rms away from the robust averages and reports them.

The task then loops for another source or buffer load as appropriate.
The averages of the averages and rmses and the sums of the number of
deviations are returned to AIPS in ARRAY2.

AIPS