Ilya Selyuzhenkov September 02, 2008
Data sets:
Shower shape fitting procedure:
Fitting function "[0]*(exp ( -0.5*((x-[1])/[2])**2 )+[3]*exp ( -0.5*((x-[4])/[5])**2 )+[6]*exp ( -0.5*((x-[7])/[8])**2 ))"
Figure 1: MC gamma-jet shower shapes and fits for u-plane
Results from single, double and triple Gaussian fits (using from 9 to 15 strips) are shown.
Figure 2: Same as figure 1. but from v-plane
Figure 3: MC gamma-jet results using triple Gaussian fits within 12 strips from a peak.
Left: u-plane. Right: v-plane
Figure 4: Combined fit results from MC gamma-jet sample
Figure 5: Fitting parameters [see equation for the fit function above].
Note, that parameters 1, 4, and 7 (peak position) has the same value.
Numerical fit results:
Figure 6: Same as Fig. 3, but for gamma-jet candidates from pp2006 data
Figure 7: Same as Fig. 5, but for gamma-jet candidates from pp2006 data
Ilya Selyuzhenkov September 09, 2008
Data sets:
Procedure to calculate maximum sided residual:
For each event fit SMD u and v energy distributions with
triple Gaussian functions from shower shapes analysis:
[0]*(exp(-0.5*((x-[1])/[2])**2)+[3]*exp(-0.5*((x-[1])/[4])**2)+[6]*exp(-0.5*((x-[1])/[5])**2))
Fit parameters sorted by various pre-shower conditions and u and v-planes can be found here
There are only two free parameters in a final fit: overall amplitude [0] and mean value [1]
Fit range is +-2 strips from the high strip (5 strips total).
Integrate energy from a fit within +-2 strips from high strip.
This is our peak energy from fit, F_peak.
Calculate tail energies on left and right sides from the peak for both data, D_tail, and fit, F_tail.
Tails are integrated up to 30 strips excluding 5 highest strips.
Determine maximum difference between D_tail and F_tail:
max(D_tail-F_tail). This is our maximum sided residual.
Plot F_peak vs. max(D_tail-F_tail). This is sided residual plot.
(implementation for this item is in progress)
Based on MC gamma-jet sided residual plot find a line (some polynomial function)
which will serve as a cut to separate signal and background.
Use that cut line to calculate signal to background ratio
and apply it for the real data analysis.
Figure 1: Maximum sided residual plots for different data sets and various pre-shower condition.
Columns [data sets]: 1. MC QCD background; 2. gamma-jet; 3. pp2006 data
Rows [pre-shower bins]: 1. pre1=0 pre2=0; 2. pre1=0, pre2>0; 3. 0<pre1<4MeV; 4. 4<pre1<10MeV
Results from u and v plane are combined as [U+V]/2
Figure 2: max(D_tail-F_tail) distribution (projection on horizontal axis from Fig.1)
Some observations:
Results for pp2006 and MC gamma-jet are consistent for pre1=0 pre2=0 case (upper left plot)
Results for pp2006 and MC QCD background jets are also in agrees for pre1>0 case (lower left and right plots)
Figure 3: F_peak distribution (projection on vertical axis from Fig.1)
Ilya Selyuzhenkov September 16, 2008
These results are obsolete.
Please use this link instead
Data sets:
Notations used in the plots:
Figure 1: D_peak from [U+V]/2.
Figure 2: U/V asymmetry for D_peak: [U-V]/[U+V]
Figure 3: F_peak from [U+V]/2.
Figure 4: U/V asymmetry for F_peak: [U-V]/[U+V]
Figure 5: (D_peak - F_peak)/D_peak asymmetry
Ilya Selyuzhenkov September 23, 2008
Data sets:
Notations used in the plots:
Figure 1: D_peak from [U+V]/2.
Figure 2: (D_peak - F_peak)/D_peak asymmetry
Ilya Selyuzhenkov September 23, 2008
Figure 1: D_peak vs. [right-left] D_tail
Ilya Selyuzhenkov September 23, 2008
Data sets:
Notations used in the plots:
Figure 1: Maximum sided residual plot
Top get more statistics for MC QCD sample plot is redone with a softer R_cluster > 0.9 cut
Figure 2: D_peak (projection on vertical axis for Fig. 1)
Upper left plot (no pre-shower fired case) reveals some difference
between MC gamma-jet and pp2006 data at lower D_peak values.
This difference could be due to background contribution at low energies.
Still needs more statistics for MC QCD jet sample to confirm that statement.
Figure 3: max(D_tail-F_tail) (projection on horisontal axis for Fig. 1)
One can get an idea of signal/background separation (red vs. black) depending on pre-shower condition.
Figure 4: Mean < max(D_tail-F_tail) > vs. D_peak (profile on vertical axis from Fig. 1)
For gamma-jet sample average sided residual is independent on D_peak energy
and has a slight positive shift for all pre-shower>0 conditions.
For large D_peak values (D_peak>0.16) MC gamma-jet and pp2006 data results are getting close to each other.
This corresponds to higher energy gammas, where we have a better signal/background ratio,
and thus more real gammas among gamma-jet candidates from pp2006 data.
(Note: legend's color coding is wrong, colors scheme is the same as in Fig. 3)
Figure 5: Mean < D_peak > vs. max(D_tail-F_tail) (profile on horisontal axis from Fig. 1)
For "no-preshower fired" case MC gamma-jet sample has a large average values than that from pp2006 data.
This reflects the same difference between pp2006 and MC gamma-jet sample at small D_peak values (see Fig. 2, upper left plot).
(Note: legend's color coding is wrong, colors scheme is the same as in Fig. 3)
Figure 7: D_peak vs. gamma 3x3 tower cluster energy
Figure 8: 3x3 cluster tower energy distribution
Figure 9: Gamma pt distribution
The simplest way to get signal/background separation is to draw a straight line
on sided residual plot (Fig. 1) in such a way that
it will contains most of the counts (signal) on the left side,
and use a distance to that line for both MC and pp2006 data samples
as a discriminant for signal/background separation.
To get the distance to the straight line one can rotate sided residual plot
by the angle which corresponds to the slope of this line,
and then project it on "rotated" max(D_tail-F_tail) axis.
Figure 10: Shows "rotated" sided residual plot by "5/6*(pi/2)" angle (this angle has been picked by eye).
One can see that now most of the counts for gamma-jet sample (middle column)
are on the left side from vertical axis.
Figure 11: "Rotated" max(D_tail-F_tail) [projection on horizontal axis for Fig. 10]
Cut on "Rotated" max(D_tail-F_tail) can be used for signal/background separation.
From figure below one can see much better signal/background separation than in Fig. 3
Figure 12: "Rotated" D_peak [projection on vertical axis for Fig. 10]
Ideally, instead of straight line one needs to use
an actual shape of side residual distribution for MC gamma-jet sample.
This shape can be extracted and parametrized by the following procedure:
The distance to that polynomial function can be used to determine our signal/background rejection efficiency.
This work is in progress...
Just last one figure showing shapes for 6 slices from sided plot.
Figure 13: max(D_tail-F_tail) for different slices in D_peak (scaled by the integral for each slice)
Ilya Selyuzhenkov September 30, 2008
Data sets:
Notations used in the plots:
Figure 1: Sided residual plot: D_peak vs. max(D_tail-F_tail)
Red lines show 4th order polynomial functions, a*x^4,
which have 80% of MC gamma-jet counts on the left side.
These lines are obtained independently for each of pre-shower condition
based on fit procedure shown in Fig. 3 below.
Figure 2: max(D_tail-F_tail) distribution
(projection on horizontal axis from sided residual plot, see Fig. 1 above)
Figure 3: max(D_tail-F_tail) [at 80%] vs. D_peak.
For each slice (bin) in D_peak variable, the max(D_tail-F_tail) value
which has 80% of gamma-jet candidates on the left side are plotted.
Lines represent fits to MC gamma-jet points (shown in red) using different fit functions
(linear, 2nd, 4th order polynomials: see legend for color coding).
Note, that in this plot D_peak values are shown on horizontal axis.
Consequently, to get 2nd order polynomial fit on sided residual plot (Fig. 1),
one needs to use sqrt(D_peak) function.
The same apply to 4th order polynomial function.
Figure 4: D_peak vs. horisontal distance from 4th order polinomial function to max(D_tail-F_tail) values.
(compare with Fig. 1: Now 80% of MC gamma-jet counts are on the left side from vertical axis)
Figure 5: Horizontal distance from 4th order polynomial function to max(D_tail-F_tail)
[Projection on horizontal axis from Fig. 4]
Based on this plot one can obtain purity, efficiency, and rejection plots (see Fig. 6 below)
Horizontal distance plotted in Fig. 5 can be used as a cut
separating gamma-jet signal and QCD-jets background,
and for each value of this distance one can define
gamma-jet purity, efficiency, and QCD-background rejection:
gamma-jet purity is defined as the ratio of
the integral on the left for MC gamma-jet data sample, N[g-jet]_left,
to the sum of the integrals on the left for MC gamma-jet and QCD jets, N[QCD]_left, data samples:
Purity[gamma-jet] = N[g-jet]_left/(N[g-jet]_left+N[QCD]_left)
gamma-jet efficiency is defined as the ratio of
the integral on the left side for MC gamma-jet data sample, N[g-jet]_left,
to the total integral for MC gamma-jet data sample, N[g-jet]:
Efficiency[gamma-jet] = N[g-jet]_left/N[g-jet]
QCD background rejection is defined as the ratio of
the integral on the right side for MC QCD jets data sample, N[QCD]_right,
to the total integral for MC QCD jets data sample, N[QCD]:
Rejection[QCD] = N[QCD]_right/N[QCD]
Figure 6: Shows:
purity[g-jet] vs. efficiency[g-jet] (upper left);
rejection[QCD] vs. efficiency[g-jet] (upper right);
purity[g-jet] vs. rejection[QCD] (lower left);
pp2006 to MC ratio, N[pp2006]/(N[g-jet]+N[QCD]), vs. horizontal distance from Fig. 5 (lower right)