A new distribution for multiplicities in leptonic and hadronic collisions at high energies

Charged particles production in the electron-positron, pbarp and pp collisions in full phase space as well as in the restricted phase space slices, at high energies are described with predictions from shifted Gompertz distribution, a model of adoption of innovations. The distribution has been extensively used in diffusion theory, social networks and forecasting. A two-component model in which PDF is obtained from the superposition of two shifted Gompertz distributions has been introduced to improve the fitting of the experimental distributions by several orders. The two-components correspond to the two subgroups of a data set, one representing the soft interactions and the other semi-hard interactions. Mixing is done by appropriately assigning weights to each subgroup. Our first attempt to analyse the data with shifted Gompertz distribution has produced extremely good results. It is suggested that the distribution may be included in the host of distributions more often used for the multiplicity analyses.


Introduction
The shifted Gompertz distribution was introduced by Bemmaor [1] in 1994 as a model of adoption of innovations.It is the distribution of the largest of two independent random variables one of which has an exponential distribution Email addresses: Ridhi.Chawla1988@gmail.com(R.Chawla), manjit@pu.ac.in (M.Kaur) Preprint submitted to Journal of Advances in High Energy Physics November 6, 2018 arXiv:1801.00278v2[hep-ph] 5 Nov 2018 with parameter b and the other has a Gumbel distribution, also known as log-Weibull distribtion, with parameters η and b.Several of its statistical properties have been studied by Jiménez and Jodrá [2] and Jiménez Torres [3].In machine learning, the Gumbel distribution is also used to generate samples from the generalised Benoulli distribution, which is a discrete probability distribution that describes the possible results of a random variable that can take on one of the K-possible elementary events, with the probability of each elementary event separately specified.The shifted Gompertz distribution has mostly been used in the market research and diffusion theory, social networks and forecasting.It has also been used to predict the growth and decline of social networks, on-line services and shown to be superior to the Bass model and Weibull distribution [4].It is interesting to study the statistical phenomena in high energy physics in terms of this distribution.Recently, Weibull distribution has been used to understand the multiplicity distributions in various particle-particle collisions at high energies and more recently [5] to explain LHC data.Weibull models studied in the literature were appropriate for modelling a continuous random variable which assumes that the variable takes on real values over the interval [0, ∞].In situations where the observed data values are very large, a continuous distribution is considered an adequate model for the discrete random variable, for example in case of a particle collider, the luminosity during a fill decreases roughly exponentially.Therefore, the mean collision rate will likewise decrease.That decrease will be reflected in the number of observed particles per unit time.In the same way a photon detector that counts photons in a continuous train of time bins.If the photons are anti-bunched in time, that is, they tend to be separated from each other, one will get a different distribution of photon counts than if the photons are bunched, that is, bunched together in time.By analyzing the photon counting statistics one can infer information about the continuous underlying distribution of the temporal spacing of photons.The shifted Gompertz distribution with non-negative fit parameters identified with the scale and shape parameters, can in this way be used for studying the distributions of particles produced in collisions at accelerators.One of the studies in statistics is when the variables take on discrete values.The idea was first introduced by Nakagawa and Osaki [6], as they introduced discrete Weibull distribution with two shape parameters q and β where 0 < q < 1 and β > 0. Models which assume only non-negative integer values for modelling discrete random variables, are useful for modelling the kind of problems mentioned above.
The charged-particle multiplicity is one of the simplest observables in collisions of high energy particles, yet it imposes important constraints on the dynamics of particle production.The particle production has been studied in terms of several theoretical, phenomenological and statistical models.Each of these models has been reasonably successful in explaining the results from different experiments and useful for extrapolations to make predictions.Although Weibull distribution has been studied recently, no attempt has been made so far to analyze the high energy collision data in terms of shifted Gompertz distribution.Our first attempt to analyse the data produced good results and encouraged us for a comprehensive analysis.
The aim of the present work is to introduce a statistical distribution, the shifted Gompertz distribution to investigate the multiplicity distributions of charged particles produced in e + e − , pp and pp collisions at different center of mass energies in full phase space as well as in restricted phase space windows.Energy-momentum conservation strongly influences the multiplicity distribution for the full phase space.The distribution in restricted rapidity window however, is less prone to such constraints and thus can be expected to be a more sensitive probe to the underlying dynamics of QCD, as inferred in references [7,8].
In Section II, details of Probability Distribution Function (PDF) of the shifted Gompertz distribution is discussed.For e + e − collisions a two component model has been used and modification of distributions done in terms of these two components; one from soft events and another from semi-hard events.Superposition of distributions from these two components, by using appropriate weights is done to build the full multiplicity distribution.When multiplicity distrbution is fitted with the weighted superposition of two shifted Gompertz distributions, we find that the agreement between data and the model improves considerably.The fraction of soft events, α for various energies have been taken from references [9,10] which use the K T clustering algorithm, the most extensively used algorithm for LEP e + e − data analyses.The corresponding fractions for pp and pp are not available in different rapidity bins.For pp data at all energies under study, the α values for full phse space are taken from reference [11].We also tried to fit the multiplicity distribution to find the best fit alpha value.It is found that, the fit values agree very closely with values obtained from reference [11].We thus fitted distributions in restricted rapidity windows for pp and pp data in terms of soft and semi-hard components to get the best fit α values.
In a recent publication, Wilk and W ĺodarczyk [12] have developed a method of retrieving additional information from the multiplicity distributions.They propose, in case of a conventional Negative Binomial Distribution fit [11], to make the parameters dependent on the multiplicity in place of having a 2component model.They demonstrated that the additional valuable information from the MDs, namely the oscillatory behaviour of the counting statistics can be derived.In a future extension of the present work, we shall analyse the shifted Gompertz distribution, using the approach proposed and described by the authors [12].
Section III presents the analysis of experimental data and the results obtained by the two approaches.Discussion and conclusion are presented in Section IV.

Shifted Gompertz distribution
The dynamics of hadron production can be probed using the charged particle multiplicity distribution.Measurements of multiplicity distributions provide relevant constraints for particle-production models.Charged particle multiplicity is defined as the average number of charged particles, n produced in a collision nP n .Hadron production depends upon the center of mass energy available for particle production nearly independent of the types of particles undergoing collisions.Subsequently, it is the fragmentation of quarks and gluons which produce hadrons non-perturbatively.Thus the same PDFs can be used to describe behaviour of multiplicity distributions.In numerous works in the past, the most popular Negative Binomial Distribution has been successfully used for a wide variety of collisions [13].Universality of multiparticle production in e + e − , pp and pp have been discussed in several papers, a detailed paper amongst these is [14].
We briefly outline the probability density function (PDF) of the shifted Gompertz distribution used for studying the multiplicity distributions.Equation (1-3) define the PDF and the mean value of the distribution; Mean of the distribution is given by and Where b ≥ 0 is a scale parameter and η ≥ 0 is a shape parameter.Similar to the Weibull distribution, shifted Gompertz distributions is also a two parameter distribution, in terms of its shape and scale.

Two-Component Approach
It is well established that at high energies, charged particle multiplicity distribution in full phase space becomes broader than a Poisson distribution.This behaviour has been successfully described by a two parameters negative binomial (NB) distribution defined by; where k is related to the dispersion D by k parameter of the distribution is negative in the lower energy domain where the distribution is binomial like.k is positive in the higher energy domain and the distribution is truly NB, the two particle correlations dominate and 1/k is closely related to the integral over full phase space of the two particle correlation function.NB distribution was very successful until the results from UA5 collaboration [15] showed a shoulder structure in the multiplicity distribution on pp collisions.To expain this NB regularity violations, C. Fuglesang [16], suggested the violations as the effect of the weighted superposition of soft events (events without mini-jets) and semi-hard events (events with mini-jets), the weight α being the fraction of soft events.The multiplicity distribution of each component being NB.This idea was successfully implemented in several analyses at high energies to fit the multiplicity distributions with superposed NB functions.
Adopting this suggestion for the multiplicity distributions in e + e − , pp and pp collisions at high energies, we have used a superposition of two shifted Gompertz components.The two components are interpreted as soft and hard components, as explained above.The Multiplicity distribution is produced by adding weighted superposition of multiplicity in soft events and multiplicity distribution in semi-hard events.This approach combines two classes of events, not two different particle-production mechanisms in the same event.Therefore, no interference terms are needed to be introduced.The final distribution is the sum of the two independent distributions, henceforth called modifed shifted Gompertz distribuion.
In this approach, the multiplicity distribution depends on five parameters as given below; As described by A. Giovannini et al [11], that the superimposed physical substructures in the cases of e + e − annihilation and hadron-hadron interactions, are different, the weighted superposition mechanism is the same.

The Data
The data from different experiments and three collision types are considered; i) e + e − annihilations at different collision energies, from 91 GeV up to the highest energy of 206.2 GeV at LEP2, from two experiments L3 [17] and OPAL [18,19,20,21] are analysed.

Results and discussion
The PDF defined by equations (1,6) are used to fit the experimental data.tuations between the data and the fits are acceptably small, as the ratio is nearly one.
Figure 4 shows the modified shifted Gompertz distribution, equation (6) fitted to the pp data at energies from 200 GeV to 900 GeV in four rapidity windows.To avoid cluttering of figures, the plots for shifted Gompertz are not shown.Figure 5 shows the shifted Gompertz and modified shifted Gompertz distributions, fitted to the pp collisions in full phase space for the same energies.The comparison can be seen from the parameters of the fits, χ 2 /ndf and the p-values documented in Table II.Figure 6 shows the ratio plots of the data over modified shifted Gompertz fit for pp collisions at different energies in full phase space.The plots show acceptable fluctuations with the ratio values around unity. Figure 7 shows the modified shifted Gompertz distribution, equation (6) fitted to the pp collisions at LHC energies from 900 GeV to 7000 GeV in four rapidity windows.Again for restricting the number of figures, only the modified distributions are shown.Comparison between the two types of distributions can be seen from the parameters of the fits, χ 2 /ndf and the p-values documented in Table III.Figure 8 shows the ratio plots of data over modified shifted Gompertz fit for the collisions at different energies in full phase space.The plots show acceptable fluctuations with the ratio values around unity.
Comparison of the fits and the parameters shows that overall shifted Gompertz distribution is able to reproduce the data at most of the collision energies in full phase space as well as in the restricted rapidity windows for e + e − , pp and pp collisions.It does fail and is excluded statistically for some energies where p-value is < 0.1%, in particular for 540 GeV pp data for some rapidity intervals and for LHC data at the highest energy of 7000 GeV.However, comparison of the fits and the parameters from the (two-component) modified shifted Gompertz distribution shows that though the data are very well reproduced in full phase space as well as in all rapidity intervals for all collision energies in e + e − , pp and pp collisions, the distribution does fail for the e + e − collisions at 91 GeV.The χ 2 /ndf value in each case reduces enormously, when modified shifted For minimisation of χ 2 for the fits, CERN library MINUIT2 has been used.In case of modified shifted Gompertz, the fit parameters are doubled while introducing the modification.This causes large error limits on the parameters resulting in the very large p values, particularly close to 1.In addition, the LEP data for e + e − collisions suffer from very small sample size at some energies, thereby adding to the errors on the fit parameters.
Using shifted Gompertz distribution, the multiplicity distribution for 500 GeV e + e − collisions at a future Collider is predicted, as shown in figure 10.The value of mean multiplicity < n > is predicted to be the 37.14 ± 1.12.Figure 11 shows the dependence of mean multiplicity from experimental data on energy  √ s.The fitted curve in equation ( 10) represents Fermi-Landau model [24,25] and fits the data reasonably well with a 1 = -10.609± 2.003 and b 1 = 10.156 ± 0.561 It may be observed that the value of < n > predicted from shifted Gompertz distribution at 500 GeV fits well on the curve, as shown in the figure 11.A parameterization of the multiplicity data in e + e − collisions at the next-to-leadingorder QCD was done by D.E.Groom et al [26,14] and is given in equation ( 9) of the reference: where a and c are constants and β 0 is defined in equation (9.4b).The < n > versus √ s dependence was shown in reference [27].Parameters a and c were fitted to the experimental data and a very good agreement was shown.It is observed that both formulae, equations (10,11), provide excellent extrapolations for √ s > 206.The mean multiplicity < n > at 500 GeV is predicted to be 39.18 by NLO QCD equation.In the present work, the mean value predicted by the shifted Gompertz distribution as 37.14 ± 1.23 agrees very closely with the value derived from NLO QCD.This is good test of the validity of the proposed distribution.
An interesting description of universality of multiplicity in e + e − and p+p(p) has been discussed by Grosse-Oetringhaus et al [14].It is shown that although the multiplicity distributions differ between p + p(p) and e + e − collisions, their average multiplicities as a function of √ s show similar trends that can be unified using the concepts of effective energy and inelasticity.It is also shown that the Fermi-Landau form < n >∼ s 1/4 fails to describe the pp multiplicity data.But the data is well described by < n >= A + Blns + Cln 2 s.The universality appears to be valid at least up to Tevatron energies.The multiplicities in e + e − and p + p(p) collisions become strikingly similar when the effective energy E ef f in p + p(p) collisions, available for particle production is used.
where E lead is the energy of the leading particle and the inelasticity K is defined The parameter n 0 corresponds to the contribution from the two leading protons to the total multiplicity and is expected to be close to n 0 =2.One can use this fit of p + p(p) data to predict the multiplicities at the LHC.As described in reference [14], using a fit with equation( 13), Jan Fiete Grosse-Oetringhaus et al have estimated K = 0.35 ± 0.01 and n 0 = 2.2 ± 0.19.Under the assumptions that inelasticity remains constant at about 0.35 at LHC energies and that the extrapolation of the e + e − data with the QCD form is still reliable, authors fit the p+p(p) data to predict the multiplicities at the LHC.They find < n >=88.9.We use these values of inelasticity and average multiplicity to build the multiplicity distribution at √ s=14 TeV using shifted Gompertz function.
Figure 12 shows the multiplicity distribution predicted from shifted Gompertz PDF for pp collisions at √ s = 14 TeV at LHC.The mean value of the multiplicity is predicted to be < n >≈89.2.It is observed that in general, at all energies for different types of collisions, the multiplicity distributions can be described by shifted Gompertz function.However the LHC data at 7000 GeV in the lower rapidity windows are an exception, whereby the fits are statistically excluded with CL < 0.1%.At all energies, both the scale parameter b and the shape parameter η decrease with the collision energy in the center of momentum.In the rapidity windows, b decreases with the increase in the rapidity.The shape parameter η increases with rapidity as it determines the width of the distribution.
The fact that multiplicity distributions at higher energies show a shoulder structure is well established.In order to improve upon shifted Gompertz fits to the data, the multiplicity distribution is reproduced by a weighted superposition of two shifted Gompertz distributions corresponding to the soft component and the semi-hard component.It is observed that this modified shifted Gompertz distribution improves the fits excellently and the χ 2 values diminish by several orders.However, distributions fail for 91 GeV e + e − data.
The data for pp collisions at 7 TeV fail for shifted Gompertz distribution in three rapidity windows.But the modified shifted Gompertz distribution shows a very good agreement with the data for all rapidities, as shown in table III.For each of the rapidity bins, χ 2 /dof values are reduced manifold with CL > 0.1%.
Using the shifted Gompertz function and the analysis, the multiplicity distributions at future collider and the mean multiplicity predicted for 500 GeV e + e − , agrees very well with the predictions from NLO QCD prediction and also with the Fermi-Landau model of particle production.

Conclusion
The aim of this paper is to propose the use of a new statistical distribution for studying the multiplicity distributions in high energy collisions; shifted Gompertz distribution function often used in model of adoption of innovations, describes the multiplicity data extremely well.A detailed analysis of data from e + e − , pp and pp collisions at high energies in terms of shifted Gompertz distribution shows that, in general the distribution fits the data very well at most of the energies and in various rapidity intervals with the exception of a very few.Very similar to the Weibull distribution, which recently has been extensively used, it determines two non-negative parameters measuring the scale and shape of the distribution.A power law dependence of the scale parameter and shape parameter on the collision energy is established for the e + e − data.The parametrisation as a power law is inspired by the observation that single particle energy distribution obeys a power law behaviour.
The occurrence of a shoulder structure in the multiplicity distribution (MD) of charged particles at high energy is well established.This affects the shape of the distribution fit.To improve upon the fits to the data, a weighted superposition of the distributions using shifted Gompertz function for the soft events (events with mini-jets) and the semi-hard events (events without mini-jets) is done.The concept of superposition originates from purely phenomenological and very simple considerations.The two fragments of the distribution suggest the presence of the substructure.The two-component shifted Gompertz distribution fits the data from different types of collisions at different energies, extremely well.Describing the MD in terms of soft and semi-hard components, allows one to model, under simple assumptions the new energy domain.While predicting the multiplicity distribution using shifted Gompertz Distribution at 14 TeV, it remains interesting to determine the dependence of fraction of minijet events, α upon the rapidity windows compared to the events without mini-jets.To predict the more accurate multiplicity distributions in different rapidity windows at 14 TeV, modified shifted Gompertz PDF is required, for which α value in each rapidity window is needed.The analysis presented for 7 TeV data shows that the minijet fraction of events decreases with energy as well as with the increasing size of rapidity window.This trend has also been shown in reference [11] where the α fraction for full rapidity range of pp collisions at 14 TeV has been estimated as 0.30.When multiplicity distributions in full phase space at higher energies like 7 TeV, become available, the extrapolations from the lower energy domain to the highest energies can be well established, as predicted in other works also, using different approaches [28,29].
A good agreement between the mean multiplicity and the multiplicity dependence on energy, predicted by NLO QCD and the Fermi-Landau model of particle production, with the predictions made by shifted Gompertz distribution, serves as a good test of the validity of the proposed distribution.
The future extension of the present work shall focus on analysis of multiplicities from lower energy domains, in hadron-nucleus interactions and nucleusnucleus interactions using shifted Gompertz distribution and to derive the additional information from the oscillatory behaviour of the counting statistics, as suggested by Wilk and W ĺodarczyk [12].

Fig- ures 1
-2 show the shifted Gompertz function and the modified (two-component)shifted Gompertz function fits to the data e + e − from L3 and OPAL experiments.Parameters of the fits, χ 2 /ndf and the p-values are documented in Table I. Figure 3 shows the ratio of data over modified shifted Gompertz fit plots for e + e − collisions at two energies.The plots correspond to the worst and the best fits depending upon the maximum and minimum χ 2 /ndf values and show that fluc-

Figure 1 :
Figure 1: Charged multiplicity distribution from L3 experiment.Solid lines represent the Gompertz distribution.

Figure 2 :
Figure 2: Charged multiplicity distribution from OPAL experiment.Solid lines represent the Gompertz distribution.

Figure 4 :
Figure 4: Charged multiplicity distributions in pp collisions in four rapidity windows.Solid lines represent modified shifted Gompertz function.

Figure 5 :
Figure 5: Charged multiplicity distributions in pp collisions in full phase space.Solid lines represent shifted Gompertz function (top) and modified shifted Gompertz distributions(bottom).

Figure 7 :Figure 8 :Figure 9 :
Figure 7: Charged multiplicity distributions in pp collisions in four rapidity windows for the modified shifted Gompertz distributions.

Figure 10 :
Figure 10: Probability distribution for e + e − collisions predicted from shifted Gompertz function.

Figure 11 :Figure 12 :
Figure 11: Average multiplicity in e + e − collisions as a function of √ s.The fitted curve represents Fermi-Landau model.

Table 1 :
Parameters of shifted Gompertz and modified shifted Gompertz functions for e + e − collisions.

Table 2 :
Parameters of shifted Gompertz and modified shifted Gompertz functions for pp collisionsGompertz fit is used.In each case the fit is accepted with p-value > 0.1%.For shifted Gompertz distribution, the scale parameter b and the shape parameter η values are plotted in figure 9 for e + e − interactions for LEP data from L3 and OPAL experiments.A power law is fitted to the data.It is observed that both b and η values decrease with increase in collision energy and are

Table 3 :
Parameters of shifted Gompertz and modified shifted Gompertz functions for pp collisions.