LHC data and extensive air showers

Currently the uncertainty in the prediction of shower observables for di fferent primary particles and energies is dominated by di fferences between hadronic interaction models. The LHC data on minimum bias measurements can be used to test Monte Carlo generators and these new constraints will help to reduce the uncertainties in air shower predictions. In this article, after a short introduction on air showers we will show the results of the comparison between the updated version of high energy hadronic interaction models EPOS LHC and QGSJETII-04 with LHC data. Results for air shower simulations and their consequences on comparisons with air shower data will be discussed.


Introduction
Knowing the elemental composition of cosmic ray particles arriving at Earth is of crucial importance to understand the production and propagation of cosmic rays.Unfortunately, cosmic rays can be measured only indirectly above an energy of 10 14 eV through the cascades of secondary particles, called extensive air-showers (EAS), that they produce in the atmosphere (for a recent review, see [1]).Only by simulating the generation of EAS and comparing the predictions with measurements can one draw conclusions on the primary mass composition of the arriving particles [2].With the operation of modern large-scale experiments, the reliability of air-shower simulations has become the source of the largest systematic uncertainty in the interpretation of cosmic-ray data [3][4][5][6][7][8].While the electroweak interaction processes are reasonably well understood, modeling of hadronic multi-particle production is subject to large theoretical uncertainties that are, moreover, difficult to estimate [9][10][11].
The Large Hadron Collider (LHC) at the CERN laboratory allows us to access, for the first time, the energy region above the cosmic ray spectral knee in the laboratory (Fig. 1).Therefore an analysis of inclusive particle data taken at the LHC is particularly interesting for constraining existing hadronic interaction models and for testing possible new mechanisms of hadron production [13].Data from LHC experiments published so far have mostly been taken with detectors covering the central phase space region in pseudorapidity (|η| < 2.5).This region is most easily accessible in collider experiments and is also the region of the highest rapidity-density of produced particles.The first data have been compared to cosmic ray models in [14].On the other hand, since the number of particles a e-mail: tanguy.pierog@kit.eduFlux of cosmic rays arriving at Earth rescaled by the energy to the power 2.5.Data references are given in [12]. in an air-shower is roughly proportional to the energy of the primary particle, the most energetic outgoing particles of an interaction, emitted in the very forward region of a collider experiment -such as in di ffractive interactionsare the most important ones for understanding air-showers.
In this paper, we will discuss changes in the hadronic model predictions after LHC data and their consequences on air shower observables.In the first section, we will explain the so-called Heitler model to extract from a simple toy model the main hadronic observables which drive the development of air showers.We will then compare the results of the hadronic interaction models with LHC data for such observables.Finally using detailed Monte Carlo simulations done with CONEX [15], the new predictions for X max , for the number of muons, and for the missing energy will be presented.

Heitler's Model
To qualitatively describe the dependence of shower development on some basic parameters of particle interaction, decay and production, a very simple toy model can be used.Although initially developed for electromagnetic (EM) showers [16] it can also be applied to hadronic showers [17].

Electromagnetic showers
For simplicity, instead of having three particle types (γ, e + and e − ) like in electromagnetic showers, we will consider only one particle with energy E with only one EM interaction producing two new particles with energy E/2 after a fixed interaction length of λ e , see Fig. 2  Denoting with n the number of generations (consecutive interactions), the number of particles at a given depth with the energy E per particle for a given primary energy E 0 being Defining the critical energy E c (∼ 85 MeV in air) as the energy below which energy loss processes dominate over particle production, one can make the assumption that the shower maximum is reached at a depth at which the energy of the secondary particles reaches E c .Then two main shower observables are given by This simplified picture does not reproduce the detailed behavior of an EM shower, but two important features are well described: the number of particles at shower maximum is proportional to E 0 and the depth of shower maximum depends logarithmically on the primary energy E 0 .

Hadronic showers
Generalizing this idea, a hadronic interaction of a particle with energy E is assumed to produce n tot new particles with energy E/n tot , two thirds of which are charged particles n ch (charged pions) and one third are neutral particles n neut (neutral pions), as shown Fig. 2 right-hand side.Neutral particles decay immediately into EM particles (π 0 → 2γ).After having travelled a distance corresponding to the mean interaction length λ ine , charged particles re-interact with air nuclei if their energy is greater than some typical decay energy E dec .

Energy transfer
In each hadronic interaction, one third of the energy is transferred to the EM shower component.After n generations the energies in the hadronic and EM components are given by where n will be calculated later.Simulations show that the total number of generations before a vertical shower reaches the ground level is typically about 5 to 6 [18].
Even in an air shower initiated by a hadron, most of the energy is carried by EM particles (∼ 90% for n = 6).Hence the depth of shower maximum is given by that of the EM shower component, X e max .As the first hadronic interaction produces EM particles of energy ∼ E 0 /n tot one gets where λ ine is the hadronic interaction length.This simplified expression for the depth of maximum neglects the EM sub-showers initiated by hadrons of later generations.The inclusion of higher hadronic generations does not change the structure of Eq. ( 7), only the coefficients change (see, for example, [19]).

Muon component
To keep the picture simple, we assume that all charged hadrons decay into muons when their energy reaches E dec .
In a real shower, this limit can be seen as the characteristic energy where interaction length and decay length of charged pions are similar (about 150 GeV for pions).By construction, charged particles will reach the energy E dec after n interactions Since one muon is produced in the decay of each charged particle, we get with α = ln n ch / ln n tot = 1 + ln R/ ln n tot ≈ 0.82 . . .0.95 [19] where R = n ch /n tot < 1.The number of muons produced in an air shower depends not only on the primary energy and air density, but also on the total particle multiplicities and in a much more sensitive way [20] of the charged over all-particle ratio of hadronic interactions.
It should be kept in mind that the parameters of the model are only effective quantities and are not identical to the respective quantities measured at accelerators.In particular, the approximation of all secondary particles carrying the same energy is only motivated by the fact that it allows us to obtain simple, closed expressions.The wellknown leading particle effect, typically quantified by the (in)elasticity of an interaction, can be implemented in the model [17] but will not be considered here.

Hadronic Interaction Models and LHC data
It is clear that such a toy-model, as described in previous section, only gives a very much over-simplified account of air shower physics.However, the model allows us to qualitatively understand the dependence of many air shower observables on the characteristics of hadronic particle production.Accordingly the parameters of hadron production which are most important for air shower development are the cross section (or mean free path), the multiplicity of secondary particles of high energy, and the production ratio of neutral to charged particles.Until the start of LHC, these parameters were not well constrained by particle production measurements at accelerators.As a consequence, depending on the assumptions of how to extrapolate existing accelerator data, the predictions of hadronic interaction models differ considerably.There are several hadronic interaction models commonly used to simulate air showers.Here we will focus on the two high energy models which were updated to take into account LHC data at 7 TeV: QGSJETII-03 [21,22] changed into QGSJETII-04 [23] and EPOS 1.99 [24,25] was replaced by EPOS LHC (v3400), both available in CORSIKA v7.35 [26].There is no major change in these models but in addition to some technical improvements, some parameters were changed to reproduce TOTEM [27] cross sections.Both are based on Gribov-Regge multiple scattering, perturbative QCD and string fragmentation.The former versions reproduce accelerator data and even first LHC data reasonably well ( [14] and Figs. 4 and 5) but predict different extrapolations above E cms ∼1.8 TeV (E lab ∼ 10 15 eV) that lead to very different results at high energy [2,28] which can be improved using LHC data.

Cross section
As shown in eq. 7, the cross section is very important for the development of air showers and in particular for the .99(dashed line) and QGSJETII-03 (dashed-dotted line).Points are data from [30] and the stars are the LHC measurements by the TOTEM experiment [27].
depth of shower maximum.As a consequence, the number of electromagnetic particles at ground level is strongly correlated to this observable (if the shower maximum is closer to ground, the number of particles is higher).The total cross section of proton-proton scattering is usually used as an input to fix basic parameters in all hadronic interaction models.Therefore it is very well described by all the models at low energy, where data exist [29].And then it diverges above 2 TeV center-of-mass (cms) energy because of different model assumptions.As shown on Fig. 3 the new point measured by the TOTEM experiment at 7 TeV reduces the difference between the models by a factor of 5 (50 to 10 mb).In all the figures EPOS LHC is represented by a full (blue) line, QGSJETII-04 by a dotted (red) line, EPOS 1.99 by a dashed (black) line and QGSJETII-03 by a dashed-dotted (green) line.

Multiplicity
According to eq. 7, the multiplicity plays a similar kind of role as the cross section, but with a weaker dependence (log).On the other hand, the predictions from the models had much larger differences for the multiplicity compared to the cross section.As shown in Fig. 4 left-hand side, the particle density at mid-rapidity is well reproduced by all the models up to 2 TeV where Tevatron data [31] constrain the results, but at the highest energies in π-air, the difference can be as high as a factor of 10 (Fig. 4 righthand side).After re-tuning at 7 TeV to be compatible with CMS data [32] or ALICE data [33], in Fig. 5, the difference, even at high energy in π-air, is less than a factor of 2. On the right-hand side of Fig. 5, we can see that not only the averaged multiplicity had been changed after re-tuning, but the fluctuations are now very similar for QGSJETII-04 and EPOS LHC.This will be important for the fluctuations of the air shower maximum.So, for both cross section and multiplicity, when the models are constrained by LHC data up to 7 TeV, the extrapolation to the highest energy is not so different any more.This will have a strong impact on X max uncertainty in air shower simulations.

Baryon production
Another important observable for EAS is the number of muons reaching the ground.Using eq. 9 and the definition of α and R, it has been shown in [20] that the number of (anti)baryons plays an important role in the value of R especially if we take into account the leading particle effect.As a consequence the number of muons in EAS is sensitive to the number of (anti)baryons produced in the hadronic interactions and it is important to check the production of such particles in LHC data.
Both ALICE [34] and CMS [35] experiments published very nice results on identified spectra used to constrain models used for air shower simulations.As shown in Fig. 6, these data help a lot to reduce the differences between the models especially because they resolve an ambiguity on the phase space used to produce the anti-proton over pion ratio as a function of the number of charged particles with Tevatron data at 1.8 TeV.LHC data are much better defined and can be used to constrain the production of baryon pairs at mid-rapidity (largely dominated by string fragmentation).
It is important to notice that not only (and not all) (anti)baryons enter into the definition of the ratio R. All particles which do not decay into an electromagnetic particle can play a similar role and keep the energy of the shower into the hadronic channel to produce muons.For instance in QGSJETII-04 the newly introduced ρ 0 resonance as excited state of the pion remnant in pion in-teractions has a very strong influence on the muon production.Since forward π 0 s, transferring a lot of energy in the electromagnetic channel, are replaced by particles which decay into charged pions, the energy is kept in the hadronic channel.This is clearly illustrated by the Fig. 7 where we can see that QGSJETII-04 reproduce nicely both π 0 and ρ 0 forward spectra while QGSJETII-03 producing no ρ 0 had to hard π 0 spectra.This effect was already in EPOS 1.99, being one source of difference between the 2 models.EPOS 1.99 model seems to underestimate ρ 0 production but this is due to a wrong mixing with ω resonance and, in fact, the sum of the 2 is correct.It has been corrected in EPOS LHC.On the other hand, in EPOS 1.99 another process producing forward (anti)baryons was missing at high energy and has been added in EPOS LHC.As a consequence the reduced rate of (anti)baryon production at mid-rapidity is compensated by more forward (anti)baryon production which is even more important for muon production.Unfortunately there is very little data to constrain this production channel especially in collider experiments.NA61 [38] and LHCf [39] experiments may help to constrain this process in the future.

EAS Simulations
Using the air shower simulation package CONEX and the new versions of the high-energy hadronic interaction models, we can get an estimate of the resulting uncertainties.
In the following EAS simulation, results using EPOS LHC (v3400) and QGSJETII-04 are presented and compared to former results using QGSJETII-03 and EPOS 1.99.
As shown in Fig. 8, the mean depth of shower maximum, X max , for proton and iron induced showers simulated with CONEX is still different for EPOS LHC and 03001-p.4Transverse momentum distribution for |y| < 1 for non single diffractive (NSD) p-p scattering at 7 TeV (left-hand side) and anti-proton over pion ratio at y = 0 for p-p collisions (right-hand side) as a function of center of mass energy.Simulations are done with EPOS LHC (full line), QGSJETII-04 (dotted line), EPOS 1.99 (dashed line) and QGSJETII-03 (dashed-dotted line).Points are data from ALICE and PHENIX [34] experiments and triangles are from CMS experiment [35].QGSJETII-04.But now the elongation rate (the slope of the mean X max as function of the primary energy) is the same in both cases while EPOS 1.99 had an elongation rate larger than QGSJETII-03.The difference between the two models is a constant shift of about 20 g/cm 2 (close to the experimental systematic error in Pierre Auger Observatory [7]) while before the difference was increasing up to 50 g/cm 2 at the highest energies.This is very important to the study of the primary cosmic ray composition.If the models converge to a similar elongation rate, it will allow us to have a more precise idea on possible changes in composition at the "ankle" for instance where the Pierre Auger Observatory measure a break in the elongation rate of the data.
Concerning the number of muons at ground (for 40 o inclined showers at the height of 1500 m), the difference between the new QGSJETII-04 and the old QGSJETII-03 is even more impressive.We can see on Fig. 9 that QGSJETII-04 predicts now only 7% less muons than EPOS 1.99, which is about 20% more than QGSJETII-03.It is due to the change in baryon, strangeness and mostly resonance production as described in section 3.3.Concerning the predictions of EPOS LHC, the number of muons is very similar to the one in EPOS 1.99 because of the leading baryon production compensating the reduction of (anti)aryon production at mid-rapidity.So, even if the number of muons is much more similar now for the two most recent hadronic models, there is still an uncer- to the data can be found in [1] and [7].
tainty of about 10% and furthermore the energy spectrum of the muons at ground is different between the models.This can be an important factor for the attenuation length of the muons in the atmosphere.The average energy of the muons is, for the moment, larger in QGSJETII-04 than in EPOS.
A consequence of the larger mean energy of the muons in EAS simulated with QGSJETII-04 is that the missing energy is larger for showers simulated with this model even if the total number of muons is larger in EPOS LHC.
EPOS LHC results are very similar to those of EPOS 1.99.The factor to correct for the missing energy in fluorescence based telescopes is presented on Fig. 10.The difference between QGSJETII-04 and the previous version QGSJETII-03 frequently used as reference in the past is less than 3% but it has a direct influence on the energy scale considered to compare simulations and data.It should not be neglected.

Summary
Using a simple cascade model, it is possible to find the main parameters of hadronic interactions that influence air shower predictions.For the mean depth of shower maximum, X max , these parameters are the inelastic cross sections, the secondary particle multiplicity, and the inelasticity (not studied here).Using recent LHC data at 7 TeV it is possible to reduce the uncertainty in the extrapolation of the hadronic interaction models used for EAS simulations.Using pre-and post-LHC versions of the QGSJETII and EPOS models, it has been shown that the difference in multiplicity between these models has been reduced by a factor of 5 at the highest energy, resulting in a very similar elongation rate.There is still a systematic shift in X max of about 20 g/cm 2 due to remaining differences in the multiplicity (and elasticity) of the models.This uncertainty is comparable to the experimental uncertainty in the measurement of X max .As a consequence the interpretation of the data using post-LHC data will be more reliable, especially concerning the possible change in mass composition with energy as summarized in [40].
For the number of muons, the ratio between particles producing hadronic sub-showers and the total number of particles is very important.LHC data are important to constrain (anti)baryon and strangeness production at midrapidity.Lower energy data of fixed target experiment are also important to measure forward production of π 0 for instance.Taking into account both aspects, the new version of the QGSJETII and EPOS models predict very similar results close to EPOS 1.99 model, but with harder spectrum.
The difference between EPOS 1.99 and the results of EPOS LHC is not very large because most of the changes are taking place at mid-rapidity.This phase space is good to test the physics of the model but is not very important for air shower development.A contrario, large differences between QGSJETII-03 and QGSJETII-04 are observed.With a larger X max the average mass is heavier than before at Auger energies.In addition the number of muons increased by about 20% to a value closer to EPOS 1.99 but with a larger mean energy for the muons at ground.
To conclude, we can say that LHC data contribute a lot to reducing the uncertainties in air shower simulations, providing better tools to analyze cosmic ray data.The differences between the hadronic models have been reduced but one should keep in mind that there are still uncertainties in the models themselfs which have to be better quantified and transferred to the calculation of the systematic errors in EAS analysis.

Figure 1 .
Figure1.Flux of cosmic rays arriving at Earth rescaled by the energy to the power 2.5.Data references are given in[12].

Figure 2 .
Figure 2. Schematic view of electromagnetic cascades (lefthand side) and schematic view of hadronic cascades (right-hand side).Dashed lines represent neutral particles (π 0 ) and solid lines charged particles (π ± ).Only one charged hadron interaction is shown for each generation.

Figure 4 .
Figure 4. Particle density at η = 0 for non single diffractive events (NSD) (left-hand side) and multiplicity for |η| < 2.5 of π-air collisions (right-hand side) as a function of center of mass energy.Simulations are done with EPOS LHC (full line), QGSJETII-04 (dotted line), EPOS 1.99 (dashed line) and QGSJETII-03 (dashed-dotted line).Points are data from old experiments and red stars are from CMS experiment[32].

Figure 9 .
Figure 9. Mean number of muons at ground divided by the primary energy for proton and iron induced showers as a function of the primary energy.Predictions of different high-energy hadronic interaction models: full lines for proton and dashed lines for iron with full stars for EPOS 1.99, open squares for QGSJETII-04, open circles for QGSJETII-03, and full triangles for the results of EPOS LHC.

Figure 10 .
Figure 10.Inverse of missing energy factor for proton and iron induced showers as a function of the primary energy.Predictions of different high-energy hadronic interaction models are shown with triangles for EPOS LHC and circles for QGSJETII-04.Lines are averages between models.