Distribution Inference for Physical and Orbital Properties of Jupiter ’ s Moons

According to the physical and orbital characteristics in Carme group, Ananke group, and Pasiphae group of Jupiter’s moons, the distributions of physical and orbital properties in these three groups are investigated by using one-sample Kolmogorov–Smirnov nonparametric test. Eight key characteristics of the moons are found to mainly obey the Birnbaum–Saunders distribution, logistic distribution, Weibull distribution, and t location-scale distribution. Furthermore, for the moons’ physical and orbital properties, the probability density curves of data distributions are generated; the differences of three groups are also demonstrated. Based on the inferred results, one can predict some physical or orbital features ofmoons withmissing data or even newpossiblemoonswithin a reasonable range. In order to better explain the feasibility of the theory, a specific example is illustrated. Therefore, it is helpful to predict some of the properties of Jupiter’s moons that have not yet been discovered with the obtained theoretical distribution inference.


Introduction
There are 69 (the number has been refreshed to 79 by a team from Carnegie Institution for Science in July 2018.https:// sites.google.com/carnegiescience.edu/sheppard/moons/jupitermoons) confirmed moons of Jupiter, around 65 of which have been well investigated [1,2].Considering the formation of Jupiter's moons is influenced by diverse factors, which results in their physical characteristics differing greatly [3], Jupiter's moons are divided into two basic categories: regular and irregular.The regular satellites are so named because they have prograde and near-circular orbits of low inclination, and they are in turn split into two groups: Inner satellites and Galilean [4].The irregular satellites are actually the objects whose orbits are far more distant and eccentric.They form families that share similar orbits (semi-major axis, inclination, and eccentricity) and composition.These families, which are considered to be part of collisions, arise when the larger parent bodies were shattered by impacts from asteroids captured by Jupiter's gravitational field.That is to say, at the early time of moons' formation of the Jupiter, mass of the original moon's ring was still sufficient to absorb the asteroid's power and put it into orbit.So, part of the irregular moons might be created by the captured asteroids and then collided with other moons [5,6], thus forming the various groups we see today.The identification of satellite families is tentative (please see [7,8] for more details), and these families bear the names of their largest members.The most detailed modelling of the collisional origin of the families was reported in [9,10].
According to this identification scheme [11], 60 moons were classified into 8 different groups, including Small Inner Regulars and Rings, Galileans, Themisto group, Carpo group, Himalia group, Carme group, and Ananke group as well as Pasiphae group, in addition to 9 satellites that do not belong to any of previous groups.The detailed information about all the groups of Jupiter's moons can be found in Appendix A.
In recent years, many scientists have paid considerable attention to astronomical observation, physical research, and deep space exploration of small bodies, including asteroids, comets, and satellites, and so on.For example, planetary scientist Carry collected mass and volume estimates of 17 near-Earth asteroids, 230 main-belt and Trojan asteroids, 12 comets, and 28 trans-Neptunian objects from the known 2 Advances in Astronomy literature [12].The accuracy and biases affecting the methods used to estimate these quantities were discussed and best-estimates were strictly selected.For the asteroids in retrograde orbit, there are at least 50 known moons of Jupiter's that are retrograde, some of which are thought to be asteroids or comets that originally formed near the gas giant and were captured when they got too close.Kankiewicz and Włodarczyk selected the 25 asteroids with the best-determined orbital elements and then estimated their dynamical lifetimes by using the latest observational data, including astrometry and physical properties [13].However, few researchers have tried to extrapolate the distribution of Jupiter's satellites through statistical methods as yet.
In this paper, distributions of physical and orbital properties for the moons of Jupiter will be conducted by using one-sample Kolmogorov-Smirnov (K-S) test and maximum likelihood estimation [14][15][16].Based on the analysis of satellites' data, it is found surprisingly that the physical and orbital characteristics obey some distribution, such as Birnbaum-Saunders distribution [17], logistic distribution, Weibull distribution, and t location-scale distribution.Furthermore, the probability density curves of the data distribution are generated, and the differences of physical and orbital characteristics in the three groups are presented.In addition, the results of theoretical inference results are then proved to be feasible through one concrete example.Therefore, the results may be helpful to astronomers to discover new moons of Jupiter in the future.

Method of Distribution Inference
In statistics, the K-S test, one type of nonparametric test, is used to determine whether a sample comes from a population with a specific distribution.The null hypothesis of onesample K-S test is that the Cumulative Distribution Function (CDF) of the data follows the adopted CDF.For one-sample case, null distribution of statistic can be obtained from the null hypothesis that the sample is extracted from a reference distribution.The two-sided test for "unequal" CDF tests the null hypothesis against the alternative that the CDF of the data is different from the adopted CDF.The test statistic is the maximum absolute difference between the empirical CDF calculated by x and the hypothetical CDF: where () is a given CDF and is the empirical distribution function of the observations   .
Here  (−∞,] (  ) is the indicator function with the following form According to Glivenko-Cantelli theorem [18], if the sample comes from distribution (), then   will almost surely converge to zero when  → ∞.Therefore, we only focus on three satellite groups that have more than ten satellites in their groups, respectively.Of all these groups, Themisto group and Carpo group only contain one satellite, respectively, and only 4 moons were found separately in Small Inner Regulars and Rings, as well as in Galileans group.Moreover, there are 5 moons in Himalia group.For these 5 groups, there is no sufficient data for distribution inference, so we will focus on the Carme group, the Ananke group, and the Pasiphae group.
In the following sections, in order to use the one-sample K-S test, three sets of observed data from Jupiter's moons will be tested against some commonly used distributions in statistics.The list of these distributions is shown in Table 1.
For the 9 continuous distributions in Table 1, the onesample K-S test will be used to select the distribution with the highest confidence level.In order to characterize these distributions with well-defined parameter values, maximum likelihood estimation is also used.In addition, the parameter values of these distributions can be calculated from the observed data.However, when confidence level (typically set to 0.05) decreases, the rejection domain of the test becomes smaller, so the observed values that initially fall into the rejection domain may eventually fall into the acceptance domain.This situation will bring some trouble in practical application.To this end, we adopt p value, which represents the obtained confidence level by using the one-sample K-S test.In addition, the use of p value not only avoids determining the level of significance in advance, but also makes it easy to draw conclusions about the test by comparing the p value and significance level of the test.If the p value is greater than 0.05, we declare that the null hypothesis can be accepted.Furthermore, if the p values of several distributions are all greater than 0.05, the distribution with the largest p value should be selected, and the corresponding distribution will be the most appropriate one to fit the observed data.Our results can be found in the tables in Appendix B.

Distribution Inference of Satellite Groups
In this section, distributions of several diverse physical and orbital properties for the Carme group, Ananke group, and Pasiphae group are inferred sequentially.

Carme Group.
There are 15 moons in the Carme group (please see Appendix A.). Due to the lack of enough data from S/2010 J1, the number of adopted moons is 14, which means the length of each data set is 14 from the mathematical perspective.In addition, considering all the mean densities have been calculated at being 2.60 g/cm 3 , the surface gravity of Carme and other moons in this group is 0.017 m/s 2 and 0.001 m/s 2 , respectively [19].Therefore, these data obviously do not obey the distributions.
Based on the previous method of statistical distribution inference and MATLAB 2016a (Intel Core i5-3230 M, CPU 2.60 GHz), distribution inferences in the Carme group can be found in Table 2, and the last column represents the p value.The smaller the p value, the greater the significance because it tells us that hypothesis under consideration may not be sufficient to explain the observations.The hypothesis will be rejected if any of these probabilities is less than or equal to a small, fixed but arbitrarily predefined threshold value.The null hypothesis here refers to data obeying a particular distribution, and the alternative hypothesis assumes that the data does not obey the distribution.
From Table 2, semi-major axis and mean orbit velocity obey the logistic distribution, of which parameter  denotes the average orbital eccentricity and parameter  plays a key role in representing the variance of the data set, through the variance formula  2  2 /3, so the variances of these logistic distributions are 1.3957E-04 and 4.1863E-04, respectively.As the parameter  increases, it indicates that the average semi-major axis and mean orbit velocity of the Carme group increase.As the parameter  increases, the data in these two characteristics gradually disperse, and the discrepancy between the data and the average value increases.The meaning of decreasing the parameter  and  shares the same principle as that of increasing the parameters.The orbit eccentricity follows the Birnbaum-Saunders distribution, which is unimodal with a median of .The mean value and the variance of the distribution can be calculated by the following relationships: Therefore, the mean value of orbital eccentricity can be calculated to be 0.2543 and the variance is 7.08E-9.
The inclination of orbit, equatorial radius, and escape velocity are subject to the t location-scale distribution, which contains the scale parameter , the location parameter , and the shape parameter .Without loss of generality, we assume that the data vector  obeys the t location-scale distribution, and then we have ( − )/ ∼ (), which obeys Student's t-distribution; here  represents the degrees of freedom.As can also be seen in Table 2, the inclination of orbit obeys the t location-scale distribution with parameters (1.6511, 0.0017, 0.8751) and the mean inclination is 165.11∘ .The equatorial radius follows the t location-scale distribution with parameters (1.65708, 0.440683, 1.14501).Therefore, the moons in Carme group have an average equatorial radius of 1.65708 km.In addition, the escape velocity characteristic obeys the t location-scale distribution with parameters (7.32997, 1.58419, 1.06821) and the average escape velocity is 7.32997 km/h.As the parameter  changes, the average equatorial radius and escape velocity of the group also change accordingly.In the t location-scale distribution, the variance is  2 /( − 2), and when the shape parameter  is greater than two, the variance of the distribution is defined.Therefore, the variances of these two specific t location-scale distributions cannot be defined.

Ananke Group.
There are 10 moons in the Ananke group, of which we do not have enough data about S/2010 J2.Thus, the length of each characteristic of the remaining moons in this group will be 9.
From our discussion on the t location-scale distribution in Carme group, it is easy to understand the distribution inference of semi-major axis, mean orbit velocity, equatorial radius, surface gravity, and escape velocity in the Ananke group (see Table 3).However, compared with Table 2, it is noted that the orbit eccentricity and mass properties in the Ananke group are subject to the Weibull distribution.Parameters  and  represent the scale and shape parameters of the distribution [20], respectively, which together determine the mean and variance of the distribution.

Pasiphae Group.
There are 19 moons in the Pasiphae group; due to the lack of data about three moons, S/2011 J2, S/2017 J1, and S/2016 J1, the remaining 16 will be studied in this subsection.
The distributions in Table 4 are inferred to be similar to the distribution in Tables 2 and 3. Therefore, we can easily understand the parameters of these distributions.

Comparison of Data Properties
Table 5 is given according to the previous distribution inference.
Based on the previous distribution inference, the properties of the moons' data can be compared more specifically and conveniently.According to the distribution of the specific parameters, we get the following probability density function (PDF) diagram.
As shown in Figure 1, the semi-major axis in the Ananke group is the smallest, followed by the Carme group and the Pasiphae group.In addition, the PDF of the Pasiphae group is relatively flat, indicating a large dispersion in semi-major axis around Jupiter, while the data in the Carme and Ananke groups differ slightly.As can be seen from Figure 2, the  trend of mean orbital velocity distribution in Carme group is the closest to each other, with the mean value being the smallest among the three groups, followed with the Pasiphae and Ananke groups.Obviously, the curve of the Pasiphae group is the flattest, which shows the mean orbit velocity in the Pasiphae group differing greatly.Figure 3 shows that the mean value in the Ananke group is the smallest, followed by the Carme and Pasiphae groups.It is obvious that the PDF curve of the Pasiphae group is flatter compared to the others,  which means the orbital eccentricity of the Pasiphae group has a relatively large dispersion.In Figure 4, although the inclinations of orbit in the Ananke group and Pasiphae group are relatively close in value compared with the Carme group, and the data looks more dispersed than those in Carme group, the inclination of orbit in the Ananke group has the same distribution as the Carme group.
Figure 5 shows the PDF curves of equatorial radii of these three groups.Data attributes are similar; most of the moons' radiuses are less than 4 km. Figure 6 indicates similarities between the Ananke and Pasiphae groups.As can be seen from these curves, most of the moons in these two groups are of relatively small mass.
Figure 7 illustrates the surface gravity PDF curves in the Ananke and Pasiphae groups.The difference in density between the two groups indicates that the surface gravity of the Ananke moons is higher than that of the Pasiphae group.The PDF plots corresponding to the escape velocity are shown in Figure 8, where it is clear that the Pasiphae group is flatter than the other two groups.The Carme group also has similar escape velocity density to the Ananke group, but the former average escape velocity is smaller.

Verification of Rationality of Theoretical Results
In this section, we take the semi-major axis and mean orbit velocity of the moons in the Carme group as concrete examples to illustrate the rationality of the statistical inferences in the preceding sections.As can be seen in   Note that the orbital and physical properties are not independent of each other.For instance, the semi-major axis  is related to the mean orbit velocity V through the relation V = √/, where  is the mass parameter.Then the PDF of mean orbit velocity can also be derived analytically as follows Although the PDF of mean orbit velocity obtained by different methods has different mathematical representations, from Figure 9, we can find that  ,V (V; , ) (PDF curve represented by red circles) obtained by the analytical method is in good agreement with  ,V (V; , ) (PDF curve represented by blue circles) obtained by statistical inference.
However, what we need to pay attention to here is that some physical features and orbital elements are mixed and can be linked by some mathematical formulas similar to the above.Theoretically, the distribution of another variable in the formula can be solved by a known defined distribution; although this may be a complex process because the probability density function may contain some transcendental functions and gamma functions, it can be achieved.Yet the known distribution becomes uncertain now; that is, the distribution exists with a certain probability.So, there will be a certain risk when we calculate the distribution of the linked variable based on this uncertain distribution, especially when the possibility of the inferred distribution is not very high.
In addition, in order to further show that the results of the KS test agree well with the actual observed results,  we also compared the best-fit CDFs (the CDFs of inferred statistically) and the observed CDFs.From Figures 10 and 11, it can be seen that the best-fit CDF and the observed CDF of semi-major axis agree better than the case of mean orbital velocity.This should be due to the fact that the  value of the former is 0.9988, which is obviously larger than that of the latter 0.6607.

Application of the Distribution Inference
As mentioned previously, identification of irregular satellite families is tentative.However, after the distribution was inferred, it is found that these features of moons in the three different groups obey some selected distributions.Furthermore, the obtained population distribution can also be used to predict the characteristic data.Considering the rationality of proving this prediction method, suppose some characteristic data of a moon is unknown in a given irregular moon group.One specific feature of the moon can be predicted by using other moons' data and one-sample K-S method.Here is an example: Assume that some characteristic data of the moon Erinome (S/2000 J4) in Carme group is poorly known.Now, we try to predict the equatorial radius of the Erinome.First, we use one-sample K-S method to find the most appropriate continuous distribution for these data.The inference results are as shown in Table 6.
From the p values displayed in Table 6, it becomes clear that the corresponding p value 0.6806 is the largest and ℎ = 0, so the best distribution for the remaining characteristic data in the Carme group is t location-scale distribution with the PDF  ( | , , ) where Γ(⋅) is the Gamma function.The distribution implies that average equatorial radius of the moons in Carme group is 1.67257 km with confidence interval [1.23204, 2.1131] (we believe that the confidence interval will be smaller and shorter with continuous progress of observation technology) under the level of significance 0.05.For the true value of Erinome's equatorial radius being 1.6 km, it is easy to find that we can use the estimated value to predict the equatorial radius or as a reference to study other relevant physical and orbital characteristics.

Conclusions
By using the one-sample K-S nonparametric test method of statistical inference, the distribution laws of the physical and orbital properties of Jupiter's moons are investigated statistically in this paper.The physical and orbital characteristics of Jupiter's moons are found to obey the Birnbaum-Saunders distribution, the logistic distribution, the Weibull distribution, and the t location-scale distribution.
In addition, the probability density curves of the data distributions are also generated, and the differences in the physical and orbital characteristics of the three groups are explained in more detail.
Through a specific example, we find that some moons' missing data can be inferred by using the aforementioned distributional model and probability density function.More importantly, with the help of the distribution, it can be even helpful to predict the physical or orbital features of the undiscovered moon.
If future observations will allow for the expansion of the number of Jupiter's moons, we believe that the distribution laws will be slightly modified as potential newly discovered distribution functions fit the increased sample better, and the distributions will probably tend to be more uniform; i.e., some of the different properties follow the same distribution obviously, but these will not change dramatically over a long period of time.

Figure 1 :
Figure 1: The PDF curves of semi-major axis around Jupiter.

Figure 2 :
Figure 2: The PDF curves of mean orbit velocity.

Figure 9 :
Figure 9: Comparison of statistical prediction distribution and analytical derivation distribution.

SemiFigure 10 :
Figure 10: Comparison of the observed CDF and the best-fit CDF of semi-major axis.

2003
The new Jupiter satellites discovered but yet to be numbered or named.All numbered or named satellites are in the table above

Table 1 :
A list of common distributions.

Table 2 :
The distribution inference in each physical and orbital characteristic (Carme group).

Table 3 :
The distribution inference in each orbital or physical characteristic (Ananke group).

Table 4 :
The distribution inference in each orbital or physical characteristic (Pasiphae group).

Table 2
Figure 7: The PDF curves of surface gravity.
Note: h = 0 and h = 1 indicate acceptance of the null hypothesis and rejection of the null hypothesis, respectively.

Table 10 :
Pasiphae group.Figure 11: Comparison of the observed CDF and best-fit CDF of mean orbit velocity.