Complexity (ISSN 1099-0526, 1076-2787)

Hindawi

DOI: 10.1155/2017/8098574

Article ID 8098574

Research Article

Statistical Analysis of Video Frame Size Distribution Originating from Scalable Video Codec (SVC)

Sima Ahmadpour ¹ (http://orcid.org/0000-0002-1753-4549), Tat-Chee Wan ² ³ (http://orcid.org/0000-0002-3094-8228), Zohreh Toghrayee ⁴, and Fariba HematiGazafi ⁴

Academic Editor: Alicia Cordero

¹ Department of Electrical, IT & Computer Sciences, Islamic Azad University, Qazvin Branch, Qazvin, Iran (qiau.ac.ir)

² National Advanced IPv6 Center, Universiti Sains Malaysia, 11800 Penang, Malaysia (usm.my)

³ School of Computer Sciences, Universiti Sains Malaysia (USM), 11800 Penang, Malaysia (usm.my)

⁴ Central Bank of the Islamic Republic of Iran, Tehran, Iran (cbi.ir)

Received 30 June 2016; Accepted 15 January 2017; Published 19 March 2017

Copyright 2017

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Designing an effective, high-performance network requires accurate characterization and modeling of network traffic. Models of video frame sizes are commonly used in simulation studies and mathematical analysis, and to generate streams for testing and compliance purposes. Moreover, video is expected to be a major source of multimedia traffic in future heterogeneous networks. The statistical distribution of video data can therefore serve as an input for the performance modeling of networks. This paper presents the theoretical definition of the distributions that appear relevant to the video traces in terms of their statistical properties and identifies the best distribution using both a graphical method and a hypothesis test. The data set consists of layered video traces of three different movies, generated with the Scalable Video Codec (SVC) video compression technique.

1. Introduction

Generally, a thorough understanding of the traffic and quality characteristics of encoded video is the basis for traffic modeling and the development of video transport mechanisms [1]. Multimedia transmissions account for a large share of today's traffic over computer and mobile communication networks. The behavior of networks under such traffic can be studied with live experiments using real networks and real sources. However, testing real networks is fairly expensive, and it is often difficult to obtain realistic results. An alternative is to model the traffic using mathematical analysis or simulation. Trace-driven simulations are considered reliable because they represent an actual traffic load; nevertheless, they are usually static and so provide merely a point representation of the workload space. A further disadvantage of traces is the difficulty of adjusting parameters and of extending the trace when the simulation must continue beyond the number of packets/frames in the trace file [2]. For this reason, statistical and mathematical traffic models are regarded as better solutions, since they provide a better understanding of various traffic characteristics. Because they are stochastic in nature, different realizations that represent the actual data can be obtained by varying the model parameters.

Among the various characteristics of video traffic, the following two are of major interest in the literature:

(a) Distribution of frame sizes.

(b) Autocorrelation function (ACF), which captures common dependencies between frame sizes in variable bit rate (VBR) video.

Among all multimedia applications, video services are the most common source of traffic in communication networks. Raw video data requires very high transmission bandwidth and a large amount of storage space [3]. Video compression techniques are therefore highly recommended, and different types of network traffic arise depending on the application. The focus of this paper is on video traces generated with the Scalable Video Codec (SVC) as the compression technique.

SVC is an extension of the H.264/AVC standard, which was standardized by the Joint Video Team of the ITU-T VCEG and the ISO/IEC MPEG [4]. SVC has been proposed to support bandwidth-efficient and loss-resilient video streaming. The encoding structure of SVC includes one base layer and one or more enhancement layers. H.264 SVC supports layer-scalable coding, which provides temporal scalability, spatial scalability, and quality (SNR) scalability [5]. SVC provides two types of quality scalability, known as coarse grain scalability (CGS) and medium grain scalability (MGS). This paper presents a statistical analysis of CGS-encoded video.

In the CGS strategy, each layer has an independent prediction procedure (all references have the same quality level), in a similar fashion to MPEG-2. In fact, CGS can be regarded as a special case of spatial scalability in which consecutive layers have the same resolution [6]. Coarse grain SNR scalable coding is achieved using the concepts of spatial scalability, and the same interlayer prediction mechanisms are employed. The only difference is that the base and enhancement layers have the same resolution. CGS allows only a few selected bitrates to be supported in a scalable bitstream. In general, the number of supported rate points is identical to the number of layers. Switching between different CGS layers can only be done at defined points in the bitstream [7].

Communication network measurements have indicated that many quantities characterizing network performance have long-tail probability distributions, that is, tails that decay more slowly than exponentially. This long-tail behavior is mostly associated with quantities such as file lengths, call holding times, scene lengths in video streams, and intervals between connection requests in Internet traffic. Long-tail distributions can have a significant effect on performance [8]. For example, long-tail service-time distributions lead to long-tail waiting-time distributions in queues [9]. Performance models with long-tail distributions are usually difficult to analyze and to describe in detail. To address this problem, this study searches for the best distribution among a set of candidate distributions, with the aim of deriving a statistical distribution that fits the real data accurately. The candidates were chosen because they are the distributions most commonly associated with long-tailed data. In other words, these data have highly skewed probability distributions, which are difficult to analyze with the usual statistical models. Therefore, nonsymmetric distributions need to be addressed in this study.

The organization of this paper is as follows. Section 2 presents notable related work on frame size distributions. Section 3 describes the evaluation setup of the video sequences. Section 4 describes the methodology of the study, in which the different distributions and their statistical properties are explained in detail. Section 5 presents the results of fitting these distributions to the data based on statistical criteria. Finally, Section 6 concludes the study.

2. Related Works

Several studies have analyzed video frame sizes. Early work by Heyman et al. [10, 11] and Xu and Huang [12] showed that the marginal distribution of H.261-encoded videoconference traffic, generated by different hardware coders with different coding algorithms, is a gamma distribution. These authors applied this result to design a discrete autoregressive (DAR) model of order one. Krunz and Hughes [13] modeled the frame sizes of video compressed with the MPEG-2 standard. Among the gamma, Weibull, and lognormal distributions, the lognormal distribution gave the best fit to the frame sizes. They fitted the distribution separately to the three frame types: I frames, B frames, and P frames. Fitzek and Reisslein [14] provided a publicly available library of frame size sequences for MPEG-4, H.263, and H.263+ encoded video, together with a detailed statistical analysis of the generated traces. It was shown that movie content produces gamma-like frame size histograms. Poon and Lo [15] proposed a normal mixture distribution for fitting the sample histogram of video traces encoded by H.261 and H.263 and showed that it fits better than simple gamma and lognormal distributions. Lazaris et al. [16] indicated that the gamma and lognormal distributions are not always the best fit for MPEG-4 videoconference traffic and that, for single videoconference sources, the Pearson type V distribution is the best fit among all examined distributions. Koumaras et al. [3] found that the gamma distribution is the best fit for frame sizes compressed with the H.264 standard and that this holds for all three types of video frame. Furthermore, Masi et al. [17] indicated that the Erlang or gamma distributions fit three data sets of actual video frame sizes appropriately. Their data set covers two video compression standards, H.263 and H.264. The best-fitting frame size distributions were used to generate packet streams for packet-level congestion models. Salah et al. [18] found that the gamma and Weibull distributions fit the data well, with the inverse Gaussian distribution ranked second after them.

To model a single-source trace, the best distribution needs to be found. Whereas a few studies have analyzed packet size distributions, this article analyzes the frame sizes produced by the SVC layers.

3. Evaluation Setup of Video Sequences

The data set presented in this paper consists of three video traces with CIF (352 × 288) resolution, a frame rate of 30 frames/second, the GOP pattern G16B15, and quantization parameters (I, P, and B) of 48, N/A, and 48, taken from [19]. A video trace characterizes an encoded video stream by providing the time stamp, frame type (e.g., I, P, or B), frame size (in bytes), and PSNR quality for each encoded frame (and each layer of a scalable encoding). Video traces can be readily fed into simulation models of video transport systems, thus facilitating the evaluation of novel transport mechanisms. The video traces in this study are the following:

(i) NBC News sequence (48,992 frames), 60 minutes long, divided into one base layer and one enhancement layer, in which the frame types are intraframes (I frames) and bidirectional frames (B frames).

(ii) Sony Demo sequence (17,664 frames), 60 minutes long, divided into one base layer and one enhancement layer, with I and B frame types as in the previous sequence.

(iii) Silence of the Lambs sequence (53,984 frames), with properties similar to those of the former sequences.

These sequences have low or moderate scene changes. The encoder used is the JSVM encoder, taken from [19].
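As a minimal sketch of how such a trace can be prepared for the analysis, the following Python fragment groups frame sizes by frame type. The column layout (time stamp, frame type, size in bytes, PSNR) and the sample values are assumptions made for illustration; the actual trace files from [19] may be laid out differently.

```python
from collections import defaultdict

# Hypothetical trace excerpt. The column order (time stamp in seconds,
# frame type, frame size in bytes, PSNR in dB) is an assumption.
SAMPLE_TRACE = """\
0.000000 I 1012 27.4
0.033333 B 388 26.1
0.066667 B 402 26.3
0.533333 I 954 27.0
"""

def frame_sizes_by_type(trace_text):
    """Return {frame_type: [sizes in bytes]} for a whitespace-separated trace."""
    sizes = defaultdict(list)
    for line in trace_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        _time, frame_type, size, _psnr = line.split()
        sizes[frame_type].append(int(size))
    return dict(sizes)

sizes = frame_sizes_by_type(SAMPLE_TRACE)
print(sizes)
```

Each layer of a scalable encoding would be processed the same way, giving one list of frame sizes per layer and frame type.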

4. Methodology

The methodology of this paper is as follows:

(i) Investigate the hypothesized distribution families that are suitable in terms of the overall shape of the data under study.

(ii) Estimate the parameters of the selected distributions by writing code.

(iii) Find the best distribution for the data by goodness-of-fit tests.

(iv) Investigate the autocorrelation function.

Each of these steps is described in detail in the following subsections. Because the distributions studied in this paper are widely used in the literature, they are addressed in terms of both statistical theory and their applications in video traffic modeling.

4.1. Investigating the Different Distributions

Since there are numerous statistical distributions, it is not practical to investigate all of them to find the best one for the data set. Plotting the density function of the data therefore provides a preliminary view of which kinds of distributions should be studied.

Figure 1 shows that the density function of the NBC News frame sizes, for the different layers and frame types, is not symmetric and is highly skewed. The same plots were produced for the other two movies, with the same results. However, owing to space limitations, the results for those movies are not presented at every step in the rest of the article. Hence, the candidate distributions considered for fitting are the exponential, lognormal, logistic, log-logistic, Weibull, gamma, normal, inverse Gaussian, negative binomial, and Pearson family distributions. In this article, the Pearson distribution is described in detail, since its four parameters allow a more appropriate fit.

Figure 1

(a) The probability density function of I frame, in base layer of NBC News. (b) The probability density function of B frame, in base layer of NBC News. (c) The probability density function of I frame, in enhancement layer of NBC News. (d) The probability density function of I frame, in enhancement layer of NBC News.

4.1.1. Pearson Distribution

Since the Pearson distribution is one of the most commonly used distributions in the literature, this four-parameter system of probability density functions, as provided by Abate et al. [9], is explained in more detail here. The four parameters comprise the location, scale, skewness, and kurtosis of the distribution, and they describe the data better than density functions with fewer parameters, because a distribution with more parameters can describe the data more precisely. Depending on conditions discussed below, there are five Pearson distribution types, which are characterized by skewness and kurtosis. Let γ1 and γ2 denote the skewness and kurtosis of the distribution; then the following reparameterization is used:

$$\beta_1 = \gamma_1^2, \qquad \beta_2 = \gamma_2 + 3. \tag{1}$$

A valid solution of the following differential equation specifies a Pearson density family:

$$\frac{p'(x)}{p(x)} + \frac{a + (x-\lambda)}{b_2 (x-\lambda)^2 + b_1 (x-\lambda) + b_0} = 0; \tag{2}$$

therefore,

$$p'(x) = -p(x)\,\frac{a + (x-\lambda)}{b_2 (x-\lambda)^2 + b_1 (x-\lambda) + b_0}. \tag{3}$$

The different types of Pearson distributions can be obtained by solving (3), which is a first-order linear differential equation with variable coefficients. For simplicity, λ is assumed to be zero. Thus, (3) can be rewritten as follows:

$$p'(x) + p(x)\,\frac{a + x}{b_2 x^2 + b_1 x + b_0} = 0. \tag{4}$$

To clarify how this equation is solved, consider a first-order linear differential equation of the general form

$$D\,y(x) + f(x)\,y(x) = g(x), \tag{5}$$

where D denotes differentiation with respect to x. The solution of (5) is derived in detail in Appendix A. Applying it with the assignments in (6) gives (7):

$$f(x) = \frac{a + x}{b_2 x^2 + b_1 x + b_0}, \qquad y(x) = p(x), \qquad g(x) = 0, \tag{6}$$

$$p(x) \propto \exp\left(-\int \frac{x - a}{b_2 x^2 + b_1 x + b_0}\,dx\right). \tag{7}$$

Here and in the remainder of the derivation, the constant a carries the opposite sign to the a in (4), that is, a has been replaced by −a. Equation (7) is the starting point for the different types of Pearson distribution. Depending on the discriminant of the quadratic in the denominator, there are two general cases, which produce the different kinds of Pearson distribution presented in the following subsections.

(A) Case 1: Nonnegative Discriminant. If the discriminant is nonnegative, the quadratic has two real roots, a1 and a2, which may coincide:

$$a_1 = \frac{-b_1 - \sqrt{b_1^2 - 4 b_0 b_2}}{2 b_2}, \qquad a_2 = \frac{-b_1 + \sqrt{b_1^2 - 4 b_0 b_2}}{2 b_2}. \tag{8}$$

A complete solution of (7) is given in Appendix B, with the following result:

$$k = \frac{1}{b_2 (a_1 - a_2)} \;\Longrightarrow\; p(x) \propto (x - a_1)^{-k(a_1 - a)}\,(x - a_2)^{k(a_2 - a)}. \tag{9}$$

Since the density is determined by this expression only up to proportionality, four types of the Pearson distribution can be obtained as follows.

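(I) Pearson Type 1 Distribution. The Pearson type 1 distribution occurs when the roots of the denominator of (7) have opposite signs, a1 < 0 < a2. For a more detailed proof, refer to Appendix C.

$$f(x) = \frac{\Gamma(m_1 + m_2 + 2)}{\Gamma(m_1 + 1)\,\Gamma(m_2 + 1)} \left(\frac{x - \lambda}{a_2 - a_1}\right)^{m_1} \left(1 - \frac{x - \lambda}{a_2 - a_1}\right)^{m_2}. \tag{10}$$

(II) The Pearson Type 2 Distribution. This distribution is the special case of the Beta distribution that is symmetric [20], obtained from (10) with m1 = m2. The probability density function is as follows:

$$f(x) = \frac{\Gamma(2 m_1 + 2)}{\Gamma(m_1 + 1)^2} \left[\frac{x - \lambda}{a_2 - a_1}\left(1 - \frac{x - \lambda}{a_2 - a_1}\right)\right]^{m_1}. \tag{11}$$

(III) The Pearson Type 3 Distribution. If the scale parameter is allowed to be negative (giving negative skewness), the Pearson type 3 distribution is obtained as follows [21]:

$$f(x) = \frac{1}{(a_2 - a_1)^{m_1 + 1}\,\Gamma(m_1 + 1)}\,(x - \lambda)^{m_1}\,e^{-(x - \lambda)/(a_2 - a_1)}. \tag{12}$$

(IV) The Pearson Type 5 Distribution. Finally, the Pearson type 5 distribution is the inverse gamma distribution, in which a negative scale parameter is again allowed [22]. The probability density function is as follows:

$$f(x) = \frac{1}{(a_2 - a_1)^{m_1 + 1}\,\Gamma(m_1 + 1)}\,(x - \lambda)^{-m_1 - 2}\,e^{-(a_2 - a_1)/(x - \lambda)}. \tag{13}$$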

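(B) Case 2: Negative Discriminant (Pearson Type 4 Distribution). In this case the discriminant of the quadratic in the denominator of (7) is negative (b1² − 4b2b0 < 0), so the denominator has no real roots. A detailed proof for Pearson type 4 is given in Appendix D.

$$p(x) = \frac{\left|\dfrac{\Gamma\!\left(m + \frac{\nu}{2} i\right)}{\Gamma(m)}\right|^2}{\alpha\,\mathrm{B}\!\left(m - \frac{1}{2}, \frac{1}{2}\right)} \left[1 + \left(\frac{x - \lambda}{\alpha}\right)^2\right]^{-m} \exp\left[-\nu \arctan\left(\frac{x - \lambda}{\alpha}\right)\right]. \tag{14}$$

This is the Pearson type IV distribution [23].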

In summary, there are five types of Pearson distribution, depending on whether the discriminant of the quadratic function in the differential equation for p(x) is nonnegative or negative. Furthermore, many well-known distributions are special cases of these Pearson types. Table 1 summarizes these distributions.

Table 1

The special cases of Pearson types.

Distribution	Form of distribution	Type
Beta distribution	$f(x; \alpha, \beta) = \frac{1}{\mathrm{B}(\alpha, \beta)}\,x^{\alpha - 1}(1 - x)^{\beta - 1}$	1
Cauchy distribution	$f(x; x_0, \gamma) = \frac{1}{\pi}\,\frac{\gamma}{(x - x_0)^2 + \gamma^2}$	4
Chi-square distribution	$f(x; k) = \frac{x^{k/2 - 1} e^{-x/2}}{2^{k/2}\,\Gamma(k/2)}$	3
Continuous uniform distribution	$f(x) = \frac{1}{b - a},\ a \le x \le b$	1
Gamma distribution	$f(x; k, \theta) = \frac{1}{\theta^k}\,\frac{1}{\Gamma(k)}\,x^{k - 1} e^{-x/\theta}$	3
Inverse-chi-squared distribution	$f(x; k) = \frac{2^{-k/2}}{\Gamma(k/2)}\,x^{-k/2 - 1} e^{-1/(2x)}$	5
Inverse Gamma distribution	$f(x; \alpha, \beta) = \frac{\beta^{\alpha}}{\Gamma(\alpha)}\,x^{-\alpha - 1} e^{-\beta/x}$	5
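The case split on the discriminant described above (Case 1 with real roots a1 and a2, Case 2 with no real roots) can be sketched in a few lines of Python. The function name and the example coefficients are hypothetical and serve only to illustrate equation (8); this is not the code used in the study.

```python
import math

def denominator_roots(b0, b1, b2):
    """Classify the quadratic b2*x^2 + b1*x + b0 by its discriminant.

    Returns a label and, for a nonnegative discriminant, the roots
    (a1, a2) as defined in equation (8).
    """
    if b2 == 0:
        raise ValueError("b2 must be nonzero for a quadratic denominator")
    disc = b1 * b1 - 4.0 * b0 * b2
    if disc < 0:
        return "case 2 (no real roots)", None
    root = math.sqrt(disc)
    a1 = (-b1 - root) / (2.0 * b2)
    a2 = (-b1 + root) / (2.0 * b2)
    return "case 1 (real roots)", (a1, a2)

# Illustrative coefficients only: x^2 + x - 2 = (x + 2)(x - 1).
print(denominator_roots(b0=-2.0, b1=1.0, b2=1.0))
# Illustrative coefficients only: x^2 + 1 has a negative discriminant.
print(denominator_roots(b0=1.0, b1=0.0, b2=1.0))
```

In the first example the roots have opposite signs (a1 < 0 < a2), which is the condition stated for the Pearson type 1 distribution.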

With the different distributions described, parameter estimation is the second step of the methodology; it is addressed in the next subsection.

4.2. Estimating the Parameters

In statistics, there are three general estimation methods:

(i) Maximum likelihood estimation method [24].

(ii) Least squares estimation method.

(iii) Moment estimation method.

In most statistical analyses, the maximum likelihood estimation (MLE) method is used to estimate parameters because, as stated by Myung [25], it produces unbiased, consistent, and efficient estimates. According to Kelton and Law [26], the maximum likelihood estimates of a distribution's parameters are the parameter values that maximize the likelihood of the distribution given a set of observational data. Given a set of observational data x = (x1, ..., xn) and a probability density function (PDF) f, the likelihood function is

$$l = \prod_{i=1}^{n} f(x_i;\ \text{parameters}). \tag{15}$$

MLE determines the values of the parameters that maximize the function l. This paper therefore also uses the MLE method for parameter estimation.
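To make the likelihood function (15) concrete, the sketch below evaluates its logarithm for the exponential distribution, whose maximizer has the closed form rate = 1 / sample mean. The frame sizes are made-up values, and the exponential case is chosen only because it needs no numerical optimizer; the other distributions in this study generally require one.

```python
import math

def exp_log_likelihood(rate, data):
    """Log of the likelihood (15) for the PDF f(x) = rate * exp(-rate * x)."""
    return sum(math.log(rate) - rate * x for x in data)

def exp_mle_rate(data):
    """Closed-form maximizer of the exponential likelihood: 1 / sample mean."""
    return len(data) / sum(data)

# Made-up frame sizes in bytes, for illustration only.
frame_sizes = [412.0, 388.0, 1012.0, 954.0, 530.0, 677.0]

rate_hat = exp_mle_rate(frame_sizes)
ll_hat = exp_log_likelihood(rate_hat, frame_sizes)
print(f"estimated rate: {rate_hat:.6f}")
print(f"log-likelihood at the estimate: {ll_hat:.3f}")

# Moving the rate away from the estimate lowers the log-likelihood.
for factor in (0.9, 1.1):
    print(factor, exp_log_likelihood(rate_hat * factor, frame_sizes))
```

Working with the logarithm of l avoids numerical underflow when the product in (15) runs over thousands of frames, and it has the same maximizer.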

4.3. Find the Best Distribution

The third step of the methodology is finding the best distribution for the data set. In statistics, there are two common approaches to fitting the best distribution to the data, graphical methods and hypothesis testing, which are explained in the next subsections.

4.3.1. Graphical Methods

The graphical method is a way to represent the results of the fitting process; it visually evaluates how well a distribution matches the input data. The most widely used and powerful graphical goodness-of-fit tool is the quantile-quantile (Q-Q) plot, which computes the quantile values of two probability distributions and plots them against each other. A point (x, y) on the plot corresponds to one of the quantiles of the data (x-axis) plotted against the same quantile of the considered distribution (y-axis). Therefore, if the fit is good, the points of the plot lie approximately along the line y = x (the 45° reference line).
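The points of a Q-Q plot can be computed without any plotting library. The following sketch pairs sorted observations with quantiles of an exponential model. The plotting position (i − 0.5)/n is one common convention and is an assumption here, as are the made-up frame sizes.

```python
import math

def qq_points_exponential(data, rate):
    """Pair each sorted observation with the matching exponential quantile."""
    xs = sorted(data)
    n = len(xs)
    points = []
    for i, x in enumerate(xs, start=1):
        p = (i - 0.5) / n               # plotting position (assumed convention)
        q = -math.log(1.0 - p) / rate   # exponential quantile function
        points.append((x, q))           # (data quantile, model quantile)
    return points

# Made-up frame sizes in bytes, for illustration only.
frame_sizes = [412.0, 388.0, 1012.0, 954.0, 530.0, 677.0]
rate = len(frame_sizes) / sum(frame_sizes)

for x, q in qq_points_exponential(frame_sizes, rate):
    print(f"data quantile {x:8.1f}   model quantile {q:8.1f}")
```

Points far from the line y = x indicate quantiles where the model and the data disagree.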

4.3.2. Hypothesis Test

Generally, in statistics, there are three different kinds of statistical hypotheses [27]:

(a) Comparing two models, which are placed in H0 and H1.

(b) Testing parameters of a hypothesized model, which are put in H0 and H1.

(c) Comparing two different distributions, which are stated in H0 and H1.

The last one is the goal of this study. To test any of these hypotheses, an appropriate test statistic, a variable with a specified distribution, needs to be established. One popular statistic for the hypothesis considered in this study is the Kolmogorov-Smirnov (K-S) statistic, which is constructed from the cumulative distribution functions. The K-S test is a nonparametric, distribution-free test [28]. It uses the maximum vertical deviation between the two cumulative distribution curves and can be defined in a manner similar to that described in [16]:

$$D = \sup_x \left|F(x) - G(x)\right|, \tag{16}$$

where F(x) is the empirical distribution function of the real video traffic and G(x) is the cumulative distribution function of the model.

Small values of D lead to accepting the null hypothesis in this study:

$$H_0\colon F(x) = G(x) \quad \text{versus} \quad H_1\colon F(x) \ne G(x). \tag{17}$$

The decision is based on the p value. If the p value is less than 0.05, the usual significance level, the null hypothesis H0 is rejected. The model distribution G(x) is a suitable probability function for the data if the p value is larger than 0.05.
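The statistic D in (16) and the decision rule based on the p value can be sketched as follows. The p value uses the asymptotic Kolmogorov series, which is an approximation for small samples and tends to be optimistic when the model parameters were estimated from the same data. The function names and the tiny data set are illustrative assumptions, not the procedure implemented in R for this study.

```python
import math

def ks_statistic(data, model_cdf):
    """D = sup_x |F(x) - G(x)|, with F the empirical CDF and G the model CDF."""
    xs = sorted(data)
    n = len(xs)
    d = 0.0
    for i, x in enumerate(xs):
        g = model_cdf(x)
        # The empirical CDF jumps from i/n to (i+1)/n at x; check both sides.
        d = max(d, abs((i + 1) / n - g), abs(g - i / n))
    return d

def ks_p_value_asymptotic(d, n, terms=100):
    """Asymptotic Kolmogorov approximation of the p value."""
    t = math.sqrt(n) * d
    if t < 0.2:
        return 1.0  # the series converges slowly here; the p value is ~1
    s = sum((-1) ** (k - 1) * math.exp(-2.0 * k * k * t * t)
            for k in range(1, terms + 1))
    return max(0.0, min(1.0, 2.0 * s))

# Illustrative data tested against a uniform model on [0, 1].
data = [0.1, 0.4, 0.7]
d = ks_statistic(data, lambda x: min(max(x, 0.0), 1.0))
p = ks_p_value_asymptotic(d, len(data))
print(f"D = {d:.4f}, p value = {p:.4f}")
print("reject H0" if p < 0.05 else "do not reject H0")
```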

4.4. Autocorrelation Function

Capturing the ACF structure of VBR video traffic is more challenging because VBR traces exhibit both long-range dependent (LRD) and short-range dependent (SRD) properties. If the autocorrelation function decays exponentially fast, the source is referred to as an SRD process; if it decays slowly, the source is referred to as an LRD process.
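A sample autocorrelation function can be computed directly from a frame size sequence. The sketch below uses the standard biased estimator, and the short periodic series is a made-up stand-in for one large I frame followed by smaller B frames, not real trace data.

```python
def sample_acf(series, max_lag):
    """Sample autocorrelation for lags 0..max_lag (biased estimator)."""
    n = len(series)
    mean = sum(series) / n
    dev = [x - mean for x in series]
    c0 = sum(v * v for v in dev)
    if c0 == 0:
        raise ValueError("a constant series has undefined autocorrelation")
    return [sum(dev[t] * dev[t + k] for t in range(n - k)) / c0
            for k in range(max_lag + 1)]

# Made-up periodic pattern: one large frame followed by three small ones.
series = [10.0, 2.0, 2.0, 2.0] * 8
acf = sample_acf(series, max_lag=8)
for lag, value in enumerate(acf):
    print(f"lag {lag}: {value:+.3f}")
```

For this toy series the autocorrelation peaks at lags that are multiples of the period, which mirrors how a fixed GOP pattern produces periodic peaks in the ACF of a real trace.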

As can be seen from Figure 2, the autocorrelation remains high even for large lags and decays very slowly; both facts are a clear indication of the importance of LRD. The strong autocorrelation coefficients are due to the periodic repetition of I, B, and P frames, and the autocorrelation function has a very slow exponential decrease.

Figure 2

(a) and (b) ACF of NBC News base layer and enhancement layer; (c) and (d) ACF of Sony Demo base layer and enhancement layer.

5. Result and Discussion

As mentioned, the SVC video traces in this paper include two layers, the base layer and the enhancement layer, and each layer consists of I frames and B frames. The video traces considered belong to NBC News, Silence of the Lambs, and Sony Demo, with low or moderate scene changes. To fit the different distributions and choose the best one, the data were analyzed in the statistical software R [29], which provides the K-S statistic, its p value, and the Q-Q plots. As a first step, the distributions were evaluated graphically using the Q-Q plot and the K-S test.

5.1. Q-Q Plot

All distributions under study are fitted to the data to determine which one is closest to the actual data. The Q-Q plot is used to detect the graph closest to the reference line, which indicates the best fit.

As can be seen from Figure 3, the exponential distribution lies farther from the reference line than the other distributions in both panels. Graphically, the Pearson curve is considerably nearer to the reference line than the other curves. Although the tails deviate for larger frame sizes, the Pearson distribution can be considered the best fit. In other words, the Pearson type IV distribution, a four-parameter distribution that describes the data with a high degree of accuracy, is the best distribution for NBC News in all cases.

Figure 3

(a) The Q-Q plot for I frame, base layer of NBC News. (b) The Q-Q plot for B frame, base layer of NBC News.

5.2. K-S Test

To validate the previous results, another test, the K-S test, was performed on all of the examined data. This test can determine whether two data sets are significantly different. One advantage of the K-S test is that it makes no assumption about the distribution of the data [16]. The closer the graph is to the reference line, the smaller the K-S statistic.

Figure 4 shows that the K-S graph of the Pearson distribution is the closest to the reference line. This was confirmed by the value of the K-S statistic, which was 0.0502 for the NBC News base layer and 0.0294 for the NBC News enhancement layer.

Figure 4

(a) The K-S test of the frame size of I frames in the NBC News base layer and (b) the K-S test of the frame size of I frames in the NBC News enhancement layer.

5.3. Statistical Evaluation

To confirm the visual findings of the previous sections statistically, a statistical test was performed. A lower K-S value indicates a better fit. This can be verified with the p value: the highest p value belongs to the best fit.

Table 2 indicates that the best distribution fitted to the data is the Pearson type IV distribution for both the base layer and the enhancement layer of NBC News (I and B frames). These findings were derived from the K-S statistics and the p values. For the I frames in the base layer of NBC News, the K-S statistic and p value of the Pearson type IV distribution are 0.0502, which is the lowest, and 0.1105, which is the highest among all the distributions. Similarly, the corresponding values for the B frames are 0.0223 and 0.2214, respectively. Since the p value exceeds 0.05 (the significance level of the test) and is better than the others, Pearson type IV is taken as the best distribution for this sequence of frame sizes.

Table 2

The statistical evaluation for the distributions of NBC News.

Frame type and layer	Distribution	Parameters	K-S statistic	p value	Best fit
I frame in base layer	Pearson IV	m = 53.5727, nu = -186.5132, location = -2052.722, scale = 1714.155	0.0292	0.1105	Pearson IV
	Weibull	Shape = 3.1016, scale = 1100.322	0.0335	0.0014
	Gamma	Shape = 7.1376, rate = 0.0072	0.0433	2.20E-16
	Exponential	Rate = 1.0157E-03	0.3341	2.20E-16
	Lognormal	Location = 6.8204, shape = 0.4116	0.0628	7.71E-11
	Normal	Location = 984.4909, scale = 342.2462	0.5146	0.0049
	Logistic	Location = 979.0349, scale = 195.6807	0.0547	0.0013
B frame in base layer	Pearson IV	m = 3.4146, nu = -11.9247, location = -20.0412, scale = 171.6336	0.0252	0.2214
	Weibull	Shape = 1.9873, scale = 456.1013	0.0327	1.47E-14
	Gamma	Shape = 3.9226, rate = 0.0097	0.0679	2.20E-16
	Exponential	Rate = 2.4851E-03	0.3176	2.20E-16
	Lognormal	Location = 5.8645, shape = 0.5268	0.0946	0.0668
	Normal	Location = 984.4909, scale = 342.2462	0.5027	2.20E-16
	Logistic	Location = 374.6126, scale = 114.4575	0.0865	2.20E-16

For the I frames in the enhancement layer of NBC News, the lowest K-S statistic, 0.0332, and the highest p value, 0.0474, belong to Pearson type IV. Similarly, for the B frames, those values are 0.0204 and 0.2824, respectively, which indicates that the Pearson type IV distribution is the best. The same analysis was carried out for both Silence of the Lambs and Sony Demo, for the I and B frames of the base layer and the enhancement layer; owing to space limitations, those results are not shown. The results show that where all p values are below 0.05, none of the distributions fits the data well; in that case, the distribution with the lowest K-S statistic or the highest p value can still be chosen as the most appropriate one for the data. Among the skewed distributions mentioned above, the Pearson distribution gave the best fit for all video traces in the study (for both I frames and B frames).

6. Conclusion

In this study, the frame types and the corresponding layers of SVC video traces were taken as the data set, for which different distributions were investigated. The density function of the frame sizes is right-skewed. This also holds for the other video traces in the study, although space limitations do not allow all of them to be shown here. As discussed, the Pearson distributions are appropriate for fitting this data set. To reach this finding, the analysis started by plotting the density function of the data to gain some information about it. Then, each distribution was fitted to the data and assessed with the Q-Q plot and the K-S test. Finally, the best distribution was chosen based on the K-S statistic and the p value, in agreement with the Q-Q plot. Although some studies have chosen the Pearson distributions as the best distribution, those results were for H.264 and MPEG encoded video data. It is worth noting that, except for the B frame trace of the base layer, this study identifies the Pearson distributions as the best ones for SVC video data for the first time, which is the first contribution of this paper. This result has been used to provide a traffic model in [30] by the same author.

To the best of our knowledge, no existing study explains all of the relevant distributions in detail in terms of the meaning of their parameters and their theory. The work presented in this paper can therefore be regarded as a practical collection of statistical distributions applied in computer science, which is the second contribution of this paper. Nevertheless, for data sets and situations in which none of these distributions can be chosen with regard to all of the statistical criteria, applying a transformation or a mixture distribution to the data is left as future work.

Appendix

A. Proof of Equation (5)

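A first-order linear differential equation has the following general form, which is equation (5):

$$D\,y(x) + f(x)\,y(x) = g(x). \tag{A.1}$$

Both sides of this equation are multiplied by $e^{\int f(x)\,dx}$, which gives

$$\left[D\,y(x)\right] e^{\int f(x)\,dx} + f(x)\,y(x)\,e^{\int f(x)\,dx} = g(x)\,e^{\int f(x)\,dx}. \tag{A.2}$$

By the product rule of differentiation, this equals

$$D\left[y(x)\,e^{\int f(x)\,dx}\right] = g(x)\,e^{\int f(x)\,dx}; \tag{A.3}$$

then

$$y(x)\,e^{\int f(x)\,dx} = \int g(x)\,e^{\int f(x)\,dx}\,dx + c, \qquad y(x) = \frac{\int g(x)\,e^{\int f(x)\,dx}\,dx + c}{e^{\int f(x)\,dx}}. \tag{A.4}$$

The expression (7) follows from comparing (6) with

$$y(x) = e^{-\int f(x)\,dx}\left[\int g(x)\,e^{\int f(x)\,dx}\,dx + k\right]. \tag{A.5}$$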
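The last step can be written out explicitly. As a short worked sketch, setting g(x) = 0 and y(x) = p(x) in (A.5), as assumed in (6), removes the inner integral up to a constant that is absorbed into k, so that only the exponential factor remains:

```latex
\begin{align*}
y(x) &= e^{-\int f(x)\,dx}\left[\int 0 \cdot e^{\int f(x)\,dx}\,dx + k\right]
      = k\,e^{-\int f(x)\,dx}, \\
p(x) &\propto \exp\!\left(-\int f(x)\,dx\right).
\end{align*}
```

The constant k is fixed afterwards by requiring the density to integrate to one, which is why the result is stated only up to proportionality.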

B. The Solution of Equation (7)

This appendix covers the solution of (7) in more detail. When the discriminant is positive, there are two real roots, a1 and a2. Therefore, the quadratic function in the denominator can be rewritten as follows:

$$b_2 x^2 + b_1 x + b_0 = b_2 (x - a_1)(x - a_2). \tag{B.1}$$

Rewriting (7) as

$$p(x) \propto \exp\left(-\frac{1}{b_2} \int \frac{x - a}{(x - a_1)(x - a_2)}\,dx\right), \tag{B.2}$$

the integral is evaluated by partial fractions:

$$\int \frac{x - a}{(x - a_1)(x - a_2)}\,dx = \int \frac{a_1 - a}{(x - a_1)(a_1 - a_2)}\,dx + \int \frac{a - a_2}{(x - a_2)(a_1 - a_2)}\,dx = \frac{(a_1 - a)\ln(x - a_1) + (a - a_2)\ln(x - a_2)}{a_1 - a_2} + c. \tag{B.3}$$

With k = 1/(b2(a1 − a2)), (7) can be rewritten as $p(x) \propto (x - a_1)^{-k(a_1 - a)}\,(x - a_2)^{k(a_2 - a)}$.

C. The Proof of Pearson Distribution Type 1

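This appendix presents the proof of Pearson type 1 for the benefit of the reader. To find the density function of X, X is written as a linear function of Y, whose density function is given in [31]:

$$X = a_1 + y\,(a_2 - a_1), \quad 0 < y < 1, \qquad p(y) \propto \left(\frac{a_1 - a_2}{a_1}\,y\right)^{-k(a_1 - a)} \left(\frac{a_2 - a_1}{a_2}\,(1 - y)\right)^{k(a_2 - a)}, \qquad p(y) \propto y^{m_1} (1 - y)^{m_2}, \tag{C.1}$$

where $m_1 = \dfrac{a - a_1}{b_2 (a_1 - a_2)}$ and $m_2 = \dfrac{a - a_2}{b_2 (a_2 - a_1)}$ are the shape parameters. Hence

$$\frac{X - \lambda}{a_2 - a_1} \sim \mathrm{Beta}(m_1 + 1,\ m_2 + 1), \tag{C.2}$$

where $\lambda = \mu_1 - (a_2 - a_1)\dfrac{m_1 + 1}{m_1 + m_2 + 2} - a_1$ is a location parameter and $a_2 - a_1$ is the scale parameter. The Pearson type 1 density is therefore

$$f(x) = \frac{\Gamma(m_1 + m_2 + 2)}{\Gamma(m_1 + 1)\,\Gamma(m_2 + 1)} \left(\frac{x - \lambda}{a_2 - a_1}\right)^{m_1} \left(1 - \frac{x - \lambda}{a_2 - a_1}\right)^{m_2}. \tag{C.3}$$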

D. The Proof of Pearson Distribution Type 4

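In this case, there are no real roots, but by defining new variables as below, the Pearson type 4 distribution is produced and p(x) can be obtained:

$$y = x + \frac{b_1}{2 b_2}, \qquad \alpha = \frac{\sqrt{4 b_2 b_0 - b_1^2}}{2 b_2}. \tag{D.1}$$

So if

$$h(x) = b_2 x^2 + b_1 x + b_0, \quad \text{then} \quad h(x) = b_2\left(y^2 + \alpha^2\right), \tag{D.2}$$

$$p(y) \propto \exp\left(-\frac{1}{b_2} \int \frac{y - \frac{b_1}{2 b_2} - a}{y^2 + \alpha^2}\,dy\right). \tag{D.3}$$

The integral can be solved as follows:

$$\int \frac{y - \frac{b_1}{2 b_2} - a}{y^2 + \alpha^2}\,dy = \frac{1}{2}\ln\left(y^2 + \alpha^2\right) - \frac{2 b_2 a + b_1}{2 b_2 \alpha}\arctan\left(\frac{y}{\alpha}\right) + c_0; \tag{D.4}$$

therefore

$$p(y) \propto \left[1 + \frac{y^2}{\alpha^2}\right]^{-m} \exp\left[-\nu \arctan\left(\frac{y}{\alpha}\right)\right], \tag{D.5}$$

where

$$m = \frac{1}{2 b_2}, \qquad \nu = -\frac{2 b_2 a + b_1}{2 b_2^2 \alpha}. \tag{D.6}$$

As mentioned, λ was set to zero in the first equation for simplicity; including it again gives the Pearson type 4 density, equation (14):

$$p(x) = \frac{\left|\dfrac{\Gamma\!\left(m + \frac{\nu}{2} i\right)}{\Gamma(m)}\right|^2}{\alpha\,\mathrm{B}\!\left(m - \frac{1}{2}, \frac{1}{2}\right)} \left[1 + \left(\frac{x - \lambda}{\alpha}\right)^2\right]^{-m} \exp\left[-\nu \arctan\left(\frac{x - \lambda}{\alpha}\right)\right]. \tag{D.7}$$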

Competing Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This work was supported in part by Universiti Sains Malaysia through Grant no. 304/PNAV/650817/C127.

References

[1] Gupta, Pulipaka, Seeling, Karam L. J., Reisslein. H.264 coarse grain scalable (CGS) and medium grain scalable (MGS) encoded video: a trace based traffic and quality evaluation. IEEE Transactions on Broadcasting, vol. 58, no. 3, pp. 428–439, 2012. doi:10.1109/tbc.2012.2191702; 2-s2.0-84865338036.

[2] Tanwir, Perros. A survey of VBR video traffic models. IEEE Communications Surveys and Tutorials, vol. 15, no. 4, pp. 1778–1802, 2013. doi:10.1109/surv.2013.010413.00071; 2-s2.0-84888357894.

[3] Koumaras, Skianis, Gardikis, Kourtis. Analysis of H.264 video encoded traffic. Proceedings of the 5th International Network Conference (INC '05), Samos, Greece, July 2005, pp. 441–448. 2-s2.0-84905233340.

[4] Lin C.-M., Zao J. K., Peng W.-H., C.-C., Chen H.-M., Yang C.-K. Bandwidth efficient video streaming based upon multipath SVC multicasting. Proceedings of the International Wireless Communications and Mobile Computing Conference (IWCMC '08), Crete Island, Greece, August 2008, pp. 406–412. doi:10.1109/iwcmc.2008.71; 2-s2.0-52949131572.

[5] Schwarz, Marpe, Wiegand. Overview of the scalable video coding extension of the H.264/AVC standard. IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 9, pp. 1103–1120, 2007. doi:10.1109/tcsvt.2007.905532; 2-s2.0-34748835762.

[6] Unanue, Urteaga, Husemann, Del Ser, Roesler, Rodriguez, Sanchez, Del Ser Lorente. A tutorial on H.264/SVC scalable video coding and its tradeoff between quality, coding efficiency and performance. Recent Advances on Video Coding, chapter 1, InTech, Rijeka, Croatia, 2011. doi:10.5772/19227.

[7] Shahid, Chaumont, Puech. Scalable video coding. Effective Video Coding for Multimedia Applications, InTech, 2011. doi:10.5772/14739.

[8] Feldmann, Whitt. Fitting mixtures of exponentials to long tail distributions to analyze network performance models. Proceedings of the 16th Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOM '97), Kobe, Japan, April 1997.

[9] Abate, Choudhury G. L., Whitt. Waiting-time tail probabilities in queues with long-tail service-time distributions. Queueing Systems, vol. 16, no. 3, pp. 311–338, 1994. doi:10.1007/bf01158960; 2-s2.0-21344494520.

[10] Heyman D. P., Tabatabai, Lakshman T. V. Statistical analysis and simulation study of video teleconference traffic in ATM networks. IEEE Transactions on Circuits and Systems for Video Technology, vol. 2, no. 1, pp. 49–59, 1992. doi:10.1109/76.134371; 2-s2.0-0026837808.

[11] Heyman, Lakshman T. V., Tabatabai, Heeke. Modeling teleconference traffic from VBR video coders. Proceedings of the IEEE International Conference on Communications (ICC '94), Conference Record, ‘Serving Humanity through Communications’ (SUPERCOMM/ICC '94), May 1994, pp. 1744–1748. doi:10.1109/ICC.1994.368735.

[12] Huang. A gamma autoregressive video model on ATM networks. IEEE Transactions on Circuits and Systems for Video Technology, vol. 8, no. 2, pp. 138–142, 1998. doi:10.1109/76.664098; 2-s2.0-0032049867.

[13] Krunz, Hughes. A traffic model for MPEG-coded VBR streams. ACM SIGMETRICS Performance Evaluation Review, vol. 23, no. 1, pp. 47–55, 1995. doi:10.1145/223586.223592.

[14] Fitzek F. H. P., Reisslein. MPEG-4 and H.263 video traces for network performance evaluation. IEEE Network, vol. 15, no. 6, pp. 40–54, 2001. doi:10.1109/65.967596; 2-s2.0-0035510001.

[15] Poon W.-C., K.-T. A refined version of M/G/∞ processes for modelling VBR video traffic. Computer Communications, vol. 24, no. 11, pp. 1105–1114, 2001. doi:10.1016/S0140-3664(00)00325-X; 2-s2.0-0035875946.

[16] Lazaris, Koutsakis, Paterakis. A new model for video traffic originating from multiplexed MPEG-4 videoconference streams. Performance Evaluation, vol. 65, no. 1, pp. 51–70, 2008. doi:10.1016/j.peva.2007.02.004; 2-s2.0-35549012037.

[17] Masi D. M. B., Fischer M. J., Garbin D. A. Video frame size distribution analysis. The Telecommunications Review, vol. 19, pp. 74–86, 2008.

[18] Salah, Al-Haidari, Omar M. H., Chaudhry. Statistical analysis of H.264 video frame size distribution. IET Communications, vol. 5, no. 14, pp. 1978–1986, 2011. doi:10.1049/iet-com.2010.0868; 2-s2.0-80053548499.

[19] Video Trace Library, http://trace.eas.asu.edu/

[20] Garcia J. A. D., Gutierrez-Jaimez. Matricvariate and matrix multivariate Pearson type II distributions. https://arxiv.org/abs/1011.5083v1

[21] Olshen A. C. Transformations of the Pearson type III distribution. The Annals of Mathematical Statistics, vol. 9, no. 3, pp. 176–200, 1938. doi:10.1214/aoms/1177732309.

[22] Elderton W. P., Johnson N. L. Systems of frequency curves. Cambridge University Press, London, UK, 1969. MR0258166.

[23] Heinrich. A Guide to the Pearson Type IV Distribution, 2004. http://www-cdf.fnal.gov/physics/statistics/notes/cdf6820_pearson4.pdf

[24] Nagahara. The PDF and CF of Pearson type IV distributions and the ML estimation of the parameters. Statistics and Probability Letters, vol. 43, no. 3, pp. 251–264, 1999. doi:10.1016/S0167-7152(98)00265-X; ZBL0930.62013; 2-s2.0-0039350903.

[25] Myung I. J. Tutorial on maximum likelihood estimation. Journal of Mathematical Psychology, vol. 47, no. 1, pp. 90–100, 2003. doi:10.1016/S0022-2496(02)00028-7; MR1982757; ZBL1023.62112; 2-s2.0-0038172395.

[26] Kelton W. D., Law A. M. Simulation Modeling and Analysis. McGraw-Hill, Boston, Mass, USA, 2000. MR630193.

[27] Casella, Berger R. L. Statistical inference. Journal of Time Series Analysis, 2001, 9.

[28] Asai, Dashzeveg. A distribution-free test for symmetry with an application to S&P index returns. Applied Economics Letters, vol. 15, no. 6, pp. 461–464, 2008. doi:10.1080/13504850600706438; 2-s2.0-42549092363.

[29] Stockwell. Niche Modeling Predictions from Statistical Distributions. Chapman & Hall/CRC Mathematical and Computational Biology Series, Chapman & Hall/CRC Taylor & Francis, Boca Raton, Fla, USA, 2007. MR2374574.

[30] Ahmadpour, Toghrayee, Wan T.-C. An extended discrete autoregressive model for variable bit rate video traffic encoded by scalable video codec. International Journal of Communication Systems, vol. 28, no. 18, pp. 2239–2254, 2015. doi:10.1002/dac.3011; 2-s2.0-84958603197.

[31] Pearson. Mathematical contributions to the theory of evolution.—X. Supplement to a memoir on skew variation. Proceedings of the Royal Society of London, vol. 68, no. 442–450, pp. 372–373, 1901. doi:10.1098/rspl.1901.0060.